Gene Namu_0890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0890 
Symbol 
ID8446482 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp981044 
End bp982927 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content70% 
IMG OID645040027 
ProductPTS system, beta-glucoside-specific IIABC subunit 
Protein accessionYP_003200290 
Protein GI258651134 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR00830] PTS system, glucose subfamily, IIA component
[TIGR01995] PTS system, beta-glucoside-specific IIABC component 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACCG CGAGCAAGTA CGACGTACTG GCCGACGCGA TTCTCACTGG GGTGGGCGGT 
GAGTCGAACG TCAAGACCGT CGCGCACTGC GCGACCCGAC TGCGGTTCCA GCTCAACGAC
CGGTCGAAGG CGAACAAGGA GGCGGTGGAG GCGACGCCCG GGGTCATCAC CGTGGTCGAG
GCCGGCGGCC AATTCCAGGT CGTCATCGGC AACACCGTCA ACAACGTCTA CGACGCCATG
GTCGCCCGGT CCTCGGTCAG CACCGGCGGC ACGGCCTCCG GCGGCCTGCT GGGCCGGTTC
ATCGACCTGA TCACCAGCAT CTTCACTCCC CTGCTGTGGG TGCTGGCCGG CACCGGACTG
CTCAAGGCCC TGCTGGCGGT CGGCGTGAAG ATCGCCCCCG AGTTCGCCAC CACCTCGACC
TACGCGATCC TGTTCGCCGC CGGTGACGCC GCCTTCCAGT TCCTGCCGTT CCTGCTGGCC
GTCACCGCGG CCAAGAAGTT CAAGGCCAAC CCGTTCACCG CGCTGGCCGT CGTCGGGGCG
CTGGTCTACT CGGCCACCGT CGCGGTCATT CCCGGCGCGG ACGGCGTGAC CACGACCCTC
AAGGCGTTTG CCGACGGCGG GGGCGAGCTG ACCTTCCTGG GCGTCCCGGT GGTCATGGTC
AGCTACCTGT CGGCGGTCAT CCCGACCATC CTCGCGGTGT GGATCCAGTC GCTGGTCGAG
CGGTTCCTGA CCCGGGTCGT CCCGGAGACC ATCCGTAACT TCACCATCCC GCTCGTCACG
GTCGCGGTCG TCGTCCCGCT CACCTTCCTG GCCATCGGCC CCGCCTCGTA CTACCTGGGC
GATGCGCTGT CCGCCGGGGT CAACTGGCTG TGGAACCTCT CGCCGGCCCT GGGCGGTCTG
ATCCTGGGCG GGACCTGGGA ACTGATGGTG ATCTTCGGCC TGCACTGGGG TTTCGTGCCG
GTGATGATCC AGGACATTGC CACGCAGGGG TATTCACCGC TGACCGGTCC GCTGTTCCCG
GCCGTGCTGG CCATCTCCGG GGCGGCGTTC GGGGTATGGC TCAAGACCCG CAACTCCGAT
CTGCGCAAGA TCGCCGGCCC GGCCACCATC TCGGCCTTCC TGGCCGGCAT CACCGAGCCC
GCCATCTACG GCGTGGTGCT GCGGCTCAAG CGTCCGTTCA TCTTCGCGCT GATCGGCGGC
GCGGTGGGCG GCGCCATCGC CGCCTTCGGC GGGTCGGCGG CCGAAGGCTT CGTGCTGCCC
GGCGCGATCA CCCTGACCTC GACGCTGAAC ATCGGCAACT TCACCCTGCA GCTGATCGGC
TCGGCCCTGG CCATCGTCGT CGCCTTCGGC CTGACCATGG TCTTCGGCTT CAAGGACCTG
CCGAACGCGG CCGCCGACGG CCCCACCGCC ACCACCCCGG GCGAGGTGGC CTCGCAGGCC
CTGCCGGTAC AGGCGCCGGT CGCCGGGCAG GTCGTGGCTC TCGACCAGGT GCCGGACAAG
GTGTTCTCCT CCGGCGCGCT GGGCAAGGGC CTGGCCGTCA TCCCGACCGA GGGCAAGGCC
TTCGCCCCGA TCGGCGGCAC CCTGCTCACC GTGATGCCGC ACGCCTTCGG GCTGCGCGAC
GAGAACGGCC TGGAGGTGCT CGTGCACATC GGGCTGGACA CCGTCGAGCT GGGCGGCACC
CACTTCACCC CGGCGGTCAG CCAGGGCCAG CAGGTCCGGG CGGGCGACCT GCTCGGCGAG
TTCGACATCG CCGCCATCGA GCAGGCCGGC TACAACCCGA TCACGGTGAT GATCGTGACC
AACCCGGGCG CCTACCAGGC CGTCGTGCCG GTGGCCGCCG GGACCGTCGA GGCCAAGGCG
CTGGCCCTGG ACCTCGTGGG CTGA
 
Protein sequence
MTTASKYDVL ADAILTGVGG ESNVKTVAHC ATRLRFQLND RSKANKEAVE ATPGVITVVE 
AGGQFQVVIG NTVNNVYDAM VARSSVSTGG TASGGLLGRF IDLITSIFTP LLWVLAGTGL
LKALLAVGVK IAPEFATTST YAILFAAGDA AFQFLPFLLA VTAAKKFKAN PFTALAVVGA
LVYSATVAVI PGADGVTTTL KAFADGGGEL TFLGVPVVMV SYLSAVIPTI LAVWIQSLVE
RFLTRVVPET IRNFTIPLVT VAVVVPLTFL AIGPASYYLG DALSAGVNWL WNLSPALGGL
ILGGTWELMV IFGLHWGFVP VMIQDIATQG YSPLTGPLFP AVLAISGAAF GVWLKTRNSD
LRKIAGPATI SAFLAGITEP AIYGVVLRLK RPFIFALIGG AVGGAIAAFG GSAAEGFVLP
GAITLTSTLN IGNFTLQLIG SALAIVVAFG LTMVFGFKDL PNAAADGPTA TTPGEVASQA
LPVQAPVAGQ VVALDQVPDK VFSSGALGKG LAVIPTEGKA FAPIGGTLLT VMPHAFGLRD
ENGLEVLVHI GLDTVELGGT HFTPAVSQGQ QVRAGDLLGE FDIAAIEQAG YNPITVMIVT
NPGAYQAVVP VAAGTVEAKA LALDLVG