Gene Mnod_2914 details

Gene Information

Locus tag: Mnod_2914
Symbol:
ID: 7304086
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium nodulans ORS 2060
Kingdom: Bacteria
Replicon accession: NC_011894
Strand:
Start bp: 3006873
End bp: 3008486
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 67%
IMG OID: 643600622
Product: extracellular solute-binding protein family 5
Protein accession: YP_002498168
Protein GI: 220922866
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
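
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(3006873, 3008486)
print(length, protein_length(length))  # prints "1614 537"
```

Both values agree with the Gene Length and Protein Length fields in the record.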


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGATC ACTTCGTCAC CAACTGGACG CGTCTCGACG ATGCCGCGGT CGAGGACGCG 
ATCCGCCGGG GTGCCTCCCG CCGCGAACTC CTGCGCGCGC TCATGCTCGG GGGCGTCGCG
GCGTCGACCG GTACGGCGAT CCTCGGGCGC GCGACCGCAG CGCTCGCCGC GACTCCGCGC
CCGGGCGGCG CATTGCGGGC CGCCGGCTGG TCCTCCTCGA CCGCCGATAC CCTCGATCCG
GCCAAGGCGT CTCTCTCCAC CGATTACGTG CGCTGCTGCG CCTTCTACAA CCGCCTCACC
GTGCTCGACG CCGCCGGCAA CGTCCAGATG GAGCTGGCCG AGAGCATCGA GACCAGGGAC
GCCAAGGTCT GGACGGTGAA GCTGCGCAAG GGCGTCACCT TCCACAACGG CAAGGCGCTC
ACCTCCGCCG ATGTCATCTA TTCCCTCAAG CGCCACCTCG ATCCGGCCGT CGGCTCGAAG
GCGAATGCGC TCGCCAAGCA GATGACCGGC TTCAAGGCGC CGGACGACCT CACCGTCGAG
ATCACGCTGG CGAACCCGAA TGCCGACCTG CCGACCATCC TCGCCACCCA CCACTTCATG
ATCCTGGCGG ACGGCACCAC GAATTTCGCC AAGGCCAACG GCACCGGTGC CTTCACCTGC
GAGGTCTTCG AGCCCGGCAT GCGCTCGGTC GGCCTCAAGA ACAAGCACTA CTGGAAGGCG
AGCGGGCCCT ACCTCGATTC CTTCGAGTTC TTCGCGATCC CTGACGATTC GGCGCGCGTG
AACGCGCTGC TCTCGGGCGA CATCCAGCTC GCGGCCGCGA TCAACCCGCG CTCGATGCGG
CTCGTGGAGA GCCAGCCGGG CGTCGTGCTC TCGAAGACGA CCTCGGGCAA CTACACCGAC
TTGAACATGC GGCTCGACAT GGCGCCCGGC GACAAGGCCG GCTTCGTGGA GGGCATGAAG
CACCTGCTCA ACCGGCCGCT CATTCAGAAA TCGGCGCTGC GCGGTCTTGC CGAGATCGCC
AACGACCAGC CGGTGCCGCC CTCGAGCCGC TACCACAATC CCGAGGTGAA GCCGCGCGCC
TTCGACCCCG ACAAAGCGAA GTTCCTCCTC GGCAAGGCCG GCGTGCTCGG CCAGACCATC
CCGGTCATCG CCTCGGACGC CGCGAACTCC GCGGTCGACA TGGCGACCCT GCTCCAGCAG
GCGGCGGCCG GGATCGGCCT GAAGCTCGAC ATCCAGCGCG TCCCCTCGGA TGGCTACTGG
TCGAACTACT GGCTGAAGGC CCCGATCCAC TTCGGCAACG TGAATCCGCG GCCGACGCCC
GACATCCTGT TCTCGCTGTT CTACGCCTCG GAAGCCCCCT GGAACGAGAG CCGCTACAAG
TCCGAGACAT TCGACCGGAT GCTGATCGAG GCGCGCGGCC TCCTCGACGA GACCAAGCGC
AAGGCGATCT ACGGCGAGAT GCAGGCGATG ATCGCGAACG AGGCGGGCAC CGCCATCCCG
GTCTACATCT CGAACGTCGA CGCCCACTCG GCGAAGCTGA AGGGTCTGCA GCCGAGTCCC
CTCGGCGGCA TGATGGGCTA CGCCTTCGCG GAATACGTCT GGTTCGAGGC CTGA
 
Protein sequence
MTDHFVTNWT RLDDAAVEDA IRRGASRREL LRALMLGGVA ASTGTAILGR ATAALAATPR 
PGGALRAAGW SSSTADTLDP AKASLSTDYV RCCAFYNRLT VLDAAGNVQM ELAESIETRD
AKVWTVKLRK GVTFHNGKAL TSADVIYSLK RHLDPAVGSK ANALAKQMTG FKAPDDLTVE
ITLANPNADL PTILATHHFM ILADGTTNFA KANGTGAFTC EVFEPGMRSV GLKNKHYWKA
SGPYLDSFEF FAIPDDSARV NALLSGDIQL AAAINPRSMR LVESQPGVVL SKTTSGNYTD
LNMRLDMAPG DKAGFVEGMK HLLNRPLIQK SALRGLAEIA NDQPVPPSSR YHNPEVKPRA
FDPDKAKFLL GKAGVLGQTI PVIASDAANS AVDMATLLQQ AAAGIGLKLD IQRVPSDGYW
SNYWLKAPIH FGNVNPRPTP DILFSLFYAS EAPWNESRYK SETFDRMLIE ARGLLDETKR
KAIYGEMQAM IANEAGTAIP VYISNVDAHS AKLKGLQPSP LGGMMGYAFA EYVWFEA
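
The gene and protein sequences above are related through translation table 11. A minimal sketch of that relationship follows. It assumes the sense-codon assignments of table 11 written in the conventional TCAG order, strips the 10-character block spacing used in this listing, and uses only the first 60 nucleotides of the gene sequence. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order (sense codons of table 11, assumed).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as listed above (60 nt).
fragment = "ATGACCGATC ACTTCGTCAC CAACTGGACG CGTCTCGACG ATGCCGCGGT CGAGGACGCG"
print(translate(fragment))  # prints "MTDHFVTNWTRLDDAAVEDA"
```

The output matches the first 20 residues of the protein sequence. The same functions can be applied to the full gene sequence to compare against the listed protein and the GC content field.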