Gene Mnod_1413 details


Gene Information

Locus tag: Mnod_1413
Symbol:
ID: 7308165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium nodulans ORS 2060
Kingdom: Bacteria
Replicon accession: NC_011894
Strand:
Start bp: 1496870
End bp: 1498180
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 66%
IMG OID: 643599154
Product: extracellular solute-binding protein family 1
Protein accession: YP_002496715
Protein GI: 220921414
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
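
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming 1-based inclusive coordinates and assuming that the stop codon is counted in the gene length but not in the protein length. The constant and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1496870
END_BP = 1498180
GENE_LENGTH_BP = 1311
PROTEIN_LENGTH_AA = 436


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus the stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Compare the derived values with the values stated in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record: the span from 1496870 to 1498180 covers 1311 bp, and 1311 bp corresponds to 437 codons, of which 436 encode amino acids.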


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGTCCT TCGATCGACG CTCGATCCTC AAGGGAGGGG CCGCCCTCGG CTTCGCCGCG 
GCCTCCGGCC TCGACGGGTT CGCGCGCGCC TGGGCGCAGG AGAACCAGTG GAAGCCGGAG
CCCGGCGCCT CGCTCAAGCT CTTGCGCTGG AAGCGCTTCA TCCCGTCGGA GGACGAGGCT
TTCATGCGCC TCGTCGACGC CTTCACCAAG GCGACGGGCG TGCCGGTGAG CGTCACCAGC
GAGTCCTTCG ACGACATCCA GCCCAAGGCC TCAGTCGCGG CCAATACGGG CCAGGGTCCC
GACATGGTCT GGGGCCTCTA CTCCTTCCCG GCCCTGTTCC CGTCGAAGTG CCTCGAGGTC
GGCGACGTCG CGGACTATCT CGGCAAGAAA TACGGCGGCT GGGTGCCGGC GGCCGAAGCC
TACGGCAAGG TGAAGGGCAA GTGGATCGCG ATCCCGATGG CCTTCAACGG CGGCTACATC
AACTACCGCA TCTCGGCCGT GCAGAAGGCC GGGTTCAGCA AGGTGCCGGA GGATCTCGAC
GGCTTCCTCG AACTCTGCCG GGCCCTGAAG AAGAACAACA CGCCGGCCGG ATTCGCGCTC
GGCCACGCCA CGGGTGACGG CAATTCCTGG GCGCATTGGG CACTCTGGTC GCACGACGCC
TACTTGGTCG ATGCCAACGA GAAGATCATC ATCAACTCGC CGGAAACCGC CAAGGCGCTC
GAATACGTCA AGAACCTCTA TCAGACGTTC ATTCCCGGCA CCGTCTCGTG GAACGATTCC
TCGAACAACA AGGCGTTCCT GTCCGGTGAG CTCTACCTGA CGAACAACGG CATCTCGATC
TATGCCGCGG CGAAGACCGA GCGGAAGGAC ATCGCCGAGG ACATGGACCA CGCGGTCTAC
CCGGTCGGCA AGTCCGGCAA GCCGACCGAG TTCCAGCTCG CCTTCCCGAT CCTGGCCTAC
ACCTACACGA AGGCGCCGAA CGCCTGCAAA GCCTTCATGG CCTTCGCGCT GGAGGCGCAG
AACTACAATC CGTGGCTGGA AGCGGCGCAG GGCTACCTCT GCCACCCGCT GAACGCCTAC
GCCAACAACC CGATCTGGAC CGCCGACCCG AAGAACAAGG TGTTTCGCGA GGCCTCGGTC
CGCACGCTCG CGGCGGGCGG CCTCGCCCCG GTGAGCGAGA AGGTGGCGGC CGTCCTCGCC
GACTTCGTCG TCGTCGACAT GTTCGCCGCC TACTGCACCG GCCGCGAGGA CGTGAAGGGC
GCCATCCGCA CGGCGGAGCG GCAGGCCCAG CGCATCTTCC GCTCGGCCTG A
 
Protein sequence
MTSFDRRSIL KGGAALGFAA ASGLDGFARA WAQENQWKPE PGASLKLLRW KRFIPSEDEA 
FMRLVDAFTK ATGVPVSVTS ESFDDIQPKA SVAANTGQGP DMVWGLYSFP ALFPSKCLEV
GDVADYLGKK YGGWVPAAEA YGKVKGKWIA IPMAFNGGYI NYRISAVQKA GFSKVPEDLD
GFLELCRALK KNNTPAGFAL GHATGDGNSW AHWALWSHDA YLVDANEKII INSPETAKAL
EYVKNLYQTF IPGTVSWNDS SNNKAFLSGE LYLTNNGISI YAAAKTERKD IAEDMDHAVY
PVGKSGKPTE FQLAFPILAY TYTKAPNACK AFMAFALEAQ NYNPWLEAAQ GYLCHPLNAY
ANNPIWTADP KNKVFREASV RTLAAGGLAP VSEKVAAVLA DFVVVDMFAA YCTGREDVKG
AIRTAERQAQ RIFRSA
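
The relationship between the two sequence blocks can be illustrated by translating the first row of the gene sequence and comparing it with the first 20 residues of the protein sequence. This is a minimal sketch in plain Python, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
# Codon table built from the compact TCAG-ordered encoding of the genetic code.
# Translation table 11 uses the same codon-to-amino-acid assignments as the
# standard code, so this table serves for both.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence and first 20 residues of the protein sequence,
# copied from this record.
FIRST_ROW_DNA = "ATGACGTCCT TCGATCGACG CTCGATCCTC AAGGGAGGGG CCGCCCTCGG CTTCGCCGCG"
FIRST_20_AA = "MTSFDRRSIL KGGAALGFAA"

print(translate(FIRST_ROW_DNA) == clean(FIRST_20_AA))
print(f"GC fraction of the first row: {gc_fraction(FIRST_ROW_DNA):.1%}")
```

The same functions can be applied to the full gene sequence. The GC fraction of a single row need not equal the whole-gene value reported in the record.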