Gene Mnod_1603 details

Gene Information

Locus tag: Mnod_1603
Symbol:
ID: 7303364
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium nodulans ORS 2060
Kingdom: Bacteria
Replicon accession: NC_011894
Strand:
Start bp: 1684956
End bp: 1686575
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 69%
IMG OID: 643599337
Product: extracellular solute-binding protein family 5
Protein accession: YP_002496896
Protein GI: 220921595
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
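
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1684956
end_bp = 1686575
gene_length_bp = 1620
protein_length_aa = 539

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)                     # coordinates agree with Gene Length
print(expected_protein_length == protein_length_aa)  # codon count agrees with Protein Length
```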


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.654936
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAGT TCTCCCTCTC CCGCCGCCAG TTCGTGGCCG GCGCCGCGGC GCTCACGGCG 
ATGGGGCCGA CCGGGTCCGC GCTCGCCGCA GGGCCACCGT CCGGCAGCCT GACCTACGGC
ATTTCGATGT TCGACCTGCC CCTTACCACC GGGCAGCCGG ACCGGGGGGC GGGTGGCTAC
CAGTTCACCG GGCTCACCCT CTACGACCCA CTGGTGGCCT GGGAACTCGA TGTCGCCGAC
CGGCCCGGCA GACTGATCCC GGGCCTTGCC ACCTCCTGGG AGAGCGATCC CGCCGACCGG
AAGAACTGGA CCTTCCGCCT GCGCGAGGGC GTGACGTTCC ACGACGGCTC CGTCTTCGAC
GCGGATGCGG TGATCTGGAA TTTCGAGAAG GTGCTGAACG ACAAGGCCCC GCATTACGAC
CAGCGGCAGG CCTCGCAGGT GCGCCCGCGC CTGCCCTCGG TCGCCTCCTA CAGGAAGCTC
GACGCCATGA CAGTGCAGGT CACCACCAAG GCGGTCGACG CGCTGTTCCC CTACCAGATG
CTGTGGTTCC TGGTCTCCTC GCCAGCGCAG TACGAGGCGG TGGGCCGCGA CTGGACCAAG
TTCGCCTTCC AGCCTTCCGG CACCGGCCCC TACCGCATGG GCCAGCTCGT GCCGCGGGTG
CGGCTCGAAC TCGTCCCGAA CGAGACCTAC TGGAACCCCA GGCGGATGCC GAAGCTCGCC
AGGCTCACGC TGACCTGCAT CCCCGACAAT CTCGCGCGGG TCAACGCGCT CCTCAGCGGC
GACGTCGACC TCGTGGAGCT GCCCGCGCCC GATGCGGTGC CGCACCTCAA GGCGGCGGGC
ATGCGGGTGA CCGGCAACGA CACGCCGCAT GTCTGGAACT ACCATCTGTC GATGCTGGAG
GGCAGCCCGT GGCGCGACCT GCGCCTGCGC AAGGCGGCCA ACCTCGCCAT CGACCGCGAG
GGCGTGGTCG CGCTGATGGG TGGCCTCGCC ACCCCGGCAG TGGGCCAGGT GCAGCCGTCG
AGCCCCTGGT TCGGCAAGCC CTCCTTCAAG ATCGGCTACG ACATCGACAC CGCCCGCAAG
CTGATGCGGG AGGCGGGCTA CTCCCCCCAG AATCCGTTGC GCACCAGGTT CATCATCCCG
ACCGGCGGCT CGGGCCAGAT GCTGTCGCTG CCGATCAACG AGTTCGTGCA GAGTAGTTGG
GCCGAGATCG GCATCGCGCT GGAGTTCCAG CCGGTGGAGC TGGAGGTGGC CTACACGGCG
TGGCGCCAGG GCGCAGCCGA CCCGTCGCTC AGGGGCGTGA CCGGCGGCAA CATCGCGTAT
GTCACCTCCG ACCCGCTCTA CGCGATCCTG CGCTTCTACA GCTCGAAGCA GATCGCGCCG
ACCGGCGTGA ACTGGAGCCA CTACCGGAAC CCGGAGGTGG ATGCCCTCTG CGAGTCGATC
CAGGCGAGCT TCGACCCCGC CGAGCAGGAT CGGATGCTCG CCCGCATCCA CGAGATCGTG
GTGGACGACG CGGTGCAGGT CTGGGTGGTG CACGACACCA ACCCGCACGC CCTCGCGGCC
AAGGTGAAGG GCTACACCCA GGCCCAGCAC TGGTTCCAGG ACCTCACCAC CTTGGCCTGA
 
Protein sequence
MSQFSLSRRQ FVAGAAALTA MGPTGSALAA GPPSGSLTYG ISMFDLPLTT GQPDRGAGGY 
QFTGLTLYDP LVAWELDVAD RPGRLIPGLA TSWESDPADR KNWTFRLREG VTFHDGSVFD
ADAVIWNFEK VLNDKAPHYD QRQASQVRPR LPSVASYRKL DAMTVQVTTK AVDALFPYQM
LWFLVSSPAQ YEAVGRDWTK FAFQPSGTGP YRMGQLVPRV RLELVPNETY WNPRRMPKLA
RLTLTCIPDN LARVNALLSG DVDLVELPAP DAVPHLKAAG MRVTGNDTPH VWNYHLSMLE
GSPWRDLRLR KAANLAIDRE GVVALMGGLA TPAVGQVQPS SPWFGKPSFK IGYDIDTARK
LMREAGYSPQ NPLRTRFIIP TGGSGQMLSL PINEFVQSSW AEIGIALEFQ PVELEVAYTA
WRQGAADPSL RGVTGGNIAY VTSDPLYAIL RFYSSKQIAP TGVNWSHYRN PEVDALCESI
QASFDPAEQD RMLARIHEIV VDDAVQVWVV HDTNPHALAA KVKGYTQAQH WFQDLTTLA
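
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the DNA with the amino acid assignments of translation table 11. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The sketch does not handle the alternative start codons that table 11 allows, which is harmless here because this gene starts with ATG. Only the first and last rows of the gene sequence are used as sample input.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAGCCAGT TCTCCCTCTC CCGCCGCCAG TTCGTGGCCG GCGCCGCGGC GCTCACGGCG"
last_row = "AAGGTGAAGG GCTACACCCA GGCCCAGCAC TGGTTCCAGG ACCTCACCAC CTTGGCCTGA"

print(translate(clean_sequence(first_row)))  # MSQFSLSRRQFVAGAAALTA
print(translate(clean_sequence(last_row)))   # KVKGYTQAQHWFQDLTTLA
```

Both printed fragments match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and protein fields of the record.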