Gene Mnod_1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_1941 
Symbol 
ID7304608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp2041859 
End bp2043835 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content73% 
IMG OID643599676 
ProductCapsule polysaccharide biosynthesis protein 
Protein accessionYP_002497233 
Protein GI220921932 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3563] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCACG GCAAGCTCAA GCGGTTCTGG CCCCGGACTC CGTCCGACAG CGAGGCTCCG 
ACACCGGGCC GATCGCGGCG GTTGGGCAGC GATCCGACCT CGCGGCGCGT GCGCTTGACG
GACCGCTGGG ACGAGGGCTT CCTGCGCCTT CCCGAGCCCA AGGGCGTGCG GGATCTCAGT
TCCATCGTCG ATGGGCTCGG CATCTACTAC GACGCCACCG CCCCGAGCGA GCTCGAGATC
ATGCTCGAAG AAGGGGGCTG GGAAACGCCC GAGATCCTGG CCCGGGCCGG CGCCTGCATC
GCACGCCTGC GCACTGAGCG GCTGAGCCTC GACAACGACC CCCGCCGACG CGCACTCTCC
GACCTCATCG GCGCGCCGGC GCCCGGGATC CGGCGCGTCG CCATCATCGA TCAGGCCCGC
ACCGATCCGA CGATCGGCTT CGGGCTCGCC GGCGCGACCT GCTTCAGCGC CATGCTGGCA
GCGGCAGCGG CCGAGCATCC CGGCGCCGAG CGCGTCGTGG TGATGGATCC GTCCGCGCCC
TTCGGCGCTG CGGGCCATAT CGGCGCCGAG GACGCGCAGC GGCACGGGGC CCGCCTCGTG
ACCGAGCCCG TCTCGGCCTG GTCGCTGGCG GATGCCTGCG ACCGCCTGTT CGTCGTCACA
AGCCATGTTG GCTTCGAGGC GTCGCTGGCG GGCCGGTCCG TCACCTGTTT CGGGCTGCCG
TTCTACGCTG GATGGGGCTT CACGGACGAC CGGCTCCACC TCGCCCGCCG CTCGCGCCGG
CGCCGGCCCG AGGAGGTCTT CGCCGCCGCC TACATCGTCT GCTCGCGCTA CTTCGACCCC
TATGGCGACG AGGCCTGCCG GCTGGAGGAC GCCCTCGACG TCCTGTCCCT GGTGGTCGCC
CGGCAGCGCG AGAACGCGGC CCGCACGCTC TGCCTCGGCT TCTCCGCCTG GAAGCGGCGC
TCCGTCTCCG AGACCTTCGC GTCGCCCGGC AACCGGCCGG TGATCGCGCG CCCCATGGAG
CGCGTCTCGG CCGCGGATCT CCAGGGCTTC GAGCGGGTCA TCGCCTGGGC CAGCCGGATG
CCGGACGGCA CCGAGGCGAC CTGCCGCGAG GCCGGTCTGC CACTGCTGCG GATGGAGGAC
GGCTTCCTGC GCTCGATCGG GCTCGGCGTC GCCCTGCGCC CCGGCGCCTC GCATGTGCTC
GACCGGAGCG GGGTCTATTA CGACGCCACG CGTCCGAGCG ACCTGGAGGA GATGCTTCAG
ACCGCACGGT TCGACGAGGC GATGCTCGCC CGGGCCGCGC GGCTGCGCGA GGCGATCGTC
GCGGCCCGGG TCAGCAAGTA CAATGTCGGA GGGGCACCGA TGCCCCAGCC CCCGCGGCCG
GGCCCCGTGG TGCTGGTGGC CGGTCAGGTC GAGAATGACG CCTCGATCCG CCTCGGCACG
CTCGACCTGC GCACCAACGC CGCCCTGCTG CGCAAGGCCC GGGAGCGCCA CCCGGAGGCG
ATCATCGCCT TCAAGCCGCA TCCGGATGTC GAGGCGGGGC TGCGCCCCGG CTCGGTGCCG
CCGGAGGAAC TGGCGGCCCA TGCCGACATG GTGCTGCGCG ACGTCTCGGC AGCGGACGCG
ATCGATGCGG CCGACCATGT CGAGACGCTG ACCTCGCTGA TCGGCTTCGA GGCGCTGCTG
CGGGGCAAGA CCGTGACCAC CCACGGCTTG CCCTTCTATG CCGGCTGGGG CCTCAGCGAG
AGCCCGCCCT GCCCGCGCCG CACCCGCCGG CTCACCCTCG ACGAGCTGGT GGCGGGCGCA
TTGATCGCCT ATCCGCACTA CATTGATCCC CGCACCGGCC TGCCCTGCTC GCCGGAGGTA
CTGGTGCGCC GGCTTGCCGA GGGCGATCCC GCCCTCACGC GGCGCACGCT CACGCCCGAG
GCGGTGATGA AGCAGGTCTG GTCCCTGCTC TGGCGCCACG TCCTGCACCG CGGGTGA
 
Protein sequence
MDHGKLKRFW PRTPSDSEAP TPGRSRRLGS DPTSRRVRLT DRWDEGFLRL PEPKGVRDLS 
SIVDGLGIYY DATAPSELEI MLEEGGWETP EILARAGACI ARLRTERLSL DNDPRRRALS
DLIGAPAPGI RRVAIIDQAR TDPTIGFGLA GATCFSAMLA AAAAEHPGAE RVVVMDPSAP
FGAAGHIGAE DAQRHGARLV TEPVSAWSLA DACDRLFVVT SHVGFEASLA GRSVTCFGLP
FYAGWGFTDD RLHLARRSRR RRPEEVFAAA YIVCSRYFDP YGDEACRLED ALDVLSLVVA
RQRENAARTL CLGFSAWKRR SVSETFASPG NRPVIARPME RVSAADLQGF ERVIAWASRM
PDGTEATCRE AGLPLLRMED GFLRSIGLGV ALRPGASHVL DRSGVYYDAT RPSDLEEMLQ
TARFDEAMLA RAARLREAIV AARVSKYNVG GAPMPQPPRP GPVVLVAGQV ENDASIRLGT
LDLRTNAALL RKARERHPEA IIAFKPHPDV EAGLRPGSVP PEELAAHADM VLRDVSAADA
IDAADHVETL TSLIGFEALL RGKTVTTHGL PFYAGWGLSE SPPCPRRTRR LTLDELVAGA
LIAYPHYIDP RTGLPCSPEV LVRRLAEGDP ALTRRTLTPE AVMKQVWSLL WRHVLHRG