Gene M446_1457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1457 
Symbol 
ID6133047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1600562 
End bp1602292 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content57% 
IMG OID641641732 
Productlipopolysaccharide biosynthesis protein 
Protein accessionYP_001768401 
Protein GI170739746 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTAC AAGCCAGCCG CCTGACCACG CAGCTGCCGA TATTGCGAGA TCGCGGAGTT 
AATCTCCGCG AAATACTGAG CGTGTTGAAA CGCCGTCGGG CTATTTTCCT GGTTGTTACC
TCGATCGTTC TATTTTCCGT GATAGCCTAC CTGTTCATCG CACGTCCTAC GTTTACCGCA
ACAGCGCAGA TCCTTCTCGA CCAGCAGCAG AGAATCGCCG ACGATGTTCC CAGGGAGCAG
CTACCGTCAG AAACTGTAAG CGCGATCGTC GAGAGCCAAG TTCAGACAGT GGCGTCCCAC
GAAATTGTGA GGCGTCTTAT CGCAAGCGAG CATCTAACCA GCGATCCGGA ATTTGCTTCG
CGGGGACTTG TATTGCAAAT ATTGCATTCC GCTTTGGGCA TGATCGGCAC AGCGGCGTAC
GAGGAAGGAG ATCCAGAAGC GCGCGTACAC CAAAATGTTC GCAACGCGAT CTCAGCACGA
AGGCTTGATA AAACTTTTGT TCTCGAGATC AGCTTTGAAT CGCATGATCG GCAAAAATCT
GCGCGGCTCG CGAACGCCAC CGCGCGCGCC TTTATCGCCG ACCAGGTTGA GGCGAAAGTG
GCCGCGAATC GCCGTCTTGC GGCGTCGTAT GAGGCGCGTC TACCGGAGTT GCGAGGGGAA
TTGCAACGAG CCGAACAGGC GATTGAGACT TACAAGTTTC AGCACAGTAC GGTCGTTCCC
TCCGGCGGCC CGGCCACGCT CGGCGGAAAT GAGGCCCTTG TCGGGCTGAG GGAACTCGAG
CGGGAAGCAG AGACGAGTCG CGCCCTGTAT GTCTCGCAAC TGGCACGTTC GAGGAAGGCG
TTCGAACAAG CGAATTTCTA CGTTGCTGAC GCACGCTTCA TTTCCCCCGC AATCCCGCCA
GCGCGCCGCA GCTGGCCGCC GACGGGAGTG TTGCTCGTTG CTGGTCTCTT TGGTGGCATG
AGCGTGGCCG CGGGCGTCGC GTTGCTGCGC GATCATTTGG ACACTCGCCT CTTTACGAAG
GAACAAACCG AGTCGGAAAC GGAGTTTCCG GTCCTCGCCG ACATACCAGA AGCTCGGCCA
AATTCACCCG ACATCGCGCG TTGCAATCAG GGTGCGTTCC TGCGTATCTT GGACTCTGTA
CGCGAGCATT CAGAGAGAAA AAGCACCAGA ATTATACTTC TTACTTCGTC CGAACTGGGG
GAGGGTAAAA GTACAATAGC TATAAACCTG GCGATGATTG CTGATAAACT CGGAGATAGC
GTCCTGCTCG TTGACTCGCC GTTCGCCACG ACGGCGACGT CGGTGGCGGG GGAGCACATC
TGGTTCGTCG ACTCGCCCTT CATCGTGCGA GCGGCGCTCT TGCCGTCGAG TGCCGGGATC
AAGGCCACGA GTCAGAACGG AGAGGCGGCG CATCGTCAGG CGGCGCATCA CCCACTCGTG
CTACAAGGCG ACTCCGCACG AAGAACCAGC CTACGAGACC AAATCGAATT TTTGCTGAAT
TCGTCGACGC GAAAATTCGA TCTCATCATC TTGGAGCGAA GCGCGGCCAA CGACGACTGC
GTTCTCCGTG ACATGAGCCA CATCGCACAC TCGATCATCA TTGTGGCGAA AGCCGGTCGA
ACTCGGGTCG GCGACATTGC TTCGATCTCC GAAACACTTG GATTGGGACG CAAGCGAATC
GCTGGTGTGG TACTTAACCG CACTCGGCGG AAGTCGAGGT TGCTGCCGTG A
 
Protein sequence
MSLQASRLTT QLPILRDRGV NLREILSVLK RRRAIFLVVT SIVLFSVIAY LFIARPTFTA 
TAQILLDQQQ RIADDVPREQ LPSETVSAIV ESQVQTVASH EIVRRLIASE HLTSDPEFAS
RGLVLQILHS ALGMIGTAAY EEGDPEARVH QNVRNAISAR RLDKTFVLEI SFESHDRQKS
ARLANATARA FIADQVEAKV AANRRLAASY EARLPELRGE LQRAEQAIET YKFQHSTVVP
SGGPATLGGN EALVGLRELE REAETSRALY VSQLARSRKA FEQANFYVAD ARFISPAIPP
ARRSWPPTGV LLVAGLFGGM SVAAGVALLR DHLDTRLFTK EQTESETEFP VLADIPEARP
NSPDIARCNQ GAFLRILDSV REHSERKSTR IILLTSSELG EGKSTIAINL AMIADKLGDS
VLLVDSPFAT TATSVAGEHI WFVDSPFIVR AALLPSSAGI KATSQNGEAA HRQAAHHPLV
LQGDSARRTS LRDQIEFLLN SSTRKFDLII LERSAANDDC VLRDMSHIAH SIIIVAKAGR
TRVGDIASIS ETLGLGRKRI AGVVLNRTRR KSRLLP