Gene M446_3388 details


Gene Information

Locus tag: M446_3388
Symbol:
ID: 6135310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 3762110
End bp: 3763504
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 78%
IMG OID: 641643560
Product: protein TolA
Protein accession: YP_001770212
Protein GI: 170741557
COG category: [S] Function unknown
COG ID: [COG2268] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01352] TonB family C-terminal domain
            [TIGR02794] TolA protein
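
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function and constant names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information record.
START_BP = 3762110
END_BP = 3763504
REPORTED_GENE_LENGTH = 1395     # bp
REPORTED_PROTEIN_LENGTH = 464   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, excluding the final stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, length == REPORTED_GENE_LENGTH)                       # 1395 True
print(protein_length(length),
      protein_length(length) == REPORTED_PROTEIN_LENGTH)            # 464 True
```

Under those assumptions, both reported lengths are consistent with the coordinates.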


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.457375
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.323397
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCCTGC CCTCCCTCAA GCGGTCCGAG CCCGGAATCT GGATCTCGGG GCTGATTCAC 
GTGGCCCTGC TCGGCGCCGC GCTCTACGCC GCGGCGGCCC ACGAGCTGCC GCGGGCCGAG
GAGGGCGTGC CCGTCGAGGT CATCACCGAG AACGAGTTCT CGGAACTCAC GCGCGGCCGG
CCGGAGGGCG ACGCGCCCGC GAAGGCCCCG CGCGCCGACC GCGTCGCCGA CAAGGCGATC
GAGAAGGATC CCGGCGAGGC CAAGACCGAC GTGCCGACCC CGCCGACCCG CCCGCCCGAG
ATGAGGGTGG CGAATGCCGA GGAGCCGGTC CTGCCGCCGC TGCGGCCGGC CCTGGAGCCG
CCCGCGCCCC TGCCGCCGAC GCGCCCCGAC GACAGCGAGG CGCGCGAGCA GGCGCGGGCC
GAGGCCGCGA AGGCGGAGGC CGCCAGAGCG GAAGCGGCCA AGGCCGAAGC CGCCCGGGCG
GCCCGCGCCG AGGCCGCCAA GGCCGCCGCC GAGGCCGCGA AAGCCGCCGC CGCGAGGGCC
GCCGAGAAGG CGCAGGCCGA GGCGAAGGCG AAGGCCGAAG CGGCCCGCCG CGAGGAACTG
GCGGAGCTGA TCGCCCGCGA GGAGGCCGAG GCGAAGGAGA AGGCCGCGCA GGAGAAGGCC
CGGGCCGAGA AGGCGCGGGC GGAGAAGGCC CGCGCCGAGG CCAAGGCGCA GGCCGAGGCC
CGGGCCAAGG CCGAGGCGGA GGCCAAGGCC GAGGCGGAGG CGGAAGCCCG GGCGGAAGCG
AAGGCGAAGG CCGCCGCGGA GGCCAAGGCG GCGGCTGAGG CCAAGGCGGC CGCGGAGGCC
AAGGCGGCCG CGGAGGCCAA GGCCAAGGCC GACGCGGCCC GCGCCAAGGC GGTCGCGGAG
GCGAAAGCCA AGGCGGCGGC CGAGGCGAAG GCGCGCCGGC AGGCCGAACT CGCCAACCAG
TTCAATGCCG GCTCGATCCG CGACATGCTG GCCACCCGCG CCCCCGCCCA GGCGAGCGGC
GCCACCGGCC GCGAGGTCCA GCGCACGGCG GCCCTCGGCA CCGCCTCGGG GAGCGCGGCC
CGGCTCAGCC CGAGCCAGCG CGACGCCCTG GTCGGCCTGC TGCAGCAGCA GATCGAGCGC
TGCTACTCGG CCCCGCCCGG CGCCGCCCAG GGCGTGGTGC TGCCGCAGCT CGACATCCGG
CTCAATCCGG ACGGGTCGCT CGGGGCCGAG CCGCGCATCC TGCGGGCCGG GGGCAGCGCG
GTCGACCGCT CGATCGCTGA GGCGGCCGTG CGCGCGGTGC GCCGCTGCGC CCCCTACCGC
ATCCCCTCCC AGTTCGCGCC CTTCTACAGT GATTGGCGCG TGATCAACGC GGAGTTCGAG
CTGCCGCGGG CCTGA
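
A GC content figure such as the 78% reported for this gene can be recomputed from the nucleotide sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the total sequence length. The helper name `gc_content` is hypothetical, and how the source database rounds the value is not known. Only the first row of the sequence is used here for illustration; the full sequence would have to be pasted in to compare against the reported figure.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence as printed (blocks of 10 bases).
first_row = ("ATGGGCCTGC CCTCCCTCAA GCGGTCCGAG "
             "CCCGGAATCT GGATCTCGGG GCTGATTCAC")
print(f"GC content of first row: {gc_content(first_row):.1f}%")
```

A single row is not expected to match the whole-gene value, since base composition varies along the sequence.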
 
Protein sequence
MGLPSLKRSE PGIWISGLIH VALLGAALYA AAAHELPRAE EGVPVEVITE NEFSELTRGR 
PEGDAPAKAP RADRVADKAI EKDPGEAKTD VPTPPTRPPE MRVANAEEPV LPPLRPALEP
PAPLPPTRPD DSEAREQARA EAAKAEAARA EAAKAEAARA ARAEAAKAAA EAAKAAAARA
AEKAQAEAKA KAEAARREEL AELIAREEAE AKEKAAQEKA RAEKARAEKA RAEAKAQAEA
RAKAEAEAKA EAEAEARAEA KAKAAAEAKA AAEAKAAAEA KAAAEAKAKA DAARAKAVAE
AKAKAAAEAK ARRQAELANQ FNAGSIRDML ATRAPAQASG ATGREVQRTA ALGTASGSAA
RLSPSQRDAL VGLLQQQIER CYSAPPGAAQ GVVLPQLDIR LNPDGSLGAE PRILRAGGSA
VDRSIAEAAV RAVRRCAPYR IPSQFAPFYS DWRVINAEFE LPRA
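
The protein sequence can be checked against the gene sequence by translating it codon by codon. This sketch assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons), so a standard codon table is used. The names `CODON_TABLE` and `translate` are illustrative. Only the first and last rows of the gene sequence are translated here.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide string in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = ("ATGGGCCTGC CCTCCCTCAA GCGGTCCGAG "
             "CCCGGAATCT GGATCTCGGG GCTGATTCAC")
last_row = "CTGCCGCGGG CCTGA"

print(translate(first_row))  # MGLPSLKRSEPGIWISGLIH
print(translate(last_row))   # LPRA
```

The two results match the first 20 and last 4 residues of the listed protein sequence. The last row starts in frame because the 23 preceding rows of 60 bases total 1380 bases, a multiple of 3.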