Gene M446_3433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3433 
Symbol 
ID6132086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3811176 
End bp3812564 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content72% 
IMG OID641643603 
ProductPTS system, sucrose-specific IIBC subunit 
Protein accessionYP_001770255 
Protein GI170741600 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR01996] PTS system, sucrose-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0234327 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTACA CGCCGAATGC CGACCAGCTC GCCGCCGAGC GGATCCTCGC CCTTGTGGGC 
GGGGCGGGCA ACGTCGTCAG CGCCGCACAC TGCGCCACGC GCCTGCGCCT CGTCCTGGCG
GATGTGGCGA AGGTCGACAA GGCCGGCCTG GAAGCGGTCG AGGCGATCAA GGGCACCTTC
CTCAACGGCG GCCAGTTCCA GATTATCATC GGGCAGGGGC GCGTCGCGCG GCTCCACGAG
GCCCTGGTGC GGGCGGGCGG CCTCACGGCG GTGTCCGCCG CGCAGGCGAA GGAGGATGCC
TCGGTCCGGC TCTCGGCGCC CCAGCGCTTC GCCCGGCTCC TCTCCAACAT CTTCGTGCCG
ATCATCCCGG TCATCGTCGC CTGCGGCCTG CTGATGGGCA CGCTCGGCAC GATCAGGACG
ATGGGCTGGC TCCCCGCGAC CTCGGCGCCG ATCCAGCTGC TCGACCTCGT CTCCAGCACC
GCCTTCATCT TCCTGCCGAT CCTGGTCGGG TTCTCGGCCG CGCGGGAATT CGGATCGAGC
CCGTTCATGG GCGCGGCGCT GGGCGGCGTC ATGATCCACC CGGTCCTCCA GAACGCCTGG
ACGGCCGGAA CCGGCATCAA GGCCTACTGG GACCTGTTCG GCCTGCCGGT GGCGCAGCTC
GGCTACCAGG GCACGGTGCT GCCGGTCCTC GTCGCCGTCT GGGTGATGGC CCTCGTGGAG
CGCGGCCTGC GCCGGGTCGT GCCCGACATG CTCGACATCG TCCTGACGCC CTTCCTCACG
CTCCTCGTCT CGGCGTTCTT CGCCCTCACG GTGCTCGGGC CGGCCGGCCG CCTCCTGGGC
GACGGCATCT CGATGGGCCT GCAGCAGCTC TACGCGCAGG GCGGTCCGCT CGCCGGCTTC
GCCTTCGGCG GGCTCTACTC GGCGATCGTC ATCACCGGCG TGCACCACAG CTTCCACGCC
ATCGAGGCCG GGCTCCTCGC CAACCCGGCG ATCGCCGCGA ATTTCCTGCT GCCGATCTGG
GCGGCCGCCA ACGTGGCCCA GGGCGGCGCG GCGCTCGCCG TGGCCCTGCG CAGCGACGAC
GCCAAGGTGA AGCAGGTCGC CGTGCCGGCG GCCCTGTCCT GCCTGCTCGG CATCACGGAG
GCCGCCATCT TCGGCGTCAA CCTGCGCTTC GTGCGGCCCT TGCTGGCCGC CGCGCTCGGC
GGCGCGGTGG GCGGGGCCTA CGTGGCGGCC GCGAAGGTGA GCATGACCGC GGTGGGCGTC
ACCGGCCTGC CCGGCATCGC CATCACGGCG CCGGGCAGCA TCCTCGACTA CCTGATCGGG
CTCGCCCTCG CCTTCGGAGT GGCGTTCGCG GCGGCCTTCG CCGCGGGCCT CCGGGCGGAG
CGGGCCTGA
 
Protein sequence
MTYTPNADQL AAERILALVG GAGNVVSAAH CATRLRLVLA DVAKVDKAGL EAVEAIKGTF 
LNGGQFQIII GQGRVARLHE ALVRAGGLTA VSAAQAKEDA SVRLSAPQRF ARLLSNIFVP
IIPVIVACGL LMGTLGTIRT MGWLPATSAP IQLLDLVSST AFIFLPILVG FSAAREFGSS
PFMGAALGGV MIHPVLQNAW TAGTGIKAYW DLFGLPVAQL GYQGTVLPVL VAVWVMALVE
RGLRRVVPDM LDIVLTPFLT LLVSAFFALT VLGPAGRLLG DGISMGLQQL YAQGGPLAGF
AFGGLYSAIV ITGVHHSFHA IEAGLLANPA IAANFLLPIW AAANVAQGGA ALAVALRSDD
AKVKQVAVPA ALSCLLGITE AAIFGVNLRF VRPLLAAALG GAVGGAYVAA AKVSMTAVGV
TGLPGIAITA PGSILDYLIG LALAFGVAFA AAFAAGLRAE RA