Gene M446_3911 details

Gene Information

Locus tag: M446_3911
Symbol:
ID: 6132995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 4357761
End bp: 4359425
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 73%
IMG OID: 641644069
Product: SSS family solute/sodium (Na+) symporter
Protein accession: YP_001770711
Protein GI: 170742056
COG category: [R] General function prediction only
COG ID: [COG4147] Predicted symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
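
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(4357761, 4359425)
print(length)                     # 1665
print(protein_length_aa(length))  # 554
```

Both values agree with the Gene Length and Protein Length fields of the record.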


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.215473
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0942004
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCCGC GCCTCCTCCC GGCCCTGGTG CCGCTCGCCC TCGCGGCGGG ACCGGCGGCG 
GCCGACGCGC TGTCGGGCCC GTCGAGCCGC CAGGCCCTGA ACCCGGTCGC CATCGGCCTG
TTCCTCGCCT TCGTGGCCGT GACCCTCGCC GTCACAATCC GGGCGGCCCG GCGCGGCACC
CGCACGGCGA GCGACTTCTA CGCCGCGGGC GGCTCGCTCG GCGGCGTCCA GAACGGGCTC
GCCATCGCGG GCGACTACAC GTCCGCGGCC ACGTTCCTCG GAGTGACGGC GCTCGTCTAC
GGCTCGGGCT ACGACGGGAT GATCTACGCG GTCGGCTTCC TGGTCGGCTT CCCGATGATC
CTGTTCCTGA TCGCCGAACC CCTGCGCAAT CTCGGCCGCT ACACCTTCGC GGACGTGGCC
GCCTACCGGC TCGCCGAGAT CCCCGTGCGG CTGGTGGCGG GGCTCAACAC CCTGGTGATC
GTGCTCCTCT ACCTGATCGC CCAGATGGTC GGGGCCGGCA AGCTGATCGA GCTCCTGTTC
GGCCTGCCCT ACGCCACGGC GGTGGCGCTG GTGGGCGTGC TGATGATGCT CTACGTCGCC
TTCGGCGGCA TGCGGGCGAC CACCTGGGTG CAGATCATCA AGGCGGTGCT CCTGCTTGCC
GGGACGGCGC TGATGGCGGC GCTCATCCTC GCCCGGTTCG GCTTCAGCCT GGAGGCCCTG
TTCGCGCGGG CCATCGCGCT GCATCCCAAG AGCCGCGCCA TCATGGCGCC GGGCGGGCTC
GTGCGGGATC CGGTCTCCGC GCTCTCGCTC GGCCTTGCGC TGATCTTCGG CACCGCCGGC
CTGCCGCACA TCCTGATGCG GTTCTTCACC GTGGCGGATG CGCGCGAGGC GCGGCTCTCG
GTGCTGGTCG CCACCGGCTT CATCACGGTG TTCTACACGC TGCTCTTCGT GCTCGGCTTC
GGGGCGATCG CCCTCGTGCT CGGCGAGCCC GCCTTCACGG ACGCGGCCGG CCGCCTGATC
GGCGGCCCGA ACATGGTGGC GCTTCACCTC GCCGACCGGC TCGGGGGCGC CCCGCTCCTC
GGCTTCATCT CGGCCGTCGC CTTCGCGACC ATCCTGGCGG TGGTCTCCGG GCTCGCGATC
GCCGGGGCCT CGGCGGCGAG CCACGACCTC TACGCCCGCG TCCTGCGGCG CGGCCGGGCG
AGCGAGGCGG AGGAGGTGCG CGTCTCGAAG GGCGCGGCGG TGGCGATCAG CCTCGCCGCC
ATGGCCCTCG GCCTCGCCTT CGAGAACCAG AACATCGCCT TCCTGGTCGG GCTCGTCTTC
GCCATCGCGG CGAGCGCCAA TTTCCCGGTG ATCGTGCTCT CGGTCTCCTG GCCGGGGCTG
ACCACGCGGG GAGCCGTCGC GGGCTCGCTC GCCGGGCTCC TGTGCGCGCT CGGCCTGATG
ATCCTCGGCC CGGGCGTCTG GACCGCCGTG CTCGGCCTCG GGGCGGCCCC GTTCCCCTAC
GACAACCCCG CGCTCTTCTC CGTGCCGCTC GCCTTCGTGA CCGCGGTGGC GGTCTCGCGC
CTCGACCGCA GCGCGGCCGC CCGCGCCGTC CGCGCGGCCT ACCGCGCCCA GCACGTCACC
GCCCAGACCG GCTTCGACCG GCCCCGGCCG GTCGCCGCCC ACTGA
 
Protein sequence
MRPRLLPALV PLALAAGPAA ADALSGPSSR QALNPVAIGL FLAFVAVTLA VTIRAARRGT 
RTASDFYAAG GSLGGVQNGL AIAGDYTSAA TFLGVTALVY GSGYDGMIYA VGFLVGFPMI
LFLIAEPLRN LGRYTFADVA AYRLAEIPVR LVAGLNTLVI VLLYLIAQMV GAGKLIELLF
GLPYATAVAL VGVLMMLYVA FGGMRATTWV QIIKAVLLLA GTALMAALIL ARFGFSLEAL
FARAIALHPK SRAIMAPGGL VRDPVSALSL GLALIFGTAG LPHILMRFFT VADAREARLS
VLVATGFITV FYTLLFVLGF GAIALVLGEP AFTDAAGRLI GGPNMVALHL ADRLGGAPLL
GFISAVAFAT ILAVVSGLAI AGASAASHDL YARVLRRGRA SEAEEVRVSK GAAVAISLAA
MALGLAFENQ NIAFLVGLVF AIAASANFPV IVLSVSWPGL TTRGAVAGSL AGLLCALGLM
ILGPGVWTAV LGLGAAPFPY DNPALFSVPL AFVTAVAVSR LDRSAAARAV RAAYRAQHVT
AQTGFDRPRP VAAH
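
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It builds the codon table from the standard NCBI base ordering (the amino-acid assignments of translation table 11 are the same as those of the standard code), strips the 10-base spacing used above, and translates until the first stop codon. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The sketch does not handle the alternative start codons that table 11 allows, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# NCBI ordering: first, second, and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGGCCGC GCCTCCTCCC GGCCCTGGTG"))  # MRPRLLPALV
# Last 15 bases of the gene sequence above, ending in the TGA stop codon.
print(translate("GTCGCCGCCCACTGA"))                   # VAAH
```

The two outputs match the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.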