Gene M446_2649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_2649 
Symbol 
ID6135343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp2940751 
End bp2941758 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content74% 
IMG OID641642863 
Productbile acid:sodium symporter 
Protein accessionYP_001769522 
Protein GI170740867 
COG category[R] General function prediction only 
COG ID[COG0385] Predicted Na+-dependent transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.19525 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCGC GCTTCCGCCC CGATCCCTTC ATGCTGATGC TCCTCGCCTG CCTGCTCCTC 
GGCGCGTTCC TGCCGGTGAG CGGCGGGCTC GCGGAGGGGC TCGGCAGCGT GGCGACCGGC
GCGATCGCGC TCCTGTTCTT CCTGCACGGC GCCCGCATCG ACCGGCGCAC GGCCCTGGCC
GGGCTCGTCC ATTGGCGGCT CCACCTCGTG GTGCTGGCGA CGACGTTCGG GCTCTTCCCG
CTCCTCGGCC TCGCGGCGGG CCTGCTCGCG CCGAGCCTGC TGACGCCGGC CCTCGCGGCG
GGCGTGCTGT TCCTGTGCGT CCTGCCCTCG ACCGTGCAAT CGTCGATCGC CTTCACCTCG
GTGGCGGGCG GCAACGTGCC GGCGGCGGTC TGCGCCGCCT CGGCCTCGAA CATCCTCGGC
ATGGTCCTGA CGCCGCTCCT GGCCTCCCTG CTGTTCCGGG CCCAGGGCGC CTTCGACTGG
TCCGGGGCGG GCAAGGTCCT CCTGCAGCTG CTCGCGCCCT TCCTGCTCGG GCAGCTGCTG
CGGCCGCGGC TCGCGCCGCT CCTGGCCTCC CGCAAGGGGG TGACCGCCCT CGTCGACCGC
GGCTCGATCC TGCTCGTCGT CTACCTCGCC TTCAGCCACG CCAGCGCGAG CGGGCTGTGG
TCGCGCACGC CGCTGCCGGC GCTCGCCACC ATGCTGCTCG TCGACGGGAT CCTGCTCGCG
AGCGTGCTTG CCCTGACCGC GGCCGCGAGC CGGCTGCTCG GCTTCTCGCG GGCGGACGAG
ATCACCATCG TGTTCTGCGG CTCGAAGAAG AGCCTCGTGG CCGGCGTGCC GATGGCGAAC
GTCCTCTTCG CCGGGCAGGA TGTCGGGGGT CTGCTCCTCC CGGTGATGCT GTTCCACCAG
ATCCAGATCG CGGCCTGCGC CGCCCTGGCC CGGCGCTACG CCGCCCGCGG CGGGAGCTAT
CGAACGGCCC CGGCGGCGCT TTCCGCGCTT CCGCTGGCGT CCCGTTGA
 
Protein sequence
MRARFRPDPF MLMLLACLLL GAFLPVSGGL AEGLGSVATG AIALLFFLHG ARIDRRTALA 
GLVHWRLHLV VLATTFGLFP LLGLAAGLLA PSLLTPALAA GVLFLCVLPS TVQSSIAFTS
VAGGNVPAAV CAASASNILG MVLTPLLASL LFRAQGAFDW SGAGKVLLQL LAPFLLGQLL
RPRLAPLLAS RKGVTALVDR GSILLVVYLA FSHASASGLW SRTPLPALAT MLLVDGILLA
SVLALTAAAS RLLGFSRADE ITIVFCGSKK SLVAGVPMAN VLFAGQDVGG LLLPVMLFHQ
IQIAACAALA RRYAARGGSY RTAPAALSAL PLASR