Gene M446_3450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3450 
Symbol 
ID6129874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3834944 
End bp3835975 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content72% 
IMG OID641643617 
Productnitrate/sulfonate/bicarbonate ABC transporter periplasmic ligand-binding protein 
Protein accessionYP_001770269 
Protein GI170741614 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.302208 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000774086 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCAGAC GCACCCTGCT CTCGCGGCGG CGGGCGGCCG CCCTGCTCGG CGGCGCCCTC 
CTGGCCGGTG CGGCCGCGCC CGGCCCCGCC GCCGCGGCCG AGGGCCGGCT GCGCATCGCC
AAGCAGTTCG GCGTCGTCTA CCTGCTCCTC GACGTGGCCC TGGAGCAGCG GCTGATCGAG
AAGCACGGCC GGGCCGCCGG GCTCGACATC GCGGTCGAGC CGGTGCAGCT CTCGGGCGGC
GCGGCGGTCA ACGACGCGCT GCTGTCCGGC AGCATCGACA TCGCCGGGGC CGGGGTCGGC
CCGCTCTTCA CCCTGTGGGA CCGCACCCGG GGCCGGCAGA ACGTCAAGGG CGTCGCCTCG
CTCGGCAACT TCCCCTACCT GCTCGTCAGC AACCGGCCGC AGGTGCGGTC GATCGCCGAC
CTGACCGAGG CGGACCGGAT CGCGCTGCCC GCGGTCGGCG TGTCGGTGCA GGCGCGGATC
CTGCAATGGG CCGCCGCCAA GCAATGGGGC GAGGCGGATT TCGCCCGGCT CGACCGGATC
AGCGTCGCGG TCCCGCATCC CGAGGCGGCG GCGGCGATCA TCAAGGGCGG CACCGAGATC
AGCGCCCATT TCGGCAACCC GCCCTTCCAG GAGCAGGAAC TGGCCGAGGC CCCGGACGCC
CGGGTGATCC TCAATTCCTA CGAGGTCCAG GGCGGCCCCG CCTCCTCGAC GGTGCTGTAC
GCGACGGAGA CGTTCTACCG CGACAGCCCC AGGACCTACC GGGCCTTCCT CGACGCCCTC
GACGAGGCGG CGACCTTCGT GGCCGCCAAC CCGGACCAGG CCGCCGAGAT CTACCTGAAG
GCCAACGGCA GCCGGATCAG CCGCGATCTC CTGCTCAAGG TGATCAGGAA CCCGGACGTG
ACCTTCAAGA TCGCGCCGCA GAACACGCTC GGCCTCGGCC GGTTCATGCA CCGCGTGGGC
GCGATCCGCA ACGAGCCGAA GGCGCTCGCG GATTACTTCT TCGCCGATCC GCGCGTGGCC
GCGGGCAGCT GA
 
Protein sequence
MIRRTLLSRR RAAALLGGAL LAGAAAPGPA AAAEGRLRIA KQFGVVYLLL DVALEQRLIE 
KHGRAAGLDI AVEPVQLSGG AAVNDALLSG SIDIAGAGVG PLFTLWDRTR GRQNVKGVAS
LGNFPYLLVS NRPQVRSIAD LTEADRIALP AVGVSVQARI LQWAAAKQWG EADFARLDRI
SVAVPHPEAA AAIIKGGTEI SAHFGNPPFQ EQELAEAPDA RVILNSYEVQ GGPASSTVLY
ATETFYRDSP RTYRAFLDAL DEAATFVAAN PDQAAEIYLK ANGSRISRDL LLKVIRNPDV
TFKIAPQNTL GLGRFMHRVG AIRNEPKALA DYFFADPRVA AGS