Gene M446_3501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3501 
Symbol 
ID6135017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3906650 
End bp3907693 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content68% 
IMG OID641643672 
Productputative ABC transporter periplasmic substrate-binding protein 
Protein accessionYP_001770320 
Protein GI170741665 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00652225 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.464247 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCGGTC GTGACGGGCT GTTCTACCGA AACTTCTCCC GGCGGGGCTT CCTCGCCCGA 
AGCGCCGCCG CGGCGGCGCT CGCCCTGCCG GCCCATCTGG CGGGCCTGCT CGCGCCCGCC
GCCGCCGGGA CCGTGATCAA GGCCACCCAC GGCTCCGGCT TCTGCAACAT GGGCATCTTC
CTCGCCAAGG AGCGCGAGCT GACCAAGGCG GACGGCGTCG AACTCGACTT CGTGGTGACG
CCCTCCAACA CCGAGATCAC CACGATGTTC GGGGCGGGCC TCGTCGACAT GTCGATGATC
CCCTACTCGA ATTTCATGAC CCTCTACGAC GCGGGCGCGC CGGTGAGGAT CGTCGCGGGC
GGCGGCGTCG AGGGCTGCAT CATCGTGGCG CGCGACGGCA TCGCCTCCGC GGCCGACCTC
AAGGGCAAGA CCTTCGGCAC CTTCCAGGCC GACACGCTCG AGGTCCTGCC CTACGACTAC
CTCAAGAAGG CGGGCCTCGG CTTCCGGGAC GTCGAGATCA AGTACCTCGA CACCTCGCCC
GAACTGGCCC AGGCCTTCCT GGCCGGCGCC CTCGATGCGA TCTGCCACAT CGAGCCCTAC
GCCTCGCAAT GCGTGCGCGG CCGCAAGGGC GCGCACGTGC TCTCGGACGG GACCGACGTC
TACGGCAAGG GCTATTCCGA CTGCGTGCTC GCCGTGCGCA CGCCGCTCCT CAAGAGCAAC
CCCGCCGCCG TGAAGGCCGT CATCAAGGCC CTGTTCGTGG CCCAGGCCCA GGCCGAGGCG
GACAAGGGCG CCGCCCTCAA GGACACGGTC GGCAAGTACT ACAAGACCAG CATGGAGGCG
GCGGTCGACG CCTCCTCCAA GCAGCCGATC GTGGTGGATC AGCGCAACCA GACCCGGTTC
ATCCTGGCGC GCGGCACCTC GATGCAGGAA CTCGGCTACG TCAGGAAGGC CCCGGACGAG
GGCGCCTTCG ACTGGAGCCT GCTGGAGGCG GTGATCGCCG AGAACAAGCC CCTGTACGAC
GGGCTCAAGC TGAAATCGGC CTGA
 
Protein sequence
MCGRDGLFYR NFSRRGFLAR SAAAAALALP AHLAGLLAPA AAGTVIKATH GSGFCNMGIF 
LAKERELTKA DGVELDFVVT PSNTEITTMF GAGLVDMSMI PYSNFMTLYD AGAPVRIVAG
GGVEGCIIVA RDGIASAADL KGKTFGTFQA DTLEVLPYDY LKKAGLGFRD VEIKYLDTSP
ELAQAFLAGA LDAICHIEPY ASQCVRGRKG AHVLSDGTDV YGKGYSDCVL AVRTPLLKSN
PAAVKAVIKA LFVAQAQAEA DKGAALKDTV GKYYKTSMEA AVDASSKQPI VVDQRNQTRF
ILARGTSMQE LGYVRKAPDE GAFDWSLLEA VIAENKPLYD GLKLKSA