Gene M446_4507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4507 
Symbol 
ID6134304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4963993 
End bp4964973 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content65% 
IMG OID641644647 
ProductABC nitrate/sulfonate/bicarbonate transporter, periplasmic ligand binding protein 
Protein accessionYP_001771282 
Protein GI170742627 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.384505 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGACAC GCCTCGCGGC CCTCGCCGCG TTCATCGCGG CCTGGGCCGG ACCCGCCTCC 
GCCGAGGACG TGGTCCGCCT CGGCAACCTG AAATTCGCCC ATTACGGCGC CATCTCCTAC
ATGAAGGAGA TCGCGCCCAA ATACGGAATC CGCATCGACG AGAAGGTCTT CGCCAAGGGC
GCCGACATCT ACCCGGCGAT GGCGGTCGAC CAGATCGACA TCTCGGCCTC CGGCGCCGAC
GGGGCGGTGG CGGCGCGCGG CAACGGCGTG AAGCTGCTGG TCGTGGCGGG CTTCGCCAAT
GGCGGGGTGC GCATCCTCGG CCGGCCGGAC CTCGGCGCCA AGACCCTCGC GGACATCAAG
GGCAAGAAGG TCGCCACCGT CCGGGGCGGC ACGCAGGACC TGATGCTGCT CGCCGAACTG
GAGAAGAACG GCCTGACCTG GTCAGACCGT CCCGGCAAGG ACGTGCAGTT GATCTACTTC
AACAACTACG CCGACCTGAA TCAGGCTCTC GCCCAGAAAT ACGTCGACGT GATCTGCCAG
AGCGAGCCGC AATCGACGCA GGCGATCTCG GCCGGCTGGG GCACCGAGAT CGTCAAGCCC
TACGACACCC CGGTCGGCAT TCCCTACCGG CCGCTCGTCA TGACCGAGAA GATGTATGCC
GAGAAGCCGG ACGTGGCGGC CCGGGTGCTC AAGGTCTTCG TCGAGGCGAC CAAGACCTTC
ATCGAGAAGC CGGACCTCGC CGAGAAGTAC GTGCGCGAGC AGGTCTTCAA GGGACAGCTC
TCGTCCCAGG ACTACAAGGA CGCCATGACG AACGCGGCCT TCACGTACGA TATCCCGGCC
GGCCACATGC AGGTCACGGC CGACCTGATG CACAAGTACG GGCTCGGCAA GATGGTGAGC
CCGCCGAAGA GCGACGCGGA GTGGGTGAAG CTCGACCTTC TGGAGAAGGC CAAGGCGGAG
CTGGGTGCCA AGACCAATTA G
 
Protein sequence
MLTRLAALAA FIAAWAGPAS AEDVVRLGNL KFAHYGAISY MKEIAPKYGI RIDEKVFAKG 
ADIYPAMAVD QIDISASGAD GAVAARGNGV KLLVVAGFAN GGVRILGRPD LGAKTLADIK
GKKVATVRGG TQDLMLLAEL EKNGLTWSDR PGKDVQLIYF NNYADLNQAL AQKYVDVICQ
SEPQSTQAIS AGWGTEIVKP YDTPVGIPYR PLVMTEKMYA EKPDVAARVL KVFVEATKTF
IEKPDLAEKY VREQVFKGQL SSQDYKDAMT NAAFTYDIPA GHMQVTADLM HKYGLGKMVS
PPKSDAEWVK LDLLEKAKAE LGAKTN