Gene M446_6139 details


Gene Information

Locus tag: M446_6139
Symbol:
ID: 6131014
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 6756055
End bp: 6757119
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 72%
IMG OID: 641646234
Product: extracellular solute-binding protein
Protein accession: YP_001772846
Protein GI: 170744191
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID:
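
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are inclusive (the record does not state the convention) and that the CDS ends with one stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 6756055
end_bp = 6757119
gene_length_bp = 1065
protein_length_aa = 354

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codons = gene_length_bp // 3
residues = codons - 1

print(span_bp == gene_length_bp)      # coordinate span matches gene length
print(gene_length_bp % 3 == 0)        # length is a whole number of codons
print(residues == protein_length_aa)  # codons minus stop matches protein length
```

Under these assumptions all three checks print True: 6757119 - 6756055 + 1 = 1065 bp, and 1065 / 3 = 355 codons, one of which is the stop codon.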


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.160536
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCATCC TCGCGGCCGG ATTGATCGCC TTCGGCGTCC TGCTCGCCTC CGCCCGGGCC 
CAGGTCGACG CGCCCGTCGA CGTGCGGCGC CAGGACGTCG CGGAGGCCGT GCGGGAGGGC
GCATTGGTCA TCCGCGCGAC GACCGACGAG GCGGAGGCCT CCGCGCTGCT GGAGGGGTTC
CGGCAGAGCT ACCCGGGCGT CTCGGTCTCC TACGCCAAGC TGAACTCGTC GAAGCTCTAC
GAGGAGTTTC TCGCCGAGGC CGGGGCGGGA GGCGGTACGG CCGACATCCT CTGGAGCTCG
GCCATGGACC TCCAGGTCAA GCTCGTGAAT GACGGCTTTG CCGAGCGGTA CGTCTCCCGG
GAGGCGGAGG GGCTGCCTGC CTGGGCAGTG TGGCGCTACG AGGCCTACGG CGTCACCGCC
GAGCCGGTCG CGATCGCCTA CAACCGCAGC CTGCTTCCCG ACGAGCGTGT GCCGCGCAGC
CACGCCGACC TCGTCCGCAG CCTGACGCAG GACCCGGAGG CGTGGCACGG CAAGGTCGCC
ACCTACGACC CGGAGCGCAG CGGCGTCGGC TTCCTGTTCC TCACGCAGAA CCTCGCGGTG
ACGCCGCGGA CCTGGGACCT CGTGCGCGCC CTCGGGCAGG TCGGGGCGAA GCTCTACACC
ACCACGACGC ACATGCTCGA CCGCGTCGTC GCCGGCGAGG CGCTGCTGGC CTTCGATGTC
TTCGGCGCCT ACGCCCTGGA GCGGGCCAAG CAGGACCCGA GGCTCGGCGT CGTGCTTCCC
GCCGACTACA CGCTGATCAC CTCGCGCATC GCCTTCATTC CCAAGGCGGC CCAGCACAAG
GCGGCAGCCC GGCTCTTCCT CGACTACATG CTCTCATGGG AGGGGCAGGC GCGGCTCGCG
GCCCGCTCGG TGACGCCCGC GCGCGCCGAC GCGCGCCAGC CCGACGACCC CGTCGCGGCC
GCCCCCCAGC CGATCGTGGT CGGGCCCGAA CTCCTCACCG CCCTCGACCA GATCAAGCGC
AGCCGGACGC TGAAGCAGTG GCGGCGGGTC ATCGAGGGAC GATAG
 
Protein sequence
MRILAAGLIA FGVLLASARA QVDAPVDVRR QDVAEAVREG ALVIRATTDE AEASALLEGF 
RQSYPGVSVS YAKLNSSKLY EEFLAEAGAG GGTADILWSS AMDLQVKLVN DGFAERYVSR
EAEGLPAWAV WRYEAYGVTA EPVAIAYNRS LLPDERVPRS HADLVRSLTQ DPEAWHGKVA
TYDPERSGVG FLFLTQNLAV TPRTWDLVRA LGQVGAKLYT TTTHMLDRVV AGEALLAFDV
FGAYALERAK QDPRLGVVLP ADYTLITSRI AFIPKAAQHK AAARLFLDYM LSWEGQARLA
ARSVTPARAD ARQPDDPVAA APQPIVVGPE LLTALDQIKR SRTLKQWRRV IEGR
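
The gene begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as an initiation codon. A minimal Python sketch of that translation follows, applied only to the first row of the gene sequence above. The function name `translate_cds` and the constants are hypothetical helpers for illustration, not part of any database tooling; the codon assignments are those of the standard genetic code, which table 11 shares.

```python
from itertools import product

# Amino-acid assignments of the standard code (shared by table 11),
# with codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by table 11; at position 0 they encode Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record (60 nt, 20 codons).
first_row = "GTGCGCATCC TCGCGGCCGG ATTGATCGCC TTCGGCGTCC TGCTCGCCTC CGCCCGGGCC"
print(translate_cds(first_row))  # prints MRILAAGLIAFGVLLASARA
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same function should yield the full protein, with the terminal TAG read as the stop codon.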