Gene M446_4043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4043 
Symbol 
ID6128945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4510187 
End bp4511233 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content68% 
IMG OID641644198 
Productextracellular solute-binding protein 
Protein accessionYP_001770838 
Protein GI170742183 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0937845 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCTCG ACCGCCGCCG CCTCCTCACC ACCGCCCTCT CGCTCGGCGC GATGGAGCTC 
TTCCCCGGCC TCTCCCTCGC GCAGGCGAGG CCCCTCGTCT TCGCCACCTT CACGGGGAGC
TGGGAGGAGG CGCACAAGGC CGTGCTGGTC CCCGCCTTCC GCAAGGAGAC CGGCAACGCC
CCGATCGTCC TCGACCCGAT GCTGTCCGTC GACCAGATCG CCAAGGTCTC GGCCGCCCGC
GCGAACCCGC CGATCGACGT GATGCTGCAC GATCCGGGCC CGGCGCTCAC CGCCCAGGCG
CAGGACCTCG TCGAGCCCTA CCCGGTCGAG CGCAGCGCCT CCTTCAAGGA CCTCATCCCG
GACGCGCAGG AGGCGACCGG CCCGGCGGCC TTCTTCCAGG TCGTCGGTCT GACCTACAAT
CCCGACACGG TGAGGACGAA GCCCACCTCC TGGGCCGATC TGTGGCGGCC CGAATACAAG
GGCCGGGTCG GCATCACCAA CATGAACTCG ACGCTCGGCA CCGGCTTCAT GGTCGAGATC
GCCAAGATGC ACGGCGGCTC CGAGGCGAAC ATCGATCCGG CCTTCAAGGC CATGGAGGCG
CTCAAGCCCA ACCTCTCGGC GGTGGCGGCC AATCCGGGGG CGCTCGCCAC CCTGTTCCAG
CAGGGCCAAG TCGACATTTC GCCCGGCAAC TTCAACGCCA TCCAGATCCT CAAGGCCAAG
GGCGTTCCGG TCGAGTTCGT GGCGCCCAAG GAGGGGGCGA TCGCCTTCAA GACCGCGATC
CAGATCGTCA GGAACTCGCC CAACCGCGAC CTCGCCTTCA AGCTGATCGA GGCGGCGATC
TCCGAGCCGG TCCAGACCCG GCTGATGCAG GCCCCCTACC TGATCGTGCC GACCAACGCC
AAGGTGACGA TGAGCGGCGA GATCGCCCAG GTGCTCGCCC GGGACACCGA CGACCTGCGC
AGGAAATTCG TGTTCCAGGA CTGGAAGGCC ATCAACGCGC AGCGGGCGGC CTGGATGGAG
CGGTTCAACC GCGAGATCAA GCTCTAG
 
Protein sequence
MILDRRRLLT TALSLGAMEL FPGLSLAQAR PLVFATFTGS WEEAHKAVLV PAFRKETGNA 
PIVLDPMLSV DQIAKVSAAR ANPPIDVMLH DPGPALTAQA QDLVEPYPVE RSASFKDLIP
DAQEATGPAA FFQVVGLTYN PDTVRTKPTS WADLWRPEYK GRVGITNMNS TLGTGFMVEI
AKMHGGSEAN IDPAFKAMEA LKPNLSAVAA NPGALATLFQ QGQVDISPGN FNAIQILKAK
GVPVEFVAPK EGAIAFKTAI QIVRNSPNRD LAFKLIEAAI SEPVQTRLMQ APYLIVPTNA
KVTMSGEIAQ VLARDTDDLR RKFVFQDWKA INAQRAAWME RFNREIKL