Gene M446_3858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3858 
Symbol 
ID6131997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4304013 
End bp4305089 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content66% 
IMG OID641644023 
Productextracellular solute-binding protein 
Protein accessionYP_001770665 
Protein GI170742010 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAAGC CGAGCGCACC CGGACTTGGT GTCGCGAGTG GAACGCGGCC GACGCGCCGC 
TCTCTCCTCG CCGGAGCCGC CGGATTCCTG GCGGCCCCGG CCGTCCTCAC CGGCAGAGCG
CGCGCGGACA CGACCCTGAC GGTGACCTGC TGGGGCGGCG ACTACCGGGC GGGCATCGAC
AGGATCTTCG CGCAGCCCTT CACGAAGGAA ACCGGGATCG CCGTCCGTCT CGTTGACAAT
GCCGACCTCG CTCGCATGAA GGCGCAGGTC CAGACCGGCC GCGTCGAATG GGACGTTTTC
GACAGTGTCG GACCGCAGAT CACGGCCGGC GCGAAGGAGG GCCTCTGGGA GGAGGTCGAC
GGCAAGATCG TGGACCGCTC GGACCTCACC GCTCCCGGCG GGCCGAGCTA TGTCGGGACC
TACCTGTTCG CGGGCGGGAT CGCGTACGAT CCCAAGCGGT TTCCCGAAGG CAAGTATCCC
GTCACCTTCA AGGATTTCTG GAACGTCGAC GGCTTTCCGG GCCGCCGCGG CCTGCGCACC
AGGGTGAGCG AGAACCTTGA GATCGCGCTG CTCGCCGACG GCGTCGCCCC GAAGGACCTC
TATCCGCTGG ACGTCGAGAG AGCCTTTCGG TTGCTCGATC AGATCAAGCC TGCCGTGAAG
AAGTGGATCG AGACCACGCC ACAATCGCTG TCTCTCGTCA CCACGAACGA AATCGACTTC
TCCTACACCT ACATGTCGCG CGTGCGGCCG GCGCAGCTGG CCGGGAGCTC CGTCTCCCTG
TCGACGCAGC AGACGCTCAA CTCCCTCGAA TATCTGGCCG TCGCCAAAGG CTCCCGCAAC
CGGGAGGCCG CGTTCCGCTA CATCGCGTTC TGCCTGAGGC CCGACCGCCA AGCGGCCTTC
GGCGAAATGC TGTTCTTCAG CCCAAATTCG CGCAAGGGAT TCGAGGCCTC CACCCCGGCC
GCCCGCCAGT ACATGCCCGA CATGGCGAGC CCGAAGAACG CGATCCTCAA CGACGATTGG
TGGGCGGACC GCTACACGCC GCTTCAGAAG CGCTTCACGG AGTGGCTCCT GGTCTGA
 
Protein sequence
MSKPSAPGLG VASGTRPTRR SLLAGAAGFL AAPAVLTGRA RADTTLTVTC WGGDYRAGID 
RIFAQPFTKE TGIAVRLVDN ADLARMKAQV QTGRVEWDVF DSVGPQITAG AKEGLWEEVD
GKIVDRSDLT APGGPSYVGT YLFAGGIAYD PKRFPEGKYP VTFKDFWNVD GFPGRRGLRT
RVSENLEIAL LADGVAPKDL YPLDVERAFR LLDQIKPAVK KWIETTPQSL SLVTTNEIDF
SYTYMSRVRP AQLAGSSVSL STQQTLNSLE YLAVAKGSRN REAAFRYIAF CLRPDRQAAF
GEMLFFSPNS RKGFEASTPA ARQYMPDMAS PKNAILNDDW WADRYTPLQK RFTEWLLV