Gene Cphy_0894 details

Gene Information

Locus tag: Cphy_0894
Symbol:
ID: 5741766
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 1144874
End bp: 1145986
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 35%
IMG OID: 641292006
Product: extracellular solute-binding protein
Protein accession: YP_001558018
Protein GI: 160879050
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID:
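
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and are not part of this record or of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(1144874, 1145986)
print(length, protein_length_aa(length))  # prints "1113 370"
```

Under these assumptions the result agrees with the listed Gene Length (1113 bp) and Protein Length (370 aa).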


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAT CTAAACTTTT TGTAGCGCTC CTTCTTGTAA GCTCACTGGT ATTCTCCGGT 
TGTGGCTCCA AGGAAGACAA AGGTAATAAT ACTACTCCGG GAAATGGAGG GGAAAATCCA
GGAAACAGCA GTTCTTCCAA TAAAGGCGAA GTCTATGTTT ACAACTGGGG TGAATATATT
GACCCTGATG TTAAAAAAAT GTTTGAAGAC GAGACTGGTA TTAAATTAAT CTATCAGGAA
TTCGAATTAA ACGAGGACAT GTATCCTATC ATTAAAACCG GTGCAGTAAA TTATGATGTT
GTTTGCCCTT CTGATTATAT GATCGAGAAA ATGATTCAGG AAAACTTACT TGCTGAAATT
AATTTTGATA ATATTCCAAA CATTACAAAC ATCGATGAAA TGTATTTAAA AACAGCGGAA
AGTTTTGATC CAGGCAATAA ATACAGTGTT CCTTATTGTT GGGGTACGGT TGGTATTCTT
TATAACAAAA CTATGGTAGA CGGACCAGTT GATAGTTGGA GCGTATTATT TGATGAAAAG
TACAAAAATG ATATCTTAAT GATTGATAGC GTTCGTGATG CTTTCATGGT AGCATTAACC
TATCTTGGTT ACGACCAAAA CACAACAGAT GAGAAAGAAT TAGATGCTGC TAGAGATTTA
TTAAAAAAAC AGTATCCATT AGTTCAAGCA TACGTTGTTG ACCAGGTTCG TGACAAGATG
ATTGGTGAAG AGGCCGCTCT TGGTGTTATC TACTCTGGTG AAGCAATTTA CACAAAACGT
GAGAATGAAA ATCTTGAATA TGTGGTGCCA AAGGAAGGCT CTAACGTTTG GATTGATGGT
TGGGTAATTC CTAAGAATAG TAAGAATAAA GAAAATGCAG AAGCATGGAT TAACTTTATG
TGCCGTCCTG ACATTGCATT AAAGAACTTT GAATATATTA CTTATTCTAC ACCAAACAAA
GCAGCCAGAG AATTAATTGA AGACGAAGAC ATTAAGAACA GCCAAGTTGC TTTCCCTGAT
GCGTCTATAC TTGATCGCTG TAAGTCTTTC AAATATCTTG GCGAAGATAT GGAAAATATC
TATGTGAAAA AGTGGAATGA TGTAAAATAT TAA
 
Protein sequence
MKKSKLFVAL LLVSSLVFSG CGSKEDKGNN TTPGNGGENP GNSSSSNKGE VYVYNWGEYI 
DPDVKKMFED ETGIKLIYQE FELNEDMYPI IKTGAVNYDV VCPSDYMIEK MIQENLLAEI
NFDNIPNITN IDEMYLKTAE SFDPGNKYSV PYCWGTVGIL YNKTMVDGPV DSWSVLFDEK
YKNDILMIDS VRDAFMVALT YLGYDQNTTD EKELDAARDL LKKQYPLVQA YVVDQVRDKM
IGEEAALGVI YSGEAIYTKR ENENLEYVVP KEGSNVWIDG WVIPKNSKNK ENAEAWINFM
CRPDIALKNF EYITYSTPNK AARELIEDED IKNSQVAFPD ASILDRCKSF KYLGEDMENI
YVKKWNDVKY
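
The gene and protein sequences above are printed in blocks of ten characters. As a rough sketch of how such a block-formatted sequence could be cleaned, checked for GC content, and translated, the code below uses only the Python standard library. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard NCBI amino-acid assignment, which translation table 11 shares for elongation codons; alternative start codons are not handled here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence shown above.
print(translate("ATGAAAAAAT CTAAACTTTT TGTAGCGCTC"))  # prints "MKKSKLFVAL"
```

Passing the full gene sequence to `translate` and dropping the trailing `*` should give a string comparable to the protein sequence listed above, and `gc_content` can be compared with the GC content field.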