Gene Cphy_2038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2038 
Symbol 
ID5743066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2518593 
End bp2519861 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content36% 
IMG OID641293135 
Productamidohydrolase 
Protein accessionYP_001559145 
Protein GI160880177 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0984083 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATTA GATTCTATCA TGCTCGCATC GCTACTATGC AAAACGATTG TGGTATCATA 
GAAGGAGAGC TTTGGGTTAC AAATAATCGA ATCTCTTATG TAGGTACCGA AAGAGAAAGC
CAGATTTCAT GGGATAGAGA AATTGATTGT AAAGGGAATC TATTAATGCC AGGATTTAAG
AATACTCATA CGCATTCTGC TATGACATTT CTTAGATCCT ACGCTGATGA TTTACCATTG
CATGATTGGT TAAATAAACA GGTATTTCCG ATGGAAGCGA AATTATCACC AGATGATATC
TATCACTTAT CAAAACTAGC CATCTTAGAG TACCTAACTA GTGGTATGAC AGCAAACTTT
GATATGTATA TTACACCAGA TACGATAGTC CAGGCTTCGA TAGATACTGG ATTTCGAACC
GTTCTTTGTG GTGGAGTAAG TAACTTTTTG CATTCTGTAA CACAGGTTGA GGATTGGTAC
AAAAAATACA ATAATTATCA TGAGCTAGTT TCATTTCAAC TCGGTTTTCA TGCGGAATAT
ACAATAGATA GAGCGACACT TATGGATTTA GCTTCGTTAG CAAAACAGCT AAAAGCTCCA
GTTTATACCC ATAACTCAGA GACAAAAGCA GAAGTTGATG CATGTATATC AAGAAATCAA
ATGACTCCAA CTGCATATCT TGATTCCTTA GGTATCTATG ATTTTGGTGG TGGCGGATAT
CATTGTGTTC ATATGACCGA TGAAGACCTT TACATCGTAA AGAGAAGAGG AGTTTCAGTT
GTTACAAATC CTGGTTCTAA CACGAAATTA GCAAGTGGAA TTGCACGTAT TGAAGATATG
TTATCACTTG GAATTAATAT AGCAATCGGA ACAGATGGCC CTGCAAGCAA TAATTGTCTT
GACATGTTTC GTGAGATGTT CTTAGTTACA GGACTTTCTA AATTAAAGAA TGAAGATGCG
TCCTCAGTAG ATGCAAATGA AGTTCTTAGG ATGGCAACTG TAAATGGTGC AAAAGCGATG
TGTCTTACAG ACTGTGATTG TCTCGCTGAA GGAAAATTAG CAGATTTAAT CATGATTAAT
TTACATCAGC CAAATATGCA GCCAATGAAT AACATTACTA AAAACATTGT CTATAGCGGA
AGTAAAACCA ATGTTAAATT AACAATGGTC AATGGCAAGA TACTCTATGA AAATGGTGAA
TTTTTCGTAG GAGAAGATCC AGAGGCTATT TATGCGAAGG CGAATGAAAT AATAAATCGT
ATGAGATAA
 
Protein sequence
MNIRFYHARI ATMQNDCGII EGELWVTNNR ISYVGTERES QISWDREIDC KGNLLMPGFK 
NTHTHSAMTF LRSYADDLPL HDWLNKQVFP MEAKLSPDDI YHLSKLAILE YLTSGMTANF
DMYITPDTIV QASIDTGFRT VLCGGVSNFL HSVTQVEDWY KKYNNYHELV SFQLGFHAEY
TIDRATLMDL ASLAKQLKAP VYTHNSETKA EVDACISRNQ MTPTAYLDSL GIYDFGGGGY
HCVHMTDEDL YIVKRRGVSV VTNPGSNTKL ASGIARIEDM LSLGINIAIG TDGPASNNCL
DMFREMFLVT GLSKLKNEDA SSVDANEVLR MATVNGAKAM CLTDCDCLAE GKLADLIMIN
LHQPNMQPMN NITKNIVYSG SKTNVKLTMV NGKILYENGE FFVGEDPEAI YAKANEIINR
MR