Gene Cphy_2075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2075 
Symbol 
ID5744081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2559369 
End bp2560538 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content38% 
IMG OID641293172 
Productamidohydrolase 
Protein accessionYP_001559182 
Protein GI160880214 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.526521 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAAT TACTAATTAA AAATGGAACC ATTTACAATA GTACTGAAAT TATGCCTTTT 
CAGGCTGATA TCCTAGTCGA GAATGGTAAG ATATTAAAAA TAGAAGAGCA GATTACTGAA
ACGAAGGAAA TGAAAGTCAT TGATGCATTA GGTCTTTTTG TGTACCCGGG TTTAGTTGAA
GCACATTCCC ACATCGGTCT TGATGGTTAT GGTATAGGAT TTGAAGGCCA AGATTACAAC
GAGATGAATG ACATTTTAAC TCCGCATTTA AATGCGATCG ATGGTATTAA TCCTATGGAT
GTTACTCTTA AGAAAGCAGC CCTTGGTGGA GTGACCTGTG CTGCAACGGG ACCAGGAAGT
TCTAATGTAC TTGGTGGAAC ATTTACTGCG ATTAAGATGA CTGGCAATCG TGTAGATCGC
ATGGTAGTAA AAGAAAAGGT TGCTATGAAG TGTGCTTTTG GAGAAAATCC AAAGAGAGTC
TATAAAGATA AGAACAACTA TTCTAGAATG TCCACAGCCT CTAAACTTAG AGAAATGCTA
AATAAAGCGA AGGAATATCA AGCAAAGCTA GTTGCAGCTG GAGAAGATAT CTTTAAAAAG
CCTAGCTATG ATGCGAAACT AGAAGCTCTT TTACCAGTTT TAAATCGTGA CATCCCTTTA
AAAGCACATG CTCATCGTTC TGACGACATC TTTACAGCAA TCCGAATTGC AAAAGAGTTC
GATTTAAGAT TGACAATCGA ACATTGTACA GAAGGCCATC TTATATCAGA AGAATTACAA
AAAGATGGTT ATCCAGTAGC AGTTGGACCT TCCTTTGGTC ATGCAACCAA ATATGAGCTC
CGCAATAAGA CATTCGAAAC TCCTGGTATC TTAGCGAAGG CTGGTTTGCA GGTATCCATT
ATTACAGATA GTCCTGTTAT TCCTCAACAT TACTTGTCGT TATGTGCTGG TTTAGCTGTA
AAATCAGGAA TGGAGCCATT TGCAGCACTA CAAGCAATAA CCATTAATCC TGCAAAACAT
ATCGGTATTG AAGACCGTGT CGGCTCTCTT GAAGTAGGTA AGGATGCTGA TATTGTCATC
ACAGATGGTG ATATCATGGA TTCCATGACT TCAGTTCTAT ACACATTTAT CGATGGTAAT
GAGATTGATA GAACAGAGAA TTACTTATAA
 
Protein sequence
MSQLLIKNGT IYNSTEIMPF QADILVENGK ILKIEEQITE TKEMKVIDAL GLFVYPGLVE 
AHSHIGLDGY GIGFEGQDYN EMNDILTPHL NAIDGINPMD VTLKKAALGG VTCAATGPGS
SNVLGGTFTA IKMTGNRVDR MVVKEKVAMK CAFGENPKRV YKDKNNYSRM STASKLREML
NKAKEYQAKL VAAGEDIFKK PSYDAKLEAL LPVLNRDIPL KAHAHRSDDI FTAIRIAKEF
DLRLTIEHCT EGHLISEELQ KDGYPVAVGP SFGHATKYEL RNKTFETPGI LAKAGLQVSI
ITDSPVIPQH YLSLCAGLAV KSGMEPFAAL QAITINPAKH IGIEDRVGSL EVGKDADIVI
TDGDIMDSMT SVLYTFIDGN EIDRTENYL