Gene Cphy_3039 details


Gene Information

Locus tag: Cphy_3039
Symbol:
ID: 5743365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 3715605
End bp: 3717011
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 37%
IMG OID: 641294140
Product: putative aminopeptidase 1
Protein accession: YP_001560135
Protein GI: 160881167
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1362] Aspartyl aminopeptidase
TIGRFAM ID:
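
The coordinate and length fields in this record are mutually consistent, which a short check can make explicit. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3715605
END_BP = 3717011
GENE_LENGTH_BP = 1407
PROTEIN_LENGTH_AA = 468


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming the gene ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1407, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 468, matches Protein Length
```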


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0109252
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGAGA AAAAAAGTGT TGAGAATTTA AAAAATAACG AATCCAACAA GAATGCTTGG 
TTAAAGTATA CGGATGCAGA TGTAAAAAAA GTTCAGAAAT TAAGTGAAGG TTATAAAGAA
TTTATCTCTG ATTGTAAGAC AGAGCGTGAA TGTGCACTAG AAGTGATTCG TCAGGCTGAA
AAAGAAGGCT ACAAAGATTT GATGACTTTA ACAAAAGTTA AGTCTGGTGA TAAAGTTTAC
TTTAATAACA TGGATAAAGC GGTTGCATTA TTCTTAATTG GTAGTGAGCC TTTAGAAAAA
GGTATGAAGT TACTTGGAGC TCATATCGAC TCTCCACGTA TCGACTTAAA GCAGGTTCCA
TTGTATGAAG ATACAGAAAT GGCATTATTT GATACTCACT ACTATGGTGG GATTAAGAAA
TATCAGTGGG TAACCTTACC ACTTGCTATC CATGGTGTTG TTGTAAAGAA GGATGGAACA
AAAGTAAACA TCGTAATTGG TGAAAAAGAC AGTGATCCAG TAGTTGGTAT TACTGATTTA
CTAGTTCATC TGTCCGCAGA ACAGATGGAA AAGAAGGCTG CTAAGGTAGT AGAAGGAGAA
GACCTAAATG TTCTTATCGG TAGCCAGCCA TTAAAAGGTG AAGAAAAAGA AGCAGTAAAA
GGGAATATGC TTAAACTGTT AAAAGAGTTT TATGACATTG AAGAGGCAGA TTTCTTATCC
GCTGAGTTAG AAGTTGTTCC TGCTGGTAAA GCACGTGACT TTGGTATTGA TCGAAGCATG
GTAATGGGAT ATGGTCAAGA CGATAGAGTT TGTTCTTACA CTTCTATGAA AGCGTTATTT
GAACTTGATA AAATAGACCG TACTGCAGTT TGCTTATTAG TTGATAAAGA AGAGGTTGGT
AGTATCGGTG CAACTGGTAT GCATTCCAAA TTCTTTGAAA ATGCAGTTGC AGAATTAATG
GATAAGATGG GTGAATATTC TGAACTTAAA TTAAGAAGAG CGTTCCAGAA TTCTCATATG
CTTTCTTCTG ACGTAAGTGC AGCATTTGAT CCTAACTATC CATCCGTTAT GGAAAAGAAA
AACTCTGCAT ATTTCGGAAG AGGTATTGTA TTTAACAAAT ACACAGGAGC AAGAGGAAAA
TCTGGTTGTA ACGATGCAAA TCCAGAGTTT ATTGCAGCTT TACGTGCAGC TATGGAAAAA
CACGACGTTA ACTTCCAAAC CTCAGAGCTT GGTAAAGTAG ACCAAGGTGG CGGTGGAACA
ATCGCTTACA TTATGGCTCA GTACAATATG GAAGTAATCG ATAGTGGTGT AGCAGTATTA
AATATGCATG CTCCATGGGA AATTACAAGT AAAGCTGATA TCTATGAAGC AATGAGAGGC
TATGAGGCTT TCCTTCGTGA GATGTAA
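
The gene sequence above is printed in space-separated blocks of ten bases, so it needs cleaning before any computation such as the GC content reported in the record (37%). The following is a minimal sketch of that cleaning step and a plain GC fraction. The helper names are hypothetical, and the record does not say how its own figure was computed or rounded, so this should not be read as the database's method.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# The first two blocks of the gene sequence, used as a small example.
sample = clean_sequence("ATGAAAGAGA AAAAAAGTGT")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.25
```

Passing the full gene sequence through `clean_sequence` first should give a string of 1407 bases, matching the Gene Length field.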
 
Protein sequence
MKEKKSVENL KNNESNKNAW LKYTDADVKK VQKLSEGYKE FISDCKTERE CALEVIRQAE 
KEGYKDLMTL TKVKSGDKVY FNNMDKAVAL FLIGSEPLEK GMKLLGAHID SPRIDLKQVP
LYEDTEMALF DTHYYGGIKK YQWVTLPLAI HGVVVKKDGT KVNIVIGEKD SDPVVGITDL
LVHLSAEQME KKAAKVVEGE DLNVLIGSQP LKGEEKEAVK GNMLKLLKEF YDIEEADFLS
AELEVVPAGK ARDFGIDRSM VMGYGQDDRV CSYTSMKALF ELDKIDRTAV CLLVDKEEVG
SIGATGMHSK FFENAVAELM DKMGEYSELK LRRAFQNSHM LSSDVSAAFD PNYPSVMEKK
NSAYFGRGIV FNKYTGARGK SGCNDANPEF IAALRAAMEK HDVNFQTSEL GKVDQGGGGT
IAYIMAQYNM EVIDSGVAVL NMHAPWEITS KADIYEAMRG YEAFLREM