Gene Cphy_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2049 
Symbol 
ID5743077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2529276 
End bp2531009 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content36% 
IMG OID641293146 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_001559156 
Protein GI160880188 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000035713 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAC TATATTGGAT TATCCTGATA GCAGTTGTTC TTGCATGTTC AGGTATTTTG 
ATGTCCCTCC ATCTGTCTGT AACGAAAGAG GAGATGACCT ACCCCTCTTT TTTAGATGCT
GTAAACCAAA ATCAGGTAGC ATCCGTTCAA TTTAAAGAAA ATGACAGTAC TTTAAAGGTA
ATTTTAAATT CAGATAAGGA TACAACTTAT ATTGTTCCGA ACCCAAAGAC AGAGAATTTT
ACAGAATTCT TATTACTTAG GAATATTAAA GTCGACAACA GTGACTCTTA TTCCGCAACT
AAGGTAATTC AGATCATACT AATAATTACT GTCGGAACAG GTGTGTTTTT ATTCATTAGA
ACAAGCGGTG GTAAAGATAA ACCTTTAATG AAAGATGCAG CAAAAAATAA AAAAGCTGAA
AATAGAGTGA AGCTTGGTGA TGTCGCTGGT AATGCTGAAG CAAAATCCAT GGTGGGTGAT
ATCATTGATT TTATTAAAGA ACCTGAAAAG TATAGTGCAC TTGGAGCCAG AATGCCAAAA
GGGGTAATGC TCTATGGCCC TCCTGGAACT GGAAAGACAT TAATTGCAAA AGCCATTGCA
ACAGAAGCTG GTGTACCTTT TTATGCTATG AGCGGTTCCG ATTTCGTACA GATGTATGTG
GGTGTCGGGG CAAGTCGTAT CCGTACATTA TTTAATAAAG CAAAAAAAAG TGAAAAAGCT
GTTATCTTCA TAGATGAAAT TGATGCAATA GGTAAGAAGA GAGCCAGAAG TACCTCAGCT
AGTAATGATG AACGCGACCA AACCTTAAAT GCCTTGTTAA CCGAAATGTC TGGTTTCCAT
GAAAATAAAG GTATCGTCGT TATTGGTGCA ACAAACCGTT TGGATACATT AGACGAAGCC
TTATTACGTC CAGGCCGATT TGACAGACAA ATTGAAGTTG GTTTGCCAGA TATACTTGCA
AGAAAAAAGA TACTTAAGCT ATATGGTGAT AAGAAACCAC TTGGTGATGA TGTTGATTTA
GAGGTACTTG CAAAAAATAC GGTGTCCTTT AGTGGTGCTA TGCTTGAAAA TCTTTTAAAT
GAGGCAGCCA TTCAAGCTGC GAATGAAAAA TCATCCTATA TTCAGTCATC ACATGTGGAT
AAAGCTTTCT ATACTGTAAT AGCAGGTAGC CCTTTACAAG ATCGAAGTTT TATTTCAGAA
AAAGATAAGA GTATTACTGC CTACCATGAA GCAGGACATG CTCTAGCAAC GAAATTATTA
CAACCAGAAC AATATATTTC AAAGGTTACC ATTATACCAA GCGTAAAAGG TGCGGGAGGG
TTTAATCTCT CAATTCCAAA GGATTCTTTA TATCAGTCCA AGCGGCAGAT ACTATGTAGT
ATCCAAATAT TATTGGCTGG AAGGGTAGCA GAGGAACTTA TTTTTGGTGA AGAAGAAATT
ACTACTGGTG CAAGCAATGA TATCCAAAAA GCATCTGCCA TGCTAGTTGA TTACCTAAAT
AAATACGGTA TGGATGATGA AATGGGTCTC TTTAGTACTG TAGTATTAGA AGATCAGTAT
GATACAGACT TTTTAAATAA ATGCCGTAAT CAGATGCATG CGTTATATGA CACTACAAAA
AAACTTATGA CTGAAAACAA AAAACTTCTT ATAGAAATTA CAAATGAACT TCTAGAAAAG
GAATCTTTGA AAGGTGAAGA TATCGATAGA ATTTGTTTAA AAGAAGCAGT ATAG
 
Protein sequence
MKKLYWIILI AVVLACSGIL MSLHLSVTKE EMTYPSFLDA VNQNQVASVQ FKENDSTLKV 
ILNSDKDTTY IVPNPKTENF TEFLLLRNIK VDNSDSYSAT KVIQIILIIT VGTGVFLFIR
TSGGKDKPLM KDAAKNKKAE NRVKLGDVAG NAEAKSMVGD IIDFIKEPEK YSALGARMPK
GVMLYGPPGT GKTLIAKAIA TEAGVPFYAM SGSDFVQMYV GVGASRIRTL FNKAKKSEKA
VIFIDEIDAI GKKRARSTSA SNDERDQTLN ALLTEMSGFH ENKGIVVIGA TNRLDTLDEA
LLRPGRFDRQ IEVGLPDILA RKKILKLYGD KKPLGDDVDL EVLAKNTVSF SGAMLENLLN
EAAIQAANEK SSYIQSSHVD KAFYTVIAGS PLQDRSFISE KDKSITAYHE AGHALATKLL
QPEQYISKVT IIPSVKGAGG FNLSIPKDSL YQSKRQILCS IQILLAGRVA EELIFGEEEI
TTGASNDIQK ASAMLVDYLN KYGMDDEMGL FSTVVLEDQY DTDFLNKCRN QMHALYDTTK
KLMTENKKLL IEITNELLEK ESLKGEDIDR ICLKEAV