Gene Cphy_0385 details


Gene Information

Locus tag: Cphy_0385
Symbol:
ID: 5742253
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 491883
End bp: 494183
Gene Length: 2301 bp
Protein Length: 766 aa
Translation table: 11
GC content: 35%
IMG OID: 641291497
Product: AraC family transcriptional regulator
Protein accession: YP_001557511
Protein GI: 160878543
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
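
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 491883
end_bp = 494183

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is the stop codon,
# which is assumed not to be counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2301
print(protein_length)  # 766
```

Under these assumptions the computed values agree with the listed Gene Length (2301 bp) and Protein Length (766 aa).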


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTATTAC GCAATTTTAA GTTGAAAAAA TCAAGTGTAT ATGTGGTTTT TTTCTTTTCG 
TATATCTTAA TTTTAATTGT TGCTTTAAGT TGTGGTTTTG CTTATTATGC ACAGATAACG
GCGAAAATTA CAAAACAAAC CGAGAATACG AAGCAGTTAT TATTAACAGA ACTCCGTACC
AGTGTGGAAT CAAGTACACA ATACATTGAG GAATTATGCA ATGAATTCAC GTTTAATACG
AAGCTGGAAC AATTTGCGAA AGGATTACCC TCTGTTACAC TGCATGAGGC AATGCAGGAG
TTGACATTGC GACAAAAGCC AGGAGCCCTT CTATTCGACT ATTTTCTGTA CATAAAAGAA
ACAGATGAGA TTATCACACC AAACATACGG ATGAAGGCAG ACAAATTTTT TGATATTATG
TATTCCTTTG AGGATATAGA TTATAAAGAA TTTGCAGAAC AATTAAAAGG AAATTATTTT
AAAAATTATT TACCAATAAT GAAAGTAAAT CAATATGACA AAAATCCGGT GGAGATATTG
CCTTTGATCC AAACATTCCC GGTTGGAACC AAGCAGAAGC CACTTGGTCA AGTCATCATA
TTCATTAATG CGGAAAAATT ATTTTCAATG GTGGAACAGT TCCATGTTGC GACAGAATCA
GATGTCTATG TCTATAATAA AAACAATGTG TGTATATTGT CTAGTTCCGG GGCCCCTGAG
TTACCGGTAG AACTAGTCAC TGATAAATCA GCAGGTAATA ATACAAAGGA CAGTATTGTA
TTTCGACAAT TATCAGATAC CTTAGGCTGG AAGTTCATCA TTTCTACACC AAAGGATTTA
TTCTATTCCG AAAACTACGA ATTTTTACTT CGGATGGTTG GGGTTGCAGT TGTGTATCTG
ATAATCGGTA TCGTCTTAGT TTCTTTCCTC GCGGAATACA GCTATAAACC AATTCGTGAA
ATTCGAAAAT ATATTGATAA GAATACGAAA GTGGACGGAA TCGTAAAAAA TGAATATGAG
GTTATTAAGA CTACCTTAAA AGAGCAAATT ACCAATGGAA GAGAATTATC AGATCTACTT
GAAAGTCAAA AGCCTATTGT AAAGCGTGAG GTACTTGGTA AGTTACTGCA TGGAATGGTT
ACCGATTATA GTAGCTTAAA GGAGCGTTTT GTTCCGCTTG GAATCACTTT TACAACGAAT
CTATTTTATA CGGTTGCCCT TGAAGTATCT GAGGGATGTG AATTCTTTCG CATGGATAAA
GCAGAAAATG ATCAGAGTAT GATTCTAGCA AAATTTATCT CAGAGAATGT CGGAGTGGAT
TTATTTCAAG AGAAATTTAA CTACTACTAC CTTGATATGG GGCAAAATAC AGCGGTTCTA
TTGTTAAATC TTCGTAAGGA AGAAGAACTT GAAGAAGCAG ATGAGATCGT GTATCAAAAA
GTATCGGAGC TAATTCAATT TCTGAAAAAG TATTATTCTC TATATATTTA TACTGGCATT
AGTAAACAGC ACCATACGCT AAAAGGAATT CAGCTTTGCT TTGACGAATC CAGAAAGGCA
CTGGAGCAAC ACAAGCTATA CGGAGGATTT GAACCATATT GTTTCTATGA ATTAGAGAAT
CTGGAAGCGG ATTATTATTA CCCGAGTGAG ATGGAATATC AATTATTACG CTATATTAAA
ACTGGAGAAA GTGATAAAGC AAAACAATTA TTACAAAGTA TTCTTGATAT CAATGCACGT
AAGAAAAAAA TTTCGACTAG TGCAGCAAGA GGACTATTAT TTGAAATTTC AATCAGCTTA
AAGAAGCTCT TTGATGGTGC GATGATATCA AGAGGTAAGG AGACAAGTAA CCTTCTTGAT
TTAGATAATT ATCTAAAGAC ACCGAATCTT TTGGTTGCAT TCGATGATTT TTGTTCTTTG
ATTGATTGGT ATCAGAAAGA ACGAGCAGAG GTACCGGTTA GTAATAAAAC AAAAAGATTG
GTGGACTCCA TAGCAGAGTT TATTGAACAA AATATTGGTG AGAACTGGCT TGACTTAAAT
GGACTTTCTT CAGAGTTCGG GGTAACACCT CAGTATATCT CAAACATATT TAAGAAGTAT
AAAGACGAGA ACATTAAGGA CTATATTTCG AAGCTTCGAT TGGCTAAGGC AAAGGAATTA
TTAAGAGATA CGGAACTTCC GATTAAAGAA ATAGCAAATC AACTGGGTTA TGTAGGTGAA
ATTGGGGTAA TACGTTTATT TAAGAAGTAT GAGGGGATTA CTCCGGGAGA TTATCGTAAT
CAGATTCATA ATGTAAATTA A
 
Protein sequence
MLLRNFKLKK SSVYVVFFFS YILILIVALS CGFAYYAQIT AKITKQTENT KQLLLTELRT 
SVESSTQYIE ELCNEFTFNT KLEQFAKGLP SVTLHEAMQE LTLRQKPGAL LFDYFLYIKE
TDEIITPNIR MKADKFFDIM YSFEDIDYKE FAEQLKGNYF KNYLPIMKVN QYDKNPVEIL
PLIQTFPVGT KQKPLGQVII FINAEKLFSM VEQFHVATES DVYVYNKNNV CILSSSGAPE
LPVELVTDKS AGNNTKDSIV FRQLSDTLGW KFIISTPKDL FYSENYEFLL RMVGVAVVYL
IIGIVLVSFL AEYSYKPIRE IRKYIDKNTK VDGIVKNEYE VIKTTLKEQI TNGRELSDLL
ESQKPIVKRE VLGKLLHGMV TDYSSLKERF VPLGITFTTN LFYTVALEVS EGCEFFRMDK
AENDQSMILA KFISENVGVD LFQEKFNYYY LDMGQNTAVL LLNLRKEEEL EEADEIVYQK
VSELIQFLKK YYSLYIYTGI SKQHHTLKGI QLCFDESRKA LEQHKLYGGF EPYCFYELEN
LEADYYYPSE MEYQLLRYIK TGESDKAKQL LQSILDINAR KKKISTSAAR GLLFEISISL
KKLFDGAMIS RGKETSNLLD LDNYLKTPNL LVAFDDFCSL IDWYQKERAE VPVSNKTKRL
VDSIAEFIEQ NIGENWLDLN GLSSEFGVTP QYISNIFKKY KDENIKDYIS KLRLAKAKEL
LRDTELPIKE IANQLGYVGE IGVIRLFKKY EGITPGDYRN QIHNVN
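
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch only, not the method used to produce this record. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon-to-amino-acid assignment in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGTTATTAC GCAATTTTAA GTTGAAAAAA"))  # MLLRNFKLKK
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the protein sequence and the GC content field listed in this record.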