Gene Cphy_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2023 
Symbol 
ID5743051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2497270 
End bp2499246 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content37% 
IMG OID641293120 
Productband 7 protein 
Protein accessionYP_001559130 
Protein GI160880162 
COG category[S] Function unknown 
COG ID[COG2268] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCAT ATGTCATAGC CATAATAATT GGAGGTATTC TTGTAGCTGT ATATGTACTA 
ATTTTCCAGA TCCTTGGGCT TCGAATGATT CGGTCAAATG AAGTTGCTGT CGTAGAAAAA
TGGTGGAGTA TTCATGGTTC ATTAAGAAAT TCAATCATAG CTCTAAATGG CGAAGCAGGT
TATTCTCCTG ATTTACTTCG GGGTGGTATT CACTTTCGTT CAACACTTAT GTATCGTATT
CATAAGTATC CATTAATCAC AATACCTCAA GGGCAAATTG CCTACATCTT CGCAAAAGAT
GGTATTCCAC TCTCACCAAC ACAGTCTTTG GGTAAAGTGA TACCAGAAGC AATTAATTTT
CAAGATGTTA GGGGCTTTTT AAAAAATGGT GGGCAAAAAG GTCCGCAGCG TGGGATACTA
AGAGAAGGTA CTTATGCGAT TAATCTAGCT CAGTTTGTAG TTATTACTCT GAATGAAATT
AAAGCTATTT TGAGTGGTAC TAAAAAAGAA CGATTAGATC TTGAGAGTAT GCAACAAACA
TTAATTCAAC GAAATGCTTT TGTTCCAGTT GTCATTATGG GAAAAGGAAA TGAAACCAGC
GATTTAATGG GTATCGTTAC TATTCATGAT AGTCTTCCGT TACAACAAGG AGAAATCATT
GCCCCATACG TTGGAGAAGG TCACTCAAGT TATCAAGATC CAGAAAAATT CATTGAGCTT
GGCGGACGTA GAGGAAAGCA AATCGAAGTC TTAACCGATG GAACCTACTA TATCAACCGT
TTATTTGCAA CCGTAGAATA TCGTCCAAAG ACTGTGGTAC CAATTGGTTA TGTCGGAGTA
GTCGTAAGTT TCTTTGGGAA ACAAGGAGTC GATACAACGG GTGCAGACTA TAGACATGGT
GAGCTTGTTG AAACCGGTTG TAAAGGTGTT TTACAAAAAC CATTAATGCC AGGAAAATAT
GCGTTTAATA CCGATGCAGG GAAAGTGGTT TTAGTACCTA CAACGAATAT CATATTAAAA
TGGAACCGTG GTGAAGTCGG AGAACACAAA TATGATCAAA ATTTATCGGA AGTTGATATT
ATCACAAAGG ATGCTTTCGA GCCATCTCTC CCACTTTCTG TGGTAATGCA TATTGATTAC
AAACAAGCAC CGTGGGTAAT ACAACGGTTT GGTGATATTA GCATGCTTGT TAACCAATCG
CTTGATCCAC TAGTTAGTGC ATATTTTAAA GACGTTGCGC AAACGAAAAC ATTAATAGAG
CTGATTCAAG AACGTAGTGC CATTAGAGAA CGAGCTGTTG TAGAAATGAA AGAAAAATTT
GAGAAATACA ATCTTCAATT GGAAGAGGTT TTGATTGGTA CTCCAAAATC ATCAAAAGAC
GATATACAAA TTGAAAATAT CCTAACACAG CTTCGAGAAA GACAAATTGC AGAAGAAAAG
AAAATTACAT ATCAAAAGCA ACAGTCCGCT GCGGAGAGTG AAAAATCACT TCGTGAAGCG
CAGGCAATTG CAGAACAACA AAGTTATCTA ACAAAATCTA AGATTCAAAT TGAAATAGAA
GGAAATAACG GTGCTGCTCT TGCAAGTAAG GCAGAACAAG AAGCAAATCA AATCATTGCT
CTTGCGAAAG CAAATGCTTC TAAAGTTCGT TTAGAAGGTG AAGCAGACGC ATCAAAAGAA
AGTAATATCG GATTAGCAAA AGCACAAGCA ATCGATGCTC AAGTAAAAGC ATATGGTGGA
TCCGAATATA GAATTATCCA AGAGATAACA GATAAATTAG CCGATGCTAT TAAGAATACC
CAAGTAGACA TTGTTCCAAA AACAATCGTT ACGATGGGAA ATTCAGGCGA AGATGGTGCT
TCCTCTAGTT CCACTATTTT AGATGCACTT CTTAAGCTTA TTACTATCGA TAAATTAGGG
ATTTCATTAC CAAAGATATC TGAGAGCGAA AAGGAACAAG TAACTATAAA AGATTAA
 
Protein sequence
MNPYVIAIII GGILVAVYVL IFQILGLRMI RSNEVAVVEK WWSIHGSLRN SIIALNGEAG 
YSPDLLRGGI HFRSTLMYRI HKYPLITIPQ GQIAYIFAKD GIPLSPTQSL GKVIPEAINF
QDVRGFLKNG GQKGPQRGIL REGTYAINLA QFVVITLNEI KAILSGTKKE RLDLESMQQT
LIQRNAFVPV VIMGKGNETS DLMGIVTIHD SLPLQQGEII APYVGEGHSS YQDPEKFIEL
GGRRGKQIEV LTDGTYYINR LFATVEYRPK TVVPIGYVGV VVSFFGKQGV DTTGADYRHG
ELVETGCKGV LQKPLMPGKY AFNTDAGKVV LVPTTNIILK WNRGEVGEHK YDQNLSEVDI
ITKDAFEPSL PLSVVMHIDY KQAPWVIQRF GDISMLVNQS LDPLVSAYFK DVAQTKTLIE
LIQERSAIRE RAVVEMKEKF EKYNLQLEEV LIGTPKSSKD DIQIENILTQ LRERQIAEEK
KITYQKQQSA AESEKSLREA QAIAEQQSYL TKSKIQIEIE GNNGAALASK AEQEANQIIA
LAKANASKVR LEGEADASKE SNIGLAKAQA IDAQVKAYGG SEYRIIQEIT DKLADAIKNT
QVDIVPKTIV TMGNSGEDGA SSSSTILDAL LKLITIDKLG ISLPKISESE KEQVTIKD