Gene Cphy_3420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3420 
Symbol 
ID5743703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4200152 
End bp4201297 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content35% 
IMG OID641294532 
ProductROK family protein 
Protein accessionYP_001560512 
Protein GI160881544 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000111395 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTTTAG GTAGTAAAGA ACTTATACGA GATATTAATA GTAAGCTAGT GCTTGAAACT 
ATTATACAAA ATGAGCCTAT CTCTCGTGCT GCTATCTCGA AGAAATTAGG ACTTACCAAA
GCCACTATTT CTGCTATCGT TAGTGATTTT ATTAATGATA AATTAGTTGT GGAAATCGGA
AGTGAAGATA CGGGGCTCGG TCGTAAACCA ATTCTATTAA GTTTTCATAA AAAAGCTGGT
TATGCAATTT GCGTTGATAT TGAAGTTTCT AGAATCTCCT GTATGATCTC AGATCTAAAA
GGGGAACAGC GCAGTGTCAA ACAAATCAAA ACACCTTCAG AATCAAACTT AGTACCAGTG
TTAATTGATC TCATACAATC TATGGAATCT GAATACGAAA AAACACCTTA TGGTCTCATT
GGTATTGCAA TTGGTATCCA TGGTGTTGTC CATCAAAACG AGGTATTATT TACCCCTTAT
TACAATTTAA ATGGCATTAA CCTCTCAGAA CAATTAGGTA GCTATTTTGG TGTCCCTGTA
TTCTTGGAAA ATGAAGCAAA TCTTTCTGCA CTAGGAGAGA AAGCTTATCT CCCAGAAACT
TATACTTCAC TTGCAAACCT TAGTATTCAC TCTGGAGTAG GTTTAGGTAT TATATTAAAT
AAACAATTAT ACACTGGATA TCATGGAAAT GCTGGTGAAT TTGGTCATAC TATCGTAGTT
ATGAATGGCC GTACCTGTCC TTGTGGAAAT CAAGGCTGCC TCGAACAATA TGCTTCGGAA
CGTGCGCTAT TAAAAGAGTA TAGTGAAGAT GCTTCACTAG ATGATTTGTT GTTAGCTTAT
GAGAAAAAAG AACCGAAGGC TATGGGAGTG ATGGAGCATT TTGTAAACTA CATGGCAGTT
TGCGTGAATA ATCTACAAAA TACAATTAGC CCAGAGATTA TCATCATTAA TAGTGCATTT
ACCAATGCTT ATCCGGAATT AGCTGAGATG ATTGCAAAAA AAGTTAATAA CAAGTTAGTT
GATAAAATTC CATTGATAGC CTCCAACTTA AAAGATCATT CCATACTACT TGGTGGTATT
ACGGTATTGG TTAAAAACTT CCTTGGCATT AAAAATCTAA GCTTAGGAGT CAAGAATCCA
TCATAA
 
Protein sequence
MVLGSKELIR DINSKLVLET IIQNEPISRA AISKKLGLTK ATISAIVSDF INDKLVVEIG 
SEDTGLGRKP ILLSFHKKAG YAICVDIEVS RISCMISDLK GEQRSVKQIK TPSESNLVPV
LIDLIQSMES EYEKTPYGLI GIAIGIHGVV HQNEVLFTPY YNLNGINLSE QLGSYFGVPV
FLENEANLSA LGEKAYLPET YTSLANLSIH SGVGLGIILN KQLYTGYHGN AGEFGHTIVV
MNGRTCPCGN QGCLEQYASE RALLKEYSED ASLDDLLLAY EKKEPKAMGV MEHFVNYMAV
CVNNLQNTIS PEIIIINSAF TNAYPELAEM IAKKVNNKLV DKIPLIASNL KDHSILLGGI
TVLVKNFLGI KNLSLGVKNP S