Gene Cphy_3920 details


Gene Information

Locus tag: Cphy_3920
Symbol:
ID: 5742044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 4814910
End bp: 4816214
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 33%
IMG OID: 641295034
Product: hypothetical protein
Protein accession: YP_001561006
Protein GI: 160882038
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
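
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4814910
end_bp = 4816214

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1305
print(protein_length_aa)  # 434
```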


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000253725
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGAAAA AGTTAAAACT TTCTGGAATA AACAAGAAGA AAAAAGCTGT AGTTCATGAA 
GAGACCGGTA AAATTAAGGT AATAGTTCTA AAGGAAGAGA AGGTGTCTGA AGAACAGGAA
AAAGAAATAG TAGAGCGTCC TAAGGATTAC AAATTCCGAT TCGAGCCAAC CAAAAAGTAT
TTTACCATTA GTGTATATGC ATTTTGTGTC ATTGCACTTT CTATTATATT TAAGTTAGCA
TTAGAAAATA TTCCTTCTTT TTCAACAGGC CTAAAAAACT TTGGGGTAGT GATACAACCA
TTTATCGCTG CTTTCTTTAT TGCTTTTATT GTAAATCCAA TTGTGAAAGC GTTAGCGGAT
CGATTTTATG GCAAAATGCT AAATATAAAG AAACCAAAAG TCTGTTTGGC TCTAGGTATT
GCTACCACTT ACATTATAAT TATTGCAGTC ATAGTAGTAA GTTTTACTTA TATCATACCG
CAGGTTACAA ATAGCATAAC GGAATTAGTT TTAAATAGCG ATGCTTTATA TAAAAAGGGT
GAAACCTGGT TAAATGATAT TGGTGAAAAG TTTCCAATAC TTGATACGGG ATATATACAA
GATAAAATTG AAGCATCTTT ACCACAATTA CTTTCTTTTG GAACAGACTT TGTTAAAAAT
GTATTACCAA AGATATTAAA TGTCTCAATT TCAATTGCGA AGACAGCGAT TAATATTTTA
TTATCAATTG CGATCTCTAT TTATATGCTA TATGATAAAA GGATGTTATC AAAAAATGCA
GCACGTATAA TCTATGCATT TATTCCGAAA AAGAAAGCAG ATTCCTTCCT TGATGTAACA
AGGGAATGTG GATCAATCTT TACAGGTTTT ATTGTTGGAA AGACGATTGA TTCCACCATT
ATTGGTATTC TTTGCTTTAT ATTAATGTCA ATCCTTAGAC TGCCATATGC GATCTTAATC
AGCGTTATTG TAGGAGTTAC GAATATGATT CCTTACTTTG GACCATTTAT CGGAGCGGTA
CCAGGAATAC TTTTATTTTT ATTTATCAGT CCTATTCAAG CATTAGTCTT TGCGATTATG
ATACTTGGCC TACAGCAGTT TGATGGCTGG ATTTTAGGAC CTAAGATTCT TGGTGATTCA
ACTGGGCTAA CTCCTTTATG GGTTATTTTT GGTATTACAG TCGGAGGTGC TTACGGCGGT
GTAATAGGAA TGTTTTTAGG AGTTCCTTTC GTTGCAGTGA TTGCTTATCT AGCAGGTATG
TTTATTACGG GACGTTTAAA GAAACGCAAT ATAGAAATAC GATAA
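
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as a string in the 10-base blocked layout used here; `gc_content` is a hypothetical helper name, not part of any database tooling.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters (block separators, line breaks)
    are ignored, so a blocked sequence can be passed in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Usage on the first 10-base block of the gene sequence only.
print(gc_content("ATGGCGAAAA"))  # 40.0
```

Passing the full 1305 bp sequence instead of the first block gives the value to compare against the GC content field.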
 
Protein sequence
MAKKLKLSGI NKKKKAVVHE ETGKIKVIVL KEEKVSEEQE KEIVERPKDY KFRFEPTKKY 
FTISVYAFCV IALSIIFKLA LENIPSFSTG LKNFGVVIQP FIAAFFIAFI VNPIVKALAD
RFYGKMLNIK KPKVCLALGI ATTYIIIIAV IVVSFTYIIP QVTNSITELV LNSDALYKKG
ETWLNDIGEK FPILDTGYIQ DKIEASLPQL LSFGTDFVKN VLPKILNVSI SIAKTAINIL
LSIAISIYML YDKRMLSKNA ARIIYAFIPK KKADSFLDVT RECGSIFTGF IVGKTIDSTI
IGILCFILMS ILRLPYAILI SVIVGVTNMI PYFGPFIGAV PGILLFLFIS PIQALVFAIM
ILGLQQFDGW ILGPKILGDS TGLTPLWVIF GITVGGAYGG VIGMFLGVPF VAVIAYLAGM
FITGRLKKRN IEIR
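
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration rather than the database's own method: it builds the codon table from the standard compact layout (bases ordered T, C, A, G), whose amino acid assignments translation table 11 shares with the standard code (the two differ in permitted start codons). The function name `translate` and the choice to render unknown codons as `X` are assumptions made for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (block separators, line breaks) is removed first.
    Codons with characters outside A/C/G/T are rendered as 'X'.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Usage on the first seven codons of the gene sequence.
print(translate("ATGGCGAAAA AGTTAAAACT T"))  # MAKKLKL
```

This gene starts with ATG, so no special handling of alternative start codons is needed here; a gene beginning with GTG or TTG would need its first residue set to M separately.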