Gene Cphy_1118 details


Gene Information

Locus tag: Cphy_1118
Symbol:
ID: 5741953
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 1410194
End bp: 1411819
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 37%
IMG OID: 641292223
Product: extracellular solute-binding protein
Protein accession: YP_001558235
Protein GI: 160879267
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
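
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon (not translated)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1410194, 1411819)
print(length)                  # 1626
print(protein_length(length))  # 541
```

Both results agree with the Gene Length and Protein Length values listed in the record.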


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000575789
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA CGGTAACATT ACTGTTGGTT CTGACCATGG TGGTAAGCTT ATTTGCAGCA 
TGTGGTAAGA AAAATGGATC AAGCGAAACC GGCACAAAAG ATCCTGTGGC AACAAGCGGT
GCAAAAGAAC CTGACAAACA AGATCCAGGC AATAAAGAGC CTGAAAAACA AGACCCTGTT
AAAATCAAGA TTTATTACTC TGATAATGCA ACCTTACCAT TTAAAGAAGA TTGGTTAGTT
ATAAAGGAAG CTGAGAAGAG ATTTAATGTT GATTTCGATT TCGAAGTAAT TCCAATTGCA
GATTATCAAA CAAAAGTTTC TTTAACATTA AATACAGGAA ATAACGCTCC AGATGTCATC
CTTTATCAGT CAACGCAGGG AGAGAATGCA TCTCTTGCTC TAAATGGTGC TCTAGTACCA
ATCAGTGACT ATGCTGAATA TACACCTAAC TTTAATGCAC GTGTAGAAGA GTTTGGTCTA
ACTGGTGCTA TAAACAGATT AAACCTCGCA GACGGAAAAC GTTATTATAT GCCTGGCTTA
TTTGACGTTC CTTTCTATGA TGGTGGACTT ATTTTAAGAG AGGATTTCTT AGAGGCAGAA
GGATTAGCTG TACCTAAGAC ATTTGATGAT TTATATAATA TCTTAAAGGC ATACAAAGCA
AAGAATCCTG ATTCTTATCC TTTAACTATC TTAGCTGGTC CTCGTGTATT ATACCGTATG
ACAATGCCAT CCTTTGGTGT TAGTTTAGGT AAGAACGGAG CTGGCGGAAC GAATACCTTA
AGTTGGGATT ATGAAAAGGG CGAATATTTT GAAGGTGCTA TCAGTGATGG TTATAAACAA
TATATTAGCT ACCTTGCAAA ACTTTACAAT GAAGGATTAC TTGATCCTGA AATGGCAGAC
CCAATCGATG GCGATAAATG GTCTCAAAAG ATGGCAAGCG GAAAATCTAT GGCTACCTAT
GCATACTATG ACCAGATTGG TGGTGTAAGT GCTTCTACTG AAATCGAAGG CTTTAAATTA
CAGATGTACC CATCATTAGA GGGACCTGCT GGTGCTCATC ATCAGCAAAA GAACCGTACT
GGTTCCGGTA TTATGTTCCC AGCAGCTACT GCACAAAGAA AAGACTTTGA AAGAGTTGTG
AGAACAATTG ATGAAGTATT CTTCTCCGAA GAAGGTGCTA AATTATGGTG CTTAGGAGTA
GAAGGCGTAA CATATACAGA AGAAAACGGA GTAATCAAAT ATTCTGATGA GTTAGTAAAT
TCAGCAGAAG GTGTTTATAA AACACTTCAA GTAAAATACG GCTGTGGTTC TGACGTTACC
CAATTAGTAT GGGTTAACGA ACGTGAAATG ACAAAATATG ATGAGAATTA TGCACGTATC
AATAAAGAAG TTGCTGCTAT GGGAGATGTT ATTCAACAGA TACCTCCAAC ACCATTATTT
GATGATATGA AAGCAGAAGA TGCGGGCGTT TTGCAAACTC CATTATTTGA TACCTTCAGT
GTATGGGCAG ACGCATTTAT AACTGGTAAG AAGAGTGTAG ATAATGATTG GGATGCTTAT
GTAAATGAGA TGAAAACATT AAAAATTGAC GAATTCTGTA AGATTTATAA TGATAATCTT
AACTAA
 
Protein sequence
MKKTVTLLLV LTMVVSLFAA CGKKNGSSET GTKDPVATSG AKEPDKQDPG NKEPEKQDPV 
KIKIYYSDNA TLPFKEDWLV IKEAEKRFNV DFDFEVIPIA DYQTKVSLTL NTGNNAPDVI
LYQSTQGENA SLALNGALVP ISDYAEYTPN FNARVEEFGL TGAINRLNLA DGKRYYMPGL
FDVPFYDGGL ILREDFLEAE GLAVPKTFDD LYNILKAYKA KNPDSYPLTI LAGPRVLYRM
TMPSFGVSLG KNGAGGTNTL SWDYEKGEYF EGAISDGYKQ YISYLAKLYN EGLLDPEMAD
PIDGDKWSQK MASGKSMATY AYYDQIGGVS ASTEIEGFKL QMYPSLEGPA GAHHQQKNRT
GSGIMFPAAT AQRKDFERVV RTIDEVFFSE EGAKLWCLGV EGVTYTEENG VIKYSDELVN
SAEGVYKTLQ VKYGCGSDVT QLVWVNEREM TKYDENYARI NKEVAAMGDV IQQIPPTPLF
DDMKAEDAGV LQTPLFDTFS VWADAFITGK KSVDNDWDAY VNEMKTLKID EFCKIYNDNL
N
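
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is a rough illustration, not the database's own pipeline. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons). The `translate` helper is a hypothetical name, and it expects only A, C, G, and T, with whitespace ignored.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base);
# '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGAAAAAAA CGGTAACATT ACTGTTGGTT"))  # MKKTVTLLLV
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to the same function should yield the complete protein, under the stated assumptions.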