Gene Cphy_2466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2466 
Symbol 
ID5742537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3027334 
End bp3028755 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content38% 
IMG OID641293556 
Productextracellular solute-binding protein 
Protein accessionYP_001559566 
Protein GI160880598 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000586108 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA TGAAAAAAGC ATTAGCTATG CTCATGGTTT TGACCATGGT TTGTGCTATG 
TTTGCAGCAT GCGGTAAGAA CGACCAAAAA GCAAACAAGG ACCAAACAAA CAAAGAGAAT
ACAAAAGATA ATTCAACTGA TGGTAAGAGT GGTGACAACA AAGAGAAGAC AACAGCTGGG
TCTACTGGCG GTAAGACATT AAGAATATAT TGTTGGAATA CTGAGTTCCA GGATAGATTC
AACGAGTATT ATGCTAGTAA AATTCCATCT GGCGTAACGG TCGATTGGGT TATCAATCCT
AATGAAGACA ATGTATACCA GACAAAGCTC GATGAAGCAT TACAAAAACA GGCCTCTGCT
TCACCGGAAG ATAGAATTGA CTTATTCCTG ATAGAAGCTG ACTATGCATT AAAGTATGTT
GGTACTGACT ATACTTTGGA TGTTGTGAAG GATATTGGCC TAACAGAAGA CGATCTTTCT
CAACAATATC AATACACAAA GGATGTTGTT ACGGTTGATG GCTCTCTTAA AGGTGTTTCA
TGGCAGTCTT GTCCAATGGG CTTTCTTTAT AGAAGATCCA TGGCTAAAGC AGTGTTAGGA
ACAGACGATC CTGATCAGGT TCAAGAGATG ATTTCAGACT GGACAAAATT TGATGCTGTA
GCTGCGAAGA TGAAAGATGC AGGTAACTTC ATGCTATCTG GCTATGATGA TGATTATCGT
GTATTTGCTA ACAACAAGAA ATTGCCTTGG ATTGATGATA ACAATAAGAT CGTAGTTGAT
GATGAGATCA AACAATGGGT ATCGCAGACT AAGACATATA CAGATAAAGG TTACAATAAC
AAAGCTAGCT TATGGTCAGC AGAGTCAACT GCTCAGATGG CAAAAGACGG TAAGGTATTT
GGCTACTTTG GACCGGCTTG GTTTATGGAT TTCTGCTTCA TGGATTACAC ACTTGATGAT
CCAAATCAGC CAAAAGAAAT TGGTAACGGT GGTTACGGTG ACTGGGCTAT GTGTAAAGGG
CCTCAGGGAT CTTACTGGGG TGGTACATGG ATTTGTGGTG CAGCAGGAAC AGATAATATC
GATATCGTAA AAGATATTAT GTTAACTATG ACATGCAATA AAGATACACT TGTTAAGATT
ACTAACAAAT TTGGTGATTT TACTAACAAT GTAGCAGCTA TGACAGAACT AGCTAACAGT
GATTTTGGAT ATCCTTTCTT AGGAGGTCAG AATCATATCA AAGTATTACT TGAATCTGCA
CAAGATATTC ATATTTCTGC AGCATCACCT TTTGATCAGA CTATGACTGA AAAACTTCAA
TTAGCAATGA AAGACTACTT TGAAGGAGTT GTAACAGAAC AGCAGGCTTG GGATAACTTC
TATACAGAAG TTTTAGGAAA GCATCCAGAA CTTAGTAAAT AA
 
Protein sequence
MKKMKKALAM LMVLTMVCAM FAACGKNDQK ANKDQTNKEN TKDNSTDGKS GDNKEKTTAG 
STGGKTLRIY CWNTEFQDRF NEYYASKIPS GVTVDWVINP NEDNVYQTKL DEALQKQASA
SPEDRIDLFL IEADYALKYV GTDYTLDVVK DIGLTEDDLS QQYQYTKDVV TVDGSLKGVS
WQSCPMGFLY RRSMAKAVLG TDDPDQVQEM ISDWTKFDAV AAKMKDAGNF MLSGYDDDYR
VFANNKKLPW IDDNNKIVVD DEIKQWVSQT KTYTDKGYNN KASLWSAEST AQMAKDGKVF
GYFGPAWFMD FCFMDYTLDD PNQPKEIGNG GYGDWAMCKG PQGSYWGGTW ICGAAGTDNI
DIVKDIMLTM TCNKDTLVKI TNKFGDFTNN VAAMTELANS DFGYPFLGGQ NHIKVLLESA
QDIHISAASP FDQTMTEKLQ LAMKDYFEGV VTEQQAWDNF YTEVLGKHPE LSK