Gene Cphy_3858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3858 
Symbol 
ID5744810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4731867 
End bp4733216 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content39% 
IMG OID641294970 
Productextracellular solute-binding protein 
Protein accessionYP_001560944 
Protein GI160881976 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000022263 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT TACTTGCGTT ACTGTTAACT CTTACAATGG TAGTATCTAT GGCTGCCTGC 
ACTAAGAAAG AGGATCCAGC TACTAATACC CCTGGTAGTA GTACAGATAG CCCTAAAGCT
ACTGCTACAA CGGCTCCTAC GGCTGCACCT AAGAAAGTTA CATTAAATGT TACTACTACA
TATGCCGGAA ACGATGGCAA TGCTCAGAAT TACCAAGACG CTGTCGCTAA TTGGGAAAAA
TCAACTGGAA ACAAAGTGAA TGATTCCTCA TCAACTTCTG ATGAAACTTT TAAAGCTAGA
GTTTTAGCAG ACTTCGAAAC AGGATCTGAA CCAGACGTAT TATTCTTCTT TAACGGTGTG
GATTCCAATG CTTTCGTTCA AGCAGGCAAA GTAGTATCTA TCGATGAAAT TCGTCAATCT
TACCCTGACT ATGCATCTAA TATGAAAGAT GGCATGTTAG GCGCTTCTCC TGTTGATGGA
AAGAACTATT CCGTTCCTGT AAATGGTTAC TGGGAAGGTA TGTTTGTAAA CAAAGAAGTA
TGTAAGGCTG CTGGTGTTCA GATTCCAGAT GCTAACACTA CATGGGATCA GTTCCTTGAG
ACTTGTCAGA AGATTAAAGA TGCAGGCTTT GCTCCTATCG CTGTTTCTTT AGCAACTGTA
CCTCACTACT GGTTTGAGTT CTCTATTTAC AATTTCTTAT CACCATCAAC ACATAATGTT
CTTCCAAAGA ACACTACTGA TACTCAGGGA CAGGCATGGG TGAACGGTAT TAACGATATT
AAGATGCTTT ATGAAAAAGG TTATTTCCCA GAAAATACTT TAACAGGTAC TGACGATGAA
ACTGTTCAAT TATTTATTGA TAACAAAGCA GCATTCTTAA TCGATGGTTC TTGGAAAGTT
GGTGGAATCG AAGGTTTAAC AACCGATATT GATAATTTTA CTGTTACTTA TGTTCCAGGA
AAGGGCGACA GAAAGACTAC AGATATCATC GGTGGATTAT CTAGCGGATA TTTTATCTCA
AAGAAAGCTT GGGAAGATCC AGATAAGCGT GCTGCTGCTG TTGATTTTAT TACTTCCATG
ACAAGTGATG AGTTAGTTTC TAAATTCGCA CAAGTATCGG CTACAGCATT AAAGAATGGT
CCTACAGTAG ATGAATCTAA ATTATCATCT CTAGCTAAAG ATGGCCTTAA GTATGTAAAA
GGCGCTACTG GAATGGCTAG CGCTGTAGAA GATCAGGTTC CAAGAGAGTG TAGAGTTCCT
GTTTTTGATG GAATGCCTAA TTTAGTAACT GGTAAAAACG ATATCGCAGA AGCAATACAA
AGCGTTTTAG ATTTAACAGC TGCTCAATAA
 
Protein sequence
MKKLLALLLT LTMVVSMAAC TKKEDPATNT PGSSTDSPKA TATTAPTAAP KKVTLNVTTT 
YAGNDGNAQN YQDAVANWEK STGNKVNDSS STSDETFKAR VLADFETGSE PDVLFFFNGV
DSNAFVQAGK VVSIDEIRQS YPDYASNMKD GMLGASPVDG KNYSVPVNGY WEGMFVNKEV
CKAAGVQIPD ANTTWDQFLE TCQKIKDAGF APIAVSLATV PHYWFEFSIY NFLSPSTHNV
LPKNTTDTQG QAWVNGINDI KMLYEKGYFP ENTLTGTDDE TVQLFIDNKA AFLIDGSWKV
GGIEGLTTDI DNFTVTYVPG KGDRKTTDII GGLSSGYFIS KKAWEDPDKR AAAVDFITSM
TSDELVSKFA QVSATALKNG PTVDESKLSS LAKDGLKYVK GATGMASAVE DQVPRECRVP
VFDGMPNLVT GKNDIAEAIQ SVLDLTAAQ