Gene Cphy_1708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_1708 
Symbol 
ID5741539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2097665 
End bp2099221 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content37% 
IMG OID641292808 
Productextracellular solute-binding protein 
Protein accessionYP_001558819 
Protein GI160879851 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTAA GAAAAATGTT ATCAGTTCTT ATGAGTGCTA TGGTAGTAGT TTCTATGGTG 
GGGTGTCAAG GAAATAAGGG GGTTCAGGAT TCATCAAAAG GTACTAATGT ATCTGAAGTA
AGTGGCAATA ATAATAGTGA TTTGGAGTTT GTAGAATTAA ACTACTACGT ACCGGCAGCA
CAGGTGCCAG CTGGATTAAA GGATGCTCAA AAAGCTATCA ATGAATATTT AAAAGAAAAA
ATAAATGCTT CTATTAATAT TAATGTTATA GACTGGGGAT CTTATGGTCA AAAAATGAAT
GTTAAGGTGG CATCAGGTGA TGACGTTGAT ATTATGTGGA CTTCACAAAT GAGCGAGTTT
GGATATGGAA CCAATGCTAT GAAAGGCGCG TTCTTAGAAT CCGAGGAGTT ATTTAAACAA
TATGCCCCTA AAACATGGGA AATCATGGGA CAATACTGGG ATCAAGTACG AGTAAACGGA
AAAATTTACG GAATGCCAAA TCAATTAGGA TATGCCAAAG AAAATGGATA TCAGGTACGT
AAAGAATTTG CTGATAAATA TGGACTAAAT ACGGAATATA CCAACAAGAC AATTGCACCG
GGAGAGGCGC TTAGGGTAGA AGGATTGGAA CCTTATCTTG AACAAGTAAA GAAAAATAAT
CCGGATATGA TACCGTTATT ATGCGGAGTA AGTGGATTGG TTCCAGGAAA TTCAGCAGAA
GTATCTATGG GGTTATGGAA TATTAATAAC TTTGCATGTA CGGATATGGA AGATAATGAT
TTAACGGCAT TGAATTTTTT TGAAACAGCA GAGTATAAGG AAAGACTGGA ATTAGCGAGA
GATTGGTATT TAAAAGGATA CATAAATCCA GATGCTGCTA CTGTAACAAA TTGGACGCCT
TTGTTAAATC GTGCAGGTGC GGTTTATGGA GATGTTGGTG TCGGAACTGG AGTAGAAAAG
ATGCCAAGCT ATTGGCAGCC TTGGGGTGGA GAGATTGCGT ATATTCCTAC ATCTGAGACA
TTTACAGCCG CAACAGCAGC CCAGGTATCC GTATTGGCAA TAGGGAAAAA TTCTAAGAAT
CCTGAGCGTG CAGCCATGTT CCTAGAATTA TCCCATTCAG ATCCGGAGTT AGTACATTTA
TTATGCTATG GAGTGGAAGG TGTTAACTAT ACAAAAGATT TAGTATATTA TACACCGATC
AAAGGAAACG AGTTTGGATT GGACGATTGG ACGAATGTTA ACTGTCAACT AAAACTTTTG
GTTGAGGGTC AGCCGGTTGA TTCCATAGAA ACAGAAAAAG AACGTAATAA TAATGCAAAA
GCTCCTGGGT TTCAAGGTTT TGTCTTTAAT GAGGAACCTG TTAAAACAGA GATAGCAAAT
TGCAACTCTA TTATTAAAGA ATATACTCCA AGCCTTCAAA CAGGAGCAGT TGATATTGAT
CCTACGTTAC AAGAGTTTAA TCAAAAATTA AAAGCAGCAG GAGTTGACAA AATTATAATA
GAAACACAAA AACAAATTGA TGCATGGAAA ACAGCAACGG GATATACAGC TAAATAA
 
Protein sequence
MKLRKMLSVL MSAMVVVSMV GCQGNKGVQD SSKGTNVSEV SGNNNSDLEF VELNYYVPAA 
QVPAGLKDAQ KAINEYLKEK INASININVI DWGSYGQKMN VKVASGDDVD IMWTSQMSEF
GYGTNAMKGA FLESEELFKQ YAPKTWEIMG QYWDQVRVNG KIYGMPNQLG YAKENGYQVR
KEFADKYGLN TEYTNKTIAP GEALRVEGLE PYLEQVKKNN PDMIPLLCGV SGLVPGNSAE
VSMGLWNINN FACTDMEDND LTALNFFETA EYKERLELAR DWYLKGYINP DAATVTNWTP
LLNRAGAVYG DVGVGTGVEK MPSYWQPWGG EIAYIPTSET FTAATAAQVS VLAIGKNSKN
PERAAMFLEL SHSDPELVHL LCYGVEGVNY TKDLVYYTPI KGNEFGLDDW TNVNCQLKLL
VEGQPVDSIE TEKERNNNAK APGFQGFVFN EEPVKTEIAN CNSIIKEYTP SLQTGAVDID
PTLQEFNQKL KAAGVDKIII ETQKQIDAWK TATGYTAK