Gene Cphy_2848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2848 
Symbol 
ID5742168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3490641 
End bp3492104 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content37% 
IMG OID641293944 
Productglycoside hydrolase family protein 
Protein accessionYP_001559947 
Protein GI160880979 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAGTA GGAGGAGACT TCTTAGTTGG AACTATAAAG TATTTACAGA GAAGGGAGCT 
AGTCAGATGA AGTATCAGAG TAATATGGTA TCAGATTTAC AAATAGCTTA TATTGGTGGA
GGTTCCCGAG GATGGGCATG GACTTTCATG ACTGACCTAG CAAGAGAGCC AAAATTATCT
GGTACTGTTA GGCTGTTTGA TATTGATAAA TCCGCAGCAG AACAAAATAT GTTTATTGGA
AATTCAATCA CACAAAGAGA AGATGCCATT GGAAAATGGA ACTATGAAAC AAAAGAAACA
TTAGAGGAGG CTCTAACCGG TGCAGATTTT ATTGTGATAT CGATATTGCC TGGGACTTTT
GATGAAATGG AATCTGATGT ACATACCCCA GAACGTTTGG GAATTTATCA ATCTGTTGGA
GATACTGCAG GTCCTGGTGG CATAATACGT GCACTTCGCA CCATTCCTAT GTTTGTTGAT
ATAGCAGAAG CAGTAAAAAA ATATGCTCCT AAAGCATGGG TAATTAATTA TACGAATCCA
ATGACTTTAT GTGTGAAAAC ATTATATCAT GTATTTCCAG AGATTAAAGC ATTTGGTTGT
TGTCATGAAG TATTTGGAAC TCAGAAAGTT TTAAAAGGGA TCGCAGAACA GGTGCTTGGT
ATTGAGGATA TACCAAGAAA TGAAGTTCAT GTTAATGTCT TAGGGATTAA TCACTTTACT
TGGTTTGATT ATGCATCCTA TCAAGGAATA GATTTATTCC CTATCTATAG AGATTATGTA
AAGGAACATT TTGAGGAAGG TTTTATAGAA AATGATGCAA ACTGGGCAAA TACTACATTT
GCATGTTCTC ATCGTGTAAA ATTTGATTTA TTCCAGAAAT ATGGATTGAT TGCAGCTGCA
GGAGATCGTC ACTTGGCAGA GTTTGTACCT GGCGATTGGT ACTTGAAAGA TCCGGAAAAC
GTAAAGAGCT GGAAGTTTGG ATTAACTACA GTAGATTGGA GAAAGGAAGA CCTTAAACAG
AGACTTGAGA AAAGCCATCG TTTAGTGAGT GGAGAAGAGA AAGTTGATTT AAAGGCATCT
GGAGAAGAAG GAATTTTATT AATAAAAGCG CTCTGTGGTT TAGAAAGAGT TGTAAGTAAT
GTAAATATTC CAAATACCAA CAGGCAAATA CCGAATATAC CAGATTCAGT GGTGGTTGAG
ACAAATGCTA TTTTTGAGAG GGATGCCATA CGTCCAATTA TTGCAGGGGA GATGCCAGAC
TCTATCTTAC ATTTAACCAT ACCACATATA CAAAACCATG AACTAGTATT AAAGGCTGCA
CTTACATGTG ATAAAGAGTT AGTAAAGCAG GCATTCGCCA ATGATCCATT AGTGAAAGGA
AGAGCTACTG CGGAGGAAAT TGATTTACTG GTAGAGGACA TGATTCAGGG ATCTATAAAA
TACTTGCCGG AAGGCTGGAA ATAA
 
Protein sequence
MTSRRRLLSW NYKVFTEKGA SQMKYQSNMV SDLQIAYIGG GSRGWAWTFM TDLAREPKLS 
GTVRLFDIDK SAAEQNMFIG NSITQREDAI GKWNYETKET LEEALTGADF IVISILPGTF
DEMESDVHTP ERLGIYQSVG DTAGPGGIIR ALRTIPMFVD IAEAVKKYAP KAWVINYTNP
MTLCVKTLYH VFPEIKAFGC CHEVFGTQKV LKGIAEQVLG IEDIPRNEVH VNVLGINHFT
WFDYASYQGI DLFPIYRDYV KEHFEEGFIE NDANWANTTF ACSHRVKFDL FQKYGLIAAA
GDRHLAEFVP GDWYLKDPEN VKSWKFGLTT VDWRKEDLKQ RLEKSHRLVS GEEKVDLKAS
GEEGILLIKA LCGLERVVSN VNIPNTNRQI PNIPDSVVVE TNAIFERDAI RPIIAGEMPD
SILHLTIPHI QNHELVLKAA LTCDKELVKQ AFANDPLVKG RATAEEIDLL VEDMIQGSIK
YLPEGWK