Gene Cphy_0931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_0931 
Symbol 
ID5741803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp1186734 
End bp1188368 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content39% 
IMG OID641292042 
Productextracellular solute-binding protein 
Protein accessionYP_001558054 
Protein GI160879086 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0552135 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAA ATTGGATGAA AATAGCAGCG ATGGGAATGA GTATCGTGCT AGCAGCAGGA 
GCTTTGACAG GTTGCTCTAG AGGAAATAGC AACAAAGAAG ATTCCAAGGT AGAAGAACAA
GGAGTAGATA AGGGTGGACA AGATGTTTCT AAAGAACCTG TAACTATTGA GTGGTTGGCA
TATAATACAT ATTCGCAACC GAATACAGAT ACTGAAATAG TAAAACAAAT TGAGAAAAAG
TTTAATGTTA AATTTGAATT TTGGTACGTG GATGACCAGA AGTGGGATGA AATTCTTGGT
GCAAAACTAT CTTCAGGAGA TATGCCAGAT GTCATGAAAA TAAAGAACAC TGCCAATATC
CCAACTTATG TAAAACAGGG AATTCTTGCA GAATTTACAG ATGAAATGTT GGCTAAGATA
CCATCCTTTA CAAAACAGGT TGAGGAAGCC AATGTAGAAG GAAATGGTCT TATTGATGCA
TATTATGATG GTAAGAGGTA TGCGATTAAA ACACCTTCTA TTTCTGGAAC ATATCCCACT
GTTTTGGTAT GGAGAACAGA TTGGTTAAAG AATCTTGGCA TAGAGAAGAT ACCAGCTACT
ATTGATGAAA TGGAAGAGGC CATGTATGCT ATTCGCAACA ATGATCCAGA TGGTAATGGA
GTAAAAGATA CCTATGGAAT GTCCAATACA GCTATGAATG CAGTATTCGG TGCTTACGGT
GCCATTCCGT TGAAGGAATT TAGAGGGACA GGAGCGCAGA ATCTTTTCTT TACAGAAAAA
GATGGTAAAA TTGAATTTGC GTGTACACAG CCGGAGATGA AAGCGGCACT TGCTACAATT
CAAAAGTGGT ATAAGGAAGG TTTAATTGAT CCAGAATTCA TCACAGGAGA GAATACAGCT
GGATACTGGG CAACTTCACA AGCATTTGAA AATGGAAAAG TGGGAGTGAC AGGAATGGCA
TTGGCTTCGC ACTGGGCGCC ACCAGTAGAA GAAGGAAAAA AGGGTGGAGC ATGTTATGAG
GGATTTGTGG CAATGAATCC AGATGCAAAA TGGGAAGAAA CTGTTAACAT AGGACCAGCA
ATTCAGGGGC CAGAAGGAAA ATCAGGGACA CACACTTGGG GAGCTTTCAG CCCTTCTGGA
TTTGGTATAA CCACAAAATG TGCAGAGGAT CCACGAAAAG TAGATGCAAT ACTAGCCATG
ATTGAAGCAT ACTCCTCAGA TCCAGAATAT GCGCTATTAG CAGGCTGGGG AATCGAAGGT
ACACACTATG AGAAAACCGA AGAAGGCGGT GTACGACGTC TTGAACCATT TACGAAACCA
TCCGAATATA TACAAGATGG AGTTGGGGTT TTTATGCTTG GAACTAACAC TGAATTTGAT
AGAAGCTTGA GCAAAAACGT ATTTGATTTT AGTGATAAGT ATAAGACACC TGGATATCAG
GATATTTTAG TACCAGCAAC AGAGGCAGCA AATCAGTACT TAACTGATTT GAAGATTTTT
ACACTAGATG CTTATATTAA GATAATGACA GGCGAAGAAA GCGTTGATTA TTTTGATACC
TTTGTAAAGG AGTTTAACTC CATGGGTGGA GAACAAATTC TAAATGAAAT CAATGCAGAA
ATAGCAAAAA ATTAA
 
Protein sequence
MRKNWMKIAA MGMSIVLAAG ALTGCSRGNS NKEDSKVEEQ GVDKGGQDVS KEPVTIEWLA 
YNTYSQPNTD TEIVKQIEKK FNVKFEFWYV DDQKWDEILG AKLSSGDMPD VMKIKNTANI
PTYVKQGILA EFTDEMLAKI PSFTKQVEEA NVEGNGLIDA YYDGKRYAIK TPSISGTYPT
VLVWRTDWLK NLGIEKIPAT IDEMEEAMYA IRNNDPDGNG VKDTYGMSNT AMNAVFGAYG
AIPLKEFRGT GAQNLFFTEK DGKIEFACTQ PEMKAALATI QKWYKEGLID PEFITGENTA
GYWATSQAFE NGKVGVTGMA LASHWAPPVE EGKKGGACYE GFVAMNPDAK WEETVNIGPA
IQGPEGKSGT HTWGAFSPSG FGITTKCAED PRKVDAILAM IEAYSSDPEY ALLAGWGIEG
THYEKTEEGG VRRLEPFTKP SEYIQDGVGV FMLGTNTEFD RSLSKNVFDF SDKYKTPGYQ
DILVPATEAA NQYLTDLKIF TLDAYIKIMT GEESVDYFDT FVKEFNSMGG EQILNEINAE
IAKN