Gene Cphy_3590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3590 
Symbol 
ID5742615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4434607 
End bp4435923 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content37% 
IMG OID641294702 
Productextracellular solute-binding protein 
Protein accessionYP_001560678 
Protein GI160881710 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000122441 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAGAA AATTGTTAAG TGTAACTCTT ATGATTGCAA TGGTAGCTTG TCTATTTACG 
GGATGTTCCA AAAACGATGG CTCGTCAAAG ACAGGTAGTA ACAATGCAGG TTCAAAAACG
TCAGACCAAG TAACACTCAC GGTTTGGTGT TGGGATCCGA AATTTAACTG CTTTGCTATG
GATACTGCTG GCGAAATTTA TGCAAAGGAT CATCCAAATG TAAAGGTTGA AGTAATTGAA
ACTGCTTGGA ATGATATCCA AACTAAATTG ACTACAGCAG TTACCGGGCA GTCGAACACG
TTACCTGATA TTATATTAAT GCAGGACAAT GCATTAGCTA AGAATATCAT TAACTATCCG
GATGCGTTTT TTGATCTAAC AAATTCCGGT ATTAAGTTTG ATCAATTTGC ATCGTTTAAA
ACTGCATTAG GAACTGTAAA TGGGAAAAAT TATTCTGTTC CTTTTGATAA TGGCGCAGCA
ATTACCTGTT ATAGGACAGA TATCTTAGAA CAAGCGGGAT ACACAATAGA TGATTTTACG
AATATTACAT GGAAAGAATT TATTGAAAAA GGTAAAGTTG TTCTTGAAAA GACAGGAAAG
CCTTTATTAT CGAATCAAGC AGGTTCTCCA GACCTCTTAA TGCTTATGAT GCAGAGTGCA
GGTGCTTGGG TACTTGATGA GAATGGTAAA CCAAACTTTA AGGATAACGC AGTTTTAAAA
GAAGTTATCG ATACATATGT AGAATTAGAA AAATCCGGTG TACTTGTTGA AGTAAATGAC
TGGGATCAAT ATGTAAGTAG TATTAATAGT ACAACAGTAG CTGGTGCAAT GAACGGATGT
TGGATTATTG CTACGGTAAC AAGTGCAGCA GATCAATCTG GTTTATGGGG AGTAACCAAT
ATTCCTAAAT TATCATGTGC GGGTGCTACA AATTATAGTA GCCAGGGTGG CTCTTCTTGG
CTAGTATGTG CGAATTCTAA AAATAAAGAT GTAGCAGCAG ACTTTTTAGG TGCAACATTT
GGTGGTAGCG TGGAATTATA TGAAACAATT CTTCCATCTT CCGGTGCATT AGCAACTTAC
TTACCAGCTG GGGAAAGTGC TGCTTATGCT AAACCACAAG ATTTCTTTAG GGGAGATACC
ATTTACTTAA AGATAACAGA GTATGCTTCT AAGGTGCCTC AGGTTTCTTA TGGCGTATAC
AACTATGAAG CAAGGGATGC TATCGGTACT GCGATTACAA AGATTGTAGC AGGTACGGAT
TATAATACTG CAATCAGTGA AGCTCAGAAA GAATTAGAAT TCCAGATGGG TCAATAA
 
Protein sequence
MKRKLLSVTL MIAMVACLFT GCSKNDGSSK TGSNNAGSKT SDQVTLTVWC WDPKFNCFAM 
DTAGEIYAKD HPNVKVEVIE TAWNDIQTKL TTAVTGQSNT LPDIILMQDN ALAKNIINYP
DAFFDLTNSG IKFDQFASFK TALGTVNGKN YSVPFDNGAA ITCYRTDILE QAGYTIDDFT
NITWKEFIEK GKVVLEKTGK PLLSNQAGSP DLLMLMMQSA GAWVLDENGK PNFKDNAVLK
EVIDTYVELE KSGVLVEVND WDQYVSSINS TTVAGAMNGC WIIATVTSAA DQSGLWGVTN
IPKLSCAGAT NYSSQGGSSW LVCANSKNKD VAADFLGATF GGSVELYETI LPSSGALATY
LPAGESAAYA KPQDFFRGDT IYLKITEYAS KVPQVSYGVY NYEARDAIGT AITKIVAGTD
YNTAISEAQK ELEFQMGQ