Gene Cphy_2569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2569 
Symbol 
ID5741847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3136132 
End bp3137703 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content38% 
IMG OID641293659 
Productextracellular solute-binding protein 
Protein accessionYP_001559669 
Protein GI160880701 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG TACTATCAAT TGTCCTGACA ATCGTGATGC TGACTGCACT ACTTTCTGGT 
TGTGGCAAAA CTAATGACAA AGGAGCTTCA AGTAAAAACG TAAATGGGGT TGATATATCA
AAGCCAGTTA CATTAACCTG GTATCTCCAT GGAAGTACTG TTACCGATGA TAAGGCTGTA
TTGGAAAAAG CTAACGCATA CCTAAAGGAT AAATTAAATG TAACCTTGAA GCCAATCTGG
GGTACATGGG GTGACTTCAA TGATAACGTA GTTCTCTCCA TTAGTGGCGG CGATGACGTG
GATATTTATT TCACTTGCTC CTGGACTCAG GATGAATACA ATGCATATGC AAGAAATGGT
GCTTGGCTTC GCCTTGATAA AGAGGGGAAT AACTTCATTG AAAAATATGC TAAAGACATA
TGGAAATTGC TTCCTGACGT TTTAAAAACA GGCGCTACCG TTCAAGGAAA TGATGGTTTA
GGTGTATATG CTATTCCAGG CTATAAGGAT AGCGCTACTC AGAACTGTTG GGACGTGAAT
GTAACGTTGT TAGAGAAATA CGGCTATACG ATTGATGATA TTAAGAATAC AGATTATTAT
GGCTTTGACA AGATTCTTAA GACTGTAAAA GAAGGAGAAG GTAATGACTT CTATCCGCTT
AATGTTGAGG GTATGGTTCT GGAAAGAATG GTAAATAATT CGATTATCGT TGCCGGTGAC
TCCGGAATCA GTAATATGCT TTCATACTAT ATGAATCCTA CAGATACTGC TAAAGAAGGT
ATCTATGGTA ATAAGATACT TAGTAAATTT GAGACACCTG AATATAAGAA ATTTGTTGAA
AAGACTCGTG AATATTATTC AGCTGGATAT ATTGATCCTA AGATGGCTAT TCGCAACCAG
GCAAATGATG CTCGTGTTGC AGCTCAGGAT ACAGGAAAAT ATCTCATAGG TACACAAAGC
TATTCTCTTG GATATGAAGC TGAGGCAAGT GCACGACGTG GAATAGATGT ACAAATGGTT
CCTGTCACAC CTGCTTATCT TGATACCGCA GTTTCTCAAG GTGCTATGAT GGCTATATCT
GCGGGTTGCA AGAATCCGGA AAGAGCACTC ATGTTCCTTA ATCTGTTAAA TACCGATCCA
TATCTTATGA CTTTATTAGA CTATGGTATT GAAGGTGTTC ATTATGACAT AGTGAATGGT
GGCGAAGCTC GACTTAATAA AGATGCTCGT ACCTCTTATT CTCCTTGGAC TAATGGTATG
GGTAATATTA CCCTTCTTCC TCCACTGGAA GGTCAGGGGC TTGATTTCTG GGATAATTTT
AAGGCTTATT ATGGAGGATG TAGTGAAGTT CCAATCCTTG GATATTGTTT TAATGCAACG
AGCGTTGAGA ATCAATTGGC TGCATTAGCA AATGTTGCTC AAGAATTTGA TCTTGCACTT
AATACCGGTT CTATTGATCC AGCAACAAAA TTACCGGAAT TTATACAGAA ACTAAAAGAC
AATGGTATTG ATCAAGTAGT AGCTGAGGCA AATACTCAAT TAGAAAACTT CCTTAAAGAA
AAAAATAAAT AA
 
Protein sequence
MKKVLSIVLT IVMLTALLSG CGKTNDKGAS SKNVNGVDIS KPVTLTWYLH GSTVTDDKAV 
LEKANAYLKD KLNVTLKPIW GTWGDFNDNV VLSISGGDDV DIYFTCSWTQ DEYNAYARNG
AWLRLDKEGN NFIEKYAKDI WKLLPDVLKT GATVQGNDGL GVYAIPGYKD SATQNCWDVN
VTLLEKYGYT IDDIKNTDYY GFDKILKTVK EGEGNDFYPL NVEGMVLERM VNNSIIVAGD
SGISNMLSYY MNPTDTAKEG IYGNKILSKF ETPEYKKFVE KTREYYSAGY IDPKMAIRNQ
ANDARVAAQD TGKYLIGTQS YSLGYEAEAS ARRGIDVQMV PVTPAYLDTA VSQGAMMAIS
AGCKNPERAL MFLNLLNTDP YLMTLLDYGI EGVHYDIVNG GEARLNKDAR TSYSPWTNGM
GNITLLPPLE GQGLDFWDNF KAYYGGCSEV PILGYCFNAT SVENQLAALA NVAQEFDLAL
NTGSIDPATK LPEFIQKLKD NGIDQVVAEA NTQLENFLKE KNK