Gene Cphy_1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_1074 
Symbol 
ID5741909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp1358682 
End bp1360004 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content36% 
IMG OID641292179 
Productextracellular solute-binding protein 
Protein accessionYP_001558191 
Protein GI160879223 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00779107 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA AATTAATTAG TATCTTGTTA TGTTTAAGCT TATGTATGGC TTTTTTTGCA 
GGTTGTAGTA AGCAAGACTC CAAGGGAAAA GTTTATTATC TAAGCTTTAA ACCGGAATCC
GATGAGATTT GGAAAGAGAT TGCTAAGGCT TATACCGAAG AAACCGGTGT TGAAGTTGTT
GTGAAAACAG CTGCAGCTGG TACTTATGAG CAAACATTAA AAGCAGAGAT TTCTAAGAGT
AATGCACCTA CCTTATTTCA AATCAATGGA CCTGTTGGAT ATCAGTCTTG GAAGGATTAT
TGTCTTGATT TACAAGGTAC TGATCTTTAT AGCTGGCTCC TAGATAAAAG TTTAGCTGTA
TCTTCTGGTG ATGGTGTTTA TGGTATTCCA TATGTGGTTG AGGGTTATGG AATTATTTAT
AATGATGCAA TAATGCAAAA ATACTTTGCT TTGTCTAATA AGGCAGTTAC CATCTCTTCT
GCAGCTGAAA TTACAAACTT TAATACACTT AAGGCAGTAG TTGAGGATAT GACTGCTAAG
AAGGCAGAAC TTGGAATTGA AGGTGTCTTT GCTTCCACTT CTTTTGCTCC AGGTGAGGAT
TGGAGATGGC AGACACATTT AGCCAATTTA CCAATCTATT ATGAATTTTT AGATAAGAAG
GTGTCAGATG CTGATACTAT CGATTTTACA TACTCTGATA ATTATAAGAA TGTTTTTGAT
TTATACATAA ATAATTCCTG TACGGATAAG GGAGTGCTTG GAAGTAAAAG TGTTGCAGAT
TCTATGGCTG AATTTGCACT TGGAAAAGTT GCAATGGTCC AGAATGGTAA TTGGGGTTGG
GGACAGATTA ATGGTGTAGA AGGTAACACA GTAAAAGAAA CGGATGTAAA ATTTCTACCG
ATCTATACTG GAGTAGCAGG GGAAGAAAAG CAAGGATTAT GTACTGGAAC AGAGAACTTC
TTCTGTATTA ACAGCAAAAC CTCAAAAGCT AATCAAGAAG CATCCATTGC CTTTATTGAA
TGGCTTTATA ACTCTGAAAA AGGAAAAGAC TATGTTACAA ATAAGTTAGG TTTTATCGCT
CCATTCTCAA CTTTTAAGGA AAATGAAAAA CCAACAGATC CATTAGCAAA AGAAGTACTC
CGCTATATGT CTGACACAAG TAAGGTATCC GTTGCTTGGA ATTTTACTGC ATTTCCAAGT
CAGGCATTTA AGGACTATTT TGGCTCCAGC TTATTACAAT ATGCTCAGGA TAAAGATACT
TGGCAGGATG TAAAGGATTC CGTAATTAAT TACTGGAAAT TAGAAAAAGA GGCTACAAAG
TAA
 
Protein sequence
MKRKLISILL CLSLCMAFFA GCSKQDSKGK VYYLSFKPES DEIWKEIAKA YTEETGVEVV 
VKTAAAGTYE QTLKAEISKS NAPTLFQING PVGYQSWKDY CLDLQGTDLY SWLLDKSLAV
SSGDGVYGIP YVVEGYGIIY NDAIMQKYFA LSNKAVTISS AAEITNFNTL KAVVEDMTAK
KAELGIEGVF ASTSFAPGED WRWQTHLANL PIYYEFLDKK VSDADTIDFT YSDNYKNVFD
LYINNSCTDK GVLGSKSVAD SMAEFALGKV AMVQNGNWGW GQINGVEGNT VKETDVKFLP
IYTGVAGEEK QGLCTGTENF FCINSKTSKA NQEASIAFIE WLYNSEKGKD YVTNKLGFIA
PFSTFKENEK PTDPLAKEVL RYMSDTSKVS VAWNFTAFPS QAFKDYFGSS LLQYAQDKDT
WQDVKDSVIN YWKLEKEATK