Gene Cphy_2012 details


Gene Information

Locus tag: Cphy_2012
Symbol:
ID: 5743040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 2484588
End bp: 2485658
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 37%
IMG OID: 641293109
Product: periplasmic binding protein/LacI transcriptional regulator
Protein accession: YP_001559119
Protein GI: 160880151
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
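
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: they are 1-based and inclusive.
START_BP = 2484588
END_BP = 2485658


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Both values agree with the Gene Length and Protein Length fields of the record.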


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGCAA AAAAAGTTAT CGCGATTTCC TTGACGGCAG TTATGATGTT AGGTCTACTG 
TCTGGGTGTA AGAAGGAGGA TTCTAATGAC AAAGCTACAA GCGGTTCCGG TAAAGAAATG
ACGGTTGAAA TTGTAGCAAA AGGTTTCCAA CATGACTTTT GGCAAGCAGT ATTAGCAGGA
AGTAAGAAAG CAGAAAAAGA ATTCAATGTT AAAACAAATT TCGTGGGTCC AGAGGGTGAA
GGTGCAATTG CAACGCAAGT AGAACAGATT AACAATGCAA TTAATAAAAA ACCTTCTGCA
ATATGTCTTG CAGCTCTTGA TACAAATGCA GCTTTAGATG CACTTAGTCA AGCTAAGTCA
CAAGGTATTC CAATCATTGG TTTTGACTCT GGTGTACCAG GTGCACCAGA AGGTTCTGTA
AAGGCAAATG CAGCAACAGA TAATTATGCA GCTGGTGAAT TAGCAGCTAC AAAAATGTAC
GAAGCAATTA AAGATAAGGT AACTAATCCA TCCAATGTTG TTCGTATTGG TGTAGTGTCC
CAAGAAGCTA ACTCTGATTC CATCATCAAA CGTACATCAG GTTTTGTTGA CAAGATGAGT
TCTTTAATTG GTGAAAATAA CTCCTGCGTA GAAGGTCATG ATAAGTATAA TCGTAAATCA
GATGGTGCGA AAGTTATCAT TGAAGTGCGT ATTCCAGCAG AAGTTACTGA TAACGCAGGT
AAGACAGAAG CATTAACATT ATTAAACAAA GAAGACTTAG TTGCTATTTA TGGATCTAAT
GAATTTGCAG CAAAATGTAT TATTAACGCA AATGAAGGTT TAAATAAGCT TGGTGAAGGT
AAAGTTATCG CAGTTGGTTT TGACTCTGGT GCACTTCAGA TTGATGCTAT TAAGAATAAG
GTATTCTATG GTTCTGTTAC ACAAGATCCA GTTTCCATTG GATATAATGC AGTACGTTTA
GCTGTTGCAG CAGCAAAGGG TCAGAAAGTA GAAGATGTGG ATACTGGTTG TCAGTGGTAT
AATTCAGAAA ATTATAATTC AGCCGATATT GCACCTTGTT TATATCAGTA A
 
Protein sequence
MRAKKVIAIS LTAVMMLGLL SGCKKEDSND KATSGSGKEM TVEIVAKGFQ HDFWQAVLAG 
SKKAEKEFNV KTNFVGPEGE GAIATQVEQI NNAINKKPSA ICLAALDTNA ALDALSQAKS
QGIPIIGFDS GVPGAPEGSV KANAATDNYA AGELAATKMY EAIKDKVTNP SNVVRIGVVS
QEANSDSIIK RTSGFVDKMS SLIGENNSCV EGHDKYNRKS DGAKVIIEVR IPAEVTDNAG
KTEALTLLNK EDLVAIYGSN EFAAKCIINA NEGLNKLGEG KVIAVGFDSG ALQIDAIKNK
VFYGSVTQDP VSIGYNAVRL AVAAAKGQKV EDVDTGCQWY NSENYNSADI APCLYQ
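
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the Python below shows one way to strip that formatting, compute a GC percentage, and translate a nucleotide sequence. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one; translation table 11 uses the same codon-to-amino-acid assignments and differs only in the start codons it permits, which this sketch does not model.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# shared by translation table 11). '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, as formatted in the record.
head = clean_sequence("ATGAGAGCAA AAAAAGTTAT CGCGATTTCC")
print(translate(head))  # MRAKKVIAIS
```

Applied to the first thirty bases, the sketch returns the first ten residues of the listed protein sequence. Passing the full gene sequence through `clean_sequence` and then `gc_percent` or `translate` gives values that can be compared against the GC content and protein sequence in the record.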