Gene Ccel_3331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3331 
Symbol 
ID7311902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3874463 
End bp3875383 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content41% 
IMG OID643610234 
Producthomoserine O-succinyltransferase 
Protein accessionYP_002507600 
Protein GI220930691 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1897] Homoserine trans-succinylase 
TIGRFAM ID[TIGR01001] homoserine O-succinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00132184 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGATCA GAATACCTGA CAATCTGCCT GCGGTGGAAA TCCTGAATAG TGAGAACATA 
TTTGTCATGT CTGAGGACAG GGCATACCAT CAGGACATAA GACCGCTTAA AATTGCTATA
CTAAATATAA TGCCGACTAA AATAACTACC GAGACGCAGC TGCTCAGACT TCTTGGGAAC
ACTCCAATAC AGGTCGAAAT CATACTGCTG CGTCCGGCAT CGCATTTGTC GAAAAATACT
CCCGAGGAAC ACCTTGAAAC ATTTTATAAA ACCTTTGACG AGGTACAAAG AGAGCATTTC
GACGGACTGA TTATTACGGG AGCACCTGTT GAACAAATGC CCTTTGAAGA TGTAAATTAT
TGGGAAGAGC TAAAACAGAT AATGGACTGG AGTAAGAAAA ACGTCTACTC TACACTGCAT
ATATGCTGGG GAGCACAGGC GGGTCTGTAT TATCACTACG GTGTACCAAA ACGTGATTTA
CCTAAGAAGG TGTTTGGTGT ATTTGAACAT AATAAAATAA AGGAGCATGT AAAACTTCTC
AGAGGATTTG ACGATATCTT TTTTGTACCA CACTCCAGAC ATACTGAGGT TACTCATGAA
GATGTGGCAA AAATAGATGA TTTGGAAATA CTTGCCGAAT CAAAGGAGGC TGGGGTATAT
CTGGTAGCAT CAAAAGACGG ACGTCATATA TTTGCTACTG GACATTCCGA ATATGACCCG
TATACCCTTA AATTTGAGTA CGAGAGAGAT TTGGAAAAGG GGTTGGATAT AGAGGTTCCC
AAAAACTACT TCCCCGGGGA TGATCCTGCA AAGGAGCCTA TTGTCAGATG GCGAGGCCAT
GGAAACCTTT TATTTTTAAA TTGGCTTAAC TATTATGTAT ACCAGGAAAC ACCTTACGAT
TTAACTAACA TAGGAAAGTA A
 
Protein sequence
MPIRIPDNLP AVEILNSENI FVMSEDRAYH QDIRPLKIAI LNIMPTKITT ETQLLRLLGN 
TPIQVEIILL RPASHLSKNT PEEHLETFYK TFDEVQREHF DGLIITGAPV EQMPFEDVNY
WEELKQIMDW SKKNVYSTLH ICWGAQAGLY YHYGVPKRDL PKKVFGVFEH NKIKEHVKLL
RGFDDIFFVP HSRHTEVTHE DVAKIDDLEI LAESKEAGVY LVASKDGRHI FATGHSEYDP
YTLKFEYERD LEKGLDIEVP KNYFPGDDPA KEPIVRWRGH GNLLFLNWLN YYVYQETPYD
LTNIGK