Gene Cthe_0950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0950 
Symbol 
ID4811243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1139034 
End bp1140107 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content43% 
IMG OID640106369 
Productcarbamoyl-phosphate synthase small subunit 
Protein accessionYP_001037377 
Protein GI125973467 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0505] Carbamoylphosphate synthase small subunit 
TIGRFAM ID[TIGR01368] carbamoyl-phosphate synthase, small subunit 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCTA TTTTAGCTCT TGAAGACGGC ACAATTTTTC ACGGCGAAAG TTTCGGAGCT 
CAGGGAGAAG TAATCGGAGA GATTGTTTTC AACACAGGTA TGACGGGCTA TCAGGAGGTT
TTGACAGATC CGTCTTACTG CGGACAGATT GTTGCAATGA CCTATCCTTT GATAGGCAAT
TACGGTGTAA ACAGTGAAGA TATAGAGTCG GAAAAGCCAC AAGTAAAGGG TTTTATTGTA
AGGGAGCTTT GTCAAAACCC AAGCAACTGG AGAGCCGAGG AGACGTTAAA CAACTATCTG
AAAAGAAATA ACATAATAGG AATTGAGAAA ATTGATACCA GGGCTCTTAC GAGGATTTTG
AGGGAAAAAG GAACGATGAA GGGAATGATT TCAACGGATC CGAATTTCAA TCTTGATGAC
AAGATTGACG AAATAAAAGC TTATGTTATA AAGGATCCGG TTATGTGTGT CACAACAAAA
GAAGTTTTGC ATTATAAAGG TGACGGATTT AAAGTTGCAT TGATAGATTT GGGCTTAAAG
AAAAATATTG TGCGCTCCCT TTTAAAAAGA GGATGTGACG TGCATGTTTT CCCTGCCAAT
TCCAAAGCGG AGGACATCCT TGCGATTAAT CCCGACGGAA TAATGCTTTC AAACGGACCG
GGGGATCCGA AGGATTGTGT TGAGACAATT GAGACCATAA AGAAGCTTAT GGGCAAAAAA
CCCATGTTTG GCATCTGCCT TGGGCATCAG CTTACAGCCC TTGCCAACGG TGCCGATACC
GAAAAACTCA AATACGGCCA CAGGGGAGCA AACCATCCGG TGAAGGACCT CGAAAAGGAC
CTGACATATA TTACTTCCCA AAACCATGGC TACACTATTG TTGAGTCATC CATGGACAAA
TCAAGGATGA CGGTAAGCCA CAGAAACATG AACGACGGCA CTGTCGAAGG CGTAAGGTAC
AAGGATATGC CGGTGTTTAC CGTGCAGTTT CATCCGGAAG CCTCACCGGG GCCTGAGGAC
ACGGCTTATC TGTTTGACGA GTTTATTGAT ATGATGAAAA AATATTCGCG TTAA
 
Protein sequence
MKAILALEDG TIFHGESFGA QGEVIGEIVF NTGMTGYQEV LTDPSYCGQI VAMTYPLIGN 
YGVNSEDIES EKPQVKGFIV RELCQNPSNW RAEETLNNYL KRNNIIGIEK IDTRALTRIL
REKGTMKGMI STDPNFNLDD KIDEIKAYVI KDPVMCVTTK EVLHYKGDGF KVALIDLGLK
KNIVRSLLKR GCDVHVFPAN SKAEDILAIN PDGIMLSNGP GDPKDCVETI ETIKKLMGKK
PMFGICLGHQ LTALANGADT EKLKYGHRGA NHPVKDLEKD LTYITSQNHG YTIVESSMDK
SRMTVSHRNM NDGTVEGVRY KDMPVFTVQF HPEASPGPED TAYLFDEFID MMKKYSR