Gene Cthe_0949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0949 
SymbolcarB 
ID4811242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1135725 
End bp1138943 
Gene Length3219 bp 
Protein Length1072 aa 
Translation table11 
GC content43% 
IMG OID640106368 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_001037376 
Protein GI125973466 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.742849 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTAAAA GGGATGATGT AAAAAAGGTC CTGGTTATCG GCTCGGGTCC TATTGTAATA 
GGCCAGGCGG CGGAATTTGA CTATGCCGGA ACCCAGGCCT GCAGGGCTTT AAAAGAGGAA
AACATAGAAG TGGTGCTGGT TAACAGCAAT CCTGCCACAA TAATGACGGA TACCAATATA
GCTGATAGAG TATATATTGA ACCTCTTACG GTGGAGGTTG TAAAGAAAAT TATAATAAAA
GAAAAACCGG ACAGCATACT TCCCACTTTG GGAGGGCAGA CAGGACTTAA TCTTGCCATG
GAGCTTGCAG AGTCAGGATT TTTGAAGGAA CATGGCGTAA AACTTCTGGG AACTGCCACC
GAGGCCATAA AAATGGCCGA AGACAGACAG GCTTTTAAAG ATACCATGGA AAGGATAGGA
GAGCCTTGTA TTGCCAGCAA GGTGGTAAAC ACAGTTGAGG ATGCGCTGGA TTTTGCAAAA
GAAATCGCTT ATCCGGTTGT TGTGCGTCCT GCCTATACTT TGGGGGGAAC CGGCGGCGGA
ATTGCTTACA ATGAAGAAGA ACTGAAAGAG ATTGCATCCA ATGGTTTGAG GCTCAGCAGA
GTACACCAGG TACTTATCGA GAAATGTATC GCCGGTTGGA AAGAAATTGA GTATGAGGTT
ATGCGCGACA GCAAGGGTAA TGTAATTACC GTATGTAACA TGGAAAATAT CGACCCGGTA
GGAGTTCATA CCGGAGACAG CATTGTTGTT GCGCCTTCCC AAACGCTTAC GGACAGAGAA
TATCAGATGC TTCGCTCCTC GGCACTGAAG ATAATATCGG CATTGGGCAT CGAGGGAGGC
TGTAACGTTC AGTTTGCTCT AAATCCCAAC AGTTTTGAAT ATGCGGTAAT AGAGGTCAAC
CCAAGGGTGA GCCGTTCCTC GGCATTGGCC TCAAAGGCAA CGGGTTACCC AATTGCAAAG
GTTGCCACCA AGATAGCCAT AGGTTATGGT CTTGATGAAA TAAAAAATGC AGTGACTGGA
AAGACTTTTG CGTGTTTTGA GCCTACTTTG GACTATGTTG TAATCAAAAT ACCCAAATGG
CCTTTTGACA AATTTGTAAA GGCGAAAAGG ACTTTGGGAA CCCAGATGAA AGCCACCGGT
GAAGTTATGG CTATAAGCAG CTCTTTTGAA GGGGCTCTGA TGAAAGCTTT AAGGTCGCTG
GAACTGGGTA TTTTCACACT GGAACAGGAT ATTTACAAAA AGTTCAGTGC CGAAGAAATA
AGGCAAAAGA TAAAAGATGT CAGTGATGAG AGGATTCTTG TTATTGCCGA GGCAATAAGA
AGGGGCGTAA CGGTTGAGGA AATCAACAAT GTTACAAAGA TAGATTTGTT CTTCTTAAAC
AAAATCAAAA ATCTGGTCCT TATGGAAGAA AAGTTAAAAA CCATGAAGCT TTCGGATTTT
GACGAAGAAA CCTTAAGAAC GGTTAAAAAG ATGGGCTTTA CCGATGCGGT TATAGCAAAA
TATGTTGGAT GCGACAAAAA GGAAGTTACG GCAAAAAGAA AAGAGCTTGG CATTTGTGCC
GTTTATAAAA TGGTTGATAC CTGTGCCGCA GAATTTGAGG CCATGACACC TTATTATTAC
TCTACCTATG ATGAATGCTG CGAGGCTAAG AAATCGGACA AGAAAAAGGT GCTGGTAATA
GGGTCGGGAC CCATAAGGAT AGGCCAGGGT ATTGAGTTTG ACTACTGCTC GGTTCATTCG
GTATGGGGAT TGAAGCAGGA AGGTTATGAG ACTATAATTG CAAACAACAA TCCTGAGACG
GTAAGTACGG ATTTTGACAC AGCGGATAGG CTTTACTTTG AACCGTTGAC TCCTGAAGAC
GTGGAGAACA TTGTTGAAAA AGAAAAAGTA GACGGGGCAA TTGTTCAGTT TGGCGGACAG
ACTGCGATTA AACTTACAAA AGCCCTTGTG GAAATGGGAG TAAAAGTTTT TGGCACCGAA
CCTAAATATA TTGATGCCGC CGAGGACCGT GAGAAATTTG ACAGGATTCT CGAAGAACTC
GGTATTCCAA GACCTAAAGG AAAGACTATA TTTACCCTCG ATGAAGCTTT GGAGGCTGCA
AACGAATTGG GCTATCCTGT ACTGGTAAGA CCCTCATATG TACTTGGCGG ACAGGGTATG
GAAATAGCTT ACAACGACAA AGACATAGTA GAATTCATGG AGATTATAAA CAGGGTAAAG
CAGGAACATC CCATCCTGAT AGACAAGTAT ATGATGGGCA AGGAAATTGA AGTGGATGCC
ATATCCGACG GTGAGGATAT TTTGATACCC GGTATAATGG AGCACTTGGA AAGAGCCGGT
GTTCACTCCG GGGACAGTAT ATCCGTTTAC CCGACGCAGA CAATAGGGGA AAAGCTCAAG
GAAAAGATTG TGGATTACAC TCAAAAGCTT GCCAAAGCGT TAAGAGTTGT TGGATTGATC
AATATCCAAT ATGTGTATTA CAATAACGAG CTTTATGTTA TCGAGGTAAA CCCGCGTTCA
AGCCGTACCG TTCCGTATAT AAGCAAGGTA ACGGGAATAC CTATGGTAAA TATCGCTACA
AGGATAATGA TGGGCAAAAA ACTCAAGGAT TTCAATTACG GAACGGGACT TTACAAAGAA
TCGGAGTATG TGGCGGTGAA GGTTCCGGTG TTCTCCTTTG AAAAGCTTCA TGACGTTGAT
ACAAGCCTGG GACCTGAAAT GAAGTCCACC GGTGAAGTTT TGGGTATAGC AAAAACTTTC
CCGGAGGCGC TGTACAAAGG AATTATTGCG ACAGGAATCA AGCTCCCTAA AAAAGGCGGC
GCGATACTGA TGACTGTCAG GGATACCGAC AAGCCTGAGC TTGTACAGCT GGCGGAAGAA
TTTGAAAAGC TTGGATTTGA GCTTTATGCC ACGGGAAAAA CCGCCAACAT GCTGAACAAC
CAGGGAATTG CCACCAATGC CGTGAAAAAA ATTGGCGAGG GTGAGCCAAA TCTTCTGAAT
TTGATTGAAT CAGGAAAAAT AAGCCTTATA ATCAATACTC CTACAAAAGG AAGACAGCCT
GAAAGGGATG GATTTAAAAT AAGGCGAAAA GCCGTTGAAA TGTCGATACC TTGCCTTACG
TCCCTTGATA CTGCACGGGC TGTGCTGGAA TGCATAAAAC TGGAGAAAGA GGAAAAAGAC
CTTGAAGTTA TAGATTTGAG CGTATTTGAT AACCAGTGA
 
Protein sequence
MPKRDDVKKV LVIGSGPIVI GQAAEFDYAG TQACRALKEE NIEVVLVNSN PATIMTDTNI 
ADRVYIEPLT VEVVKKIIIK EKPDSILPTL GGQTGLNLAM ELAESGFLKE HGVKLLGTAT
EAIKMAEDRQ AFKDTMERIG EPCIASKVVN TVEDALDFAK EIAYPVVVRP AYTLGGTGGG
IAYNEEELKE IASNGLRLSR VHQVLIEKCI AGWKEIEYEV MRDSKGNVIT VCNMENIDPV
GVHTGDSIVV APSQTLTDRE YQMLRSSALK IISALGIEGG CNVQFALNPN SFEYAVIEVN
PRVSRSSALA SKATGYPIAK VATKIAIGYG LDEIKNAVTG KTFACFEPTL DYVVIKIPKW
PFDKFVKAKR TLGTQMKATG EVMAISSSFE GALMKALRSL ELGIFTLEQD IYKKFSAEEI
RQKIKDVSDE RILVIAEAIR RGVTVEEINN VTKIDLFFLN KIKNLVLMEE KLKTMKLSDF
DEETLRTVKK MGFTDAVIAK YVGCDKKEVT AKRKELGICA VYKMVDTCAA EFEAMTPYYY
STYDECCEAK KSDKKKVLVI GSGPIRIGQG IEFDYCSVHS VWGLKQEGYE TIIANNNPET
VSTDFDTADR LYFEPLTPED VENIVEKEKV DGAIVQFGGQ TAIKLTKALV EMGVKVFGTE
PKYIDAAEDR EKFDRILEEL GIPRPKGKTI FTLDEALEAA NELGYPVLVR PSYVLGGQGM
EIAYNDKDIV EFMEIINRVK QEHPILIDKY MMGKEIEVDA ISDGEDILIP GIMEHLERAG
VHSGDSISVY PTQTIGEKLK EKIVDYTQKL AKALRVVGLI NIQYVYYNNE LYVIEVNPRS
SRTVPYISKV TGIPMVNIAT RIMMGKKLKD FNYGTGLYKE SEYVAVKVPV FSFEKLHDVD
TSLGPEMKST GEVLGIAKTF PEALYKGIIA TGIKLPKKGG AILMTVRDTD KPELVQLAEE
FEKLGFELYA TGKTANMLNN QGIATNAVKK IGEGEPNLLN LIESGKISLI INTPTKGRQP
ERDGFKIRRK AVEMSIPCLT SLDTARAVLE CIKLEKEEKD LEVIDLSVFD NQ