Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0949 |
Symbol | carB |
ID | 4811242 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1135725 |
End bp | 1138943 |
Gene Length | 3219 bp |
Protein Length | 1072 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106368 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_001037376 |
Protein GI | 125973466 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.742849 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAAAA GGGATGATGT AAAAAAGGTC CTGGTTATCG GCTCGGGTCC TATTGTAATA GGCCAGGCGG CGGAATTTGA CTATGCCGGA ACCCAGGCCT GCAGGGCTTT AAAAGAGGAA AACATAGAAG TGGTGCTGGT TAACAGCAAT CCTGCCACAA TAATGACGGA TACCAATATA GCTGATAGAG TATATATTGA ACCTCTTACG GTGGAGGTTG TAAAGAAAAT TATAATAAAA GAAAAACCGG ACAGCATACT TCCCACTTTG GGAGGGCAGA CAGGACTTAA TCTTGCCATG GAGCTTGCAG AGTCAGGATT TTTGAAGGAA CATGGCGTAA AACTTCTGGG AACTGCCACC GAGGCCATAA AAATGGCCGA AGACAGACAG GCTTTTAAAG ATACCATGGA AAGGATAGGA GAGCCTTGTA TTGCCAGCAA GGTGGTAAAC ACAGTTGAGG ATGCGCTGGA TTTTGCAAAA GAAATCGCTT ATCCGGTTGT TGTGCGTCCT GCCTATACTT TGGGGGGAAC CGGCGGCGGA ATTGCTTACA ATGAAGAAGA ACTGAAAGAG ATTGCATCCA ATGGTTTGAG GCTCAGCAGA GTACACCAGG TACTTATCGA GAAATGTATC GCCGGTTGGA AAGAAATTGA GTATGAGGTT ATGCGCGACA GCAAGGGTAA TGTAATTACC GTATGTAACA TGGAAAATAT CGACCCGGTA GGAGTTCATA CCGGAGACAG CATTGTTGTT GCGCCTTCCC AAACGCTTAC GGACAGAGAA TATCAGATGC TTCGCTCCTC GGCACTGAAG ATAATATCGG CATTGGGCAT CGAGGGAGGC TGTAACGTTC AGTTTGCTCT AAATCCCAAC AGTTTTGAAT ATGCGGTAAT AGAGGTCAAC CCAAGGGTGA GCCGTTCCTC GGCATTGGCC TCAAAGGCAA CGGGTTACCC AATTGCAAAG GTTGCCACCA AGATAGCCAT AGGTTATGGT CTTGATGAAA TAAAAAATGC AGTGACTGGA AAGACTTTTG CGTGTTTTGA GCCTACTTTG GACTATGTTG TAATCAAAAT ACCCAAATGG CCTTTTGACA AATTTGTAAA GGCGAAAAGG ACTTTGGGAA CCCAGATGAA AGCCACCGGT GAAGTTATGG CTATAAGCAG CTCTTTTGAA GGGGCTCTGA TGAAAGCTTT AAGGTCGCTG GAACTGGGTA TTTTCACACT GGAACAGGAT ATTTACAAAA AGTTCAGTGC CGAAGAAATA AGGCAAAAGA TAAAAGATGT CAGTGATGAG AGGATTCTTG TTATTGCCGA GGCAATAAGA AGGGGCGTAA CGGTTGAGGA AATCAACAAT GTTACAAAGA TAGATTTGTT CTTCTTAAAC AAAATCAAAA ATCTGGTCCT TATGGAAGAA AAGTTAAAAA CCATGAAGCT TTCGGATTTT GACGAAGAAA CCTTAAGAAC GGTTAAAAAG ATGGGCTTTA CCGATGCGGT TATAGCAAAA TATGTTGGAT GCGACAAAAA GGAAGTTACG GCAAAAAGAA AAGAGCTTGG CATTTGTGCC GTTTATAAAA TGGTTGATAC CTGTGCCGCA GAATTTGAGG CCATGACACC TTATTATTAC TCTACCTATG ATGAATGCTG CGAGGCTAAG AAATCGGACA AGAAAAAGGT GCTGGTAATA GGGTCGGGAC CCATAAGGAT AGGCCAGGGT ATTGAGTTTG ACTACTGCTC GGTTCATTCG GTATGGGGAT TGAAGCAGGA AGGTTATGAG ACTATAATTG CAAACAACAA TCCTGAGACG GTAAGTACGG ATTTTGACAC AGCGGATAGG CTTTACTTTG AACCGTTGAC TCCTGAAGAC GTGGAGAACA TTGTTGAAAA AGAAAAAGTA GACGGGGCAA TTGTTCAGTT TGGCGGACAG ACTGCGATTA AACTTACAAA AGCCCTTGTG GAAATGGGAG TAAAAGTTTT TGGCACCGAA CCTAAATATA TTGATGCCGC CGAGGACCGT GAGAAATTTG ACAGGATTCT CGAAGAACTC GGTATTCCAA GACCTAAAGG AAAGACTATA TTTACCCTCG ATGAAGCTTT GGAGGCTGCA AACGAATTGG GCTATCCTGT ACTGGTAAGA CCCTCATATG TACTTGGCGG ACAGGGTATG GAAATAGCTT ACAACGACAA AGACATAGTA GAATTCATGG AGATTATAAA CAGGGTAAAG CAGGAACATC CCATCCTGAT AGACAAGTAT ATGATGGGCA AGGAAATTGA AGTGGATGCC ATATCCGACG GTGAGGATAT TTTGATACCC GGTATAATGG AGCACTTGGA AAGAGCCGGT GTTCACTCCG GGGACAGTAT ATCCGTTTAC CCGACGCAGA CAATAGGGGA AAAGCTCAAG GAAAAGATTG TGGATTACAC TCAAAAGCTT GCCAAAGCGT TAAGAGTTGT TGGATTGATC AATATCCAAT ATGTGTATTA CAATAACGAG CTTTATGTTA TCGAGGTAAA CCCGCGTTCA AGCCGTACCG TTCCGTATAT AAGCAAGGTA ACGGGAATAC CTATGGTAAA TATCGCTACA AGGATAATGA TGGGCAAAAA ACTCAAGGAT TTCAATTACG GAACGGGACT TTACAAAGAA TCGGAGTATG TGGCGGTGAA GGTTCCGGTG TTCTCCTTTG AAAAGCTTCA TGACGTTGAT ACAAGCCTGG GACCTGAAAT GAAGTCCACC GGTGAAGTTT TGGGTATAGC AAAAACTTTC CCGGAGGCGC TGTACAAAGG AATTATTGCG ACAGGAATCA AGCTCCCTAA AAAAGGCGGC GCGATACTGA TGACTGTCAG GGATACCGAC AAGCCTGAGC TTGTACAGCT GGCGGAAGAA TTTGAAAAGC TTGGATTTGA GCTTTATGCC ACGGGAAAAA CCGCCAACAT GCTGAACAAC CAGGGAATTG CCACCAATGC CGTGAAAAAA ATTGGCGAGG GTGAGCCAAA TCTTCTGAAT TTGATTGAAT CAGGAAAAAT AAGCCTTATA ATCAATACTC CTACAAAAGG AAGACAGCCT GAAAGGGATG GATTTAAAAT AAGGCGAAAA GCCGTTGAAA TGTCGATACC TTGCCTTACG TCCCTTGATA CTGCACGGGC TGTGCTGGAA TGCATAAAAC TGGAGAAAGA GGAAAAAGAC CTTGAAGTTA TAGATTTGAG CGTATTTGAT AACCAGTGA
|
Protein sequence | MPKRDDVKKV LVIGSGPIVI GQAAEFDYAG TQACRALKEE NIEVVLVNSN PATIMTDTNI ADRVYIEPLT VEVVKKIIIK EKPDSILPTL GGQTGLNLAM ELAESGFLKE HGVKLLGTAT EAIKMAEDRQ AFKDTMERIG EPCIASKVVN TVEDALDFAK EIAYPVVVRP AYTLGGTGGG IAYNEEELKE IASNGLRLSR VHQVLIEKCI AGWKEIEYEV MRDSKGNVIT VCNMENIDPV GVHTGDSIVV APSQTLTDRE YQMLRSSALK IISALGIEGG CNVQFALNPN SFEYAVIEVN PRVSRSSALA SKATGYPIAK VATKIAIGYG LDEIKNAVTG KTFACFEPTL DYVVIKIPKW PFDKFVKAKR TLGTQMKATG EVMAISSSFE GALMKALRSL ELGIFTLEQD IYKKFSAEEI RQKIKDVSDE RILVIAEAIR RGVTVEEINN VTKIDLFFLN KIKNLVLMEE KLKTMKLSDF DEETLRTVKK MGFTDAVIAK YVGCDKKEVT AKRKELGICA VYKMVDTCAA EFEAMTPYYY STYDECCEAK KSDKKKVLVI GSGPIRIGQG IEFDYCSVHS VWGLKQEGYE TIIANNNPET VSTDFDTADR LYFEPLTPED VENIVEKEKV DGAIVQFGGQ TAIKLTKALV EMGVKVFGTE PKYIDAAEDR EKFDRILEEL GIPRPKGKTI FTLDEALEAA NELGYPVLVR PSYVLGGQGM EIAYNDKDIV EFMEIINRVK QEHPILIDKY MMGKEIEVDA ISDGEDILIP GIMEHLERAG VHSGDSISVY PTQTIGEKLK EKIVDYTQKL AKALRVVGLI NIQYVYYNNE LYVIEVNPRS SRTVPYISKV TGIPMVNIAT RIMMGKKLKD FNYGTGLYKE SEYVAVKVPV FSFEKLHDVD TSLGPEMKST GEVLGIAKTF PEALYKGIIA TGIKLPKKGG AILMTVRDTD KPELVQLAEE FEKLGFELYA TGKTANMLNN QGIATNAVKK IGEGEPNLLN LIESGKISLI INTPTKGRQP ERDGFKIRRK AVEMSIPCLT SLDTARAVLE CIKLEKEEKD LEVIDLSVFD NQ
|
| |