Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1868 |
Symbol | |
ID | 4809199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2214537 |
End bp | 2217740 |
Gene Length | 3204 bp |
Protein Length | 1067 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107287 |
Product | carbamoyl-phosphate synthase large subunit |
Protein accession | YP_001038282 |
Protein GI | 125974372 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAAAA TAGATAAAAT AAAAAAGGTC CTTGTTATCG GTTCGGGACC TATAGTTATC GGTCAGGCTG CTGAATTCGA CTACTCAGGA ACACAGGCTT GCAAAGCTTT AAGGGAAGAA GGAATTGAAG TCGTTCTTGT AAACAGCAAC CCTGCAACAA TAATGACCGA TACCCAATCA GCGGACAGAG TATATATAGA ACCTATTACC TTGGATTTCG TCAAAAAAAT AATCAGAAGG GAAAAACCCG ACGGGATTTT AGCGTCCCTG GGCGGGCAAA CCGGCCTGAA CATGGCAATA CAGCTGGCGG AAGACGGAAT TTTGGATGAA ATGGGAATTG AGCTTCTGGG TACATCTTTG GAATCAATCA GAAAAGCCGA AGACAGGGAG CTTTTCAAGA AAACCATGCA GGAAATAGGC GAAAAAGTAC CCTTAAGCAC GATTGCAACC GACCTTGACA GTGCCGTTAA ATTTGCCGAA GAAGTCGGCT TCCCCATCAT CATACGTCCG GCCTATACAT TGGGAGGCAC CGGCGGCGGT ATCGCCCATA ACATGGAGGA ATTCAAATAT ATTTGCGGCA AAGGCTTAAA ACTCAGCCTC ATACACCAGG TTCTTCTGGA GCAAAGCGTT GCAGGCTGGA AGGAAATTGA ATATGAAGTT ATAAGGGACG GTGCCGACAA CAGTATCATC ATTTGCAACA TGGAAAACTT CGACCCTGTG GGAGTCCACA CCGGCGACAG TATTGTCGTT GCCCCGTGCC AGACCTTGTC GGACGTTGAA GCCCAGATGC TGAGAGCTGC TTCATTGAAA ATCATTCGTG CTCTTAACAT AAAAGGCGGC TGCAATATTC AGTATGCATT AAACCCCAAC AGTTTTGAAT ATGTGGTGAT TGAAGTAAAT CCAAGGGTAA GCCGTTCCAG CGCCCTTGCT TCCAAAGCAA CGGGATATCC TATTGCGCGT GTTGCAGCCA AAATTGCAAT CGGTCTTAAC CTTGACGAAA TCAAGAACTC TGTTACTCAC ACTACCTATG CCTGTTTCGA ACCTTCAATC GACTACGTGG TCACAAAAGT GCCCCGCTGG CCTTTTGACA AATTTTCAAA TGCGGACAGG TCATTGGGCA CCCAGATGAA GGCAACCGGT GAAGTAATGG CCATAGGCAG AACTTTTGAA GAATCACTGC TCAAAGCCAT AGACTCGTTG GATATTAAAA TGAACTATCA GCTTGGTCTC AGCCTTTTTG ACAACAAGTC GGTGGAAGAA CTTCTTGATT TTATCAAGAC ACCGAGCGAT GAAAGAATTT TCGCCATAAG CAAGGCACTT CAGAAAGGAG TGTCACCGGA AGAAATCAGT AATATAACAA AAATTGACAT ATTCTTCATA AAGAAGCTTG AAAAAATCGT CAAAGTGGCG GAAGAAATCA AAAATGCCGG TATTGCATGG CTTGATTACG ACCTCTACTA CAAGGCAAAG AAAACCGGTT TTGGAGACTC GTATATTGCA AATCTCATAA ACGTGCCTCT TGACACAATT TTAGAACTCA GAAACAAATA CCCCATCCGT CCGGTGTACA AAATGGTTGA TACCTGTGCC GGAGAATTTG AAGCCGTAAC TCCGTACTAT TATTCAACTT ATGAAGAAAC TGATGAGGTA GTTGTATCCG GTAAAAAGAA GGTTATAGTT ATAGGTTCAG GACCTATAAG GATAGGCCAG GGGATTGAGT TTGACTACTG CAGTGTGCAC TCGGTAAAAA CACTGAAAGA AATGGGATTT GAGGCAATTA TTATAAACAA CAACCCGGAA ACGGTAAGTA CCGATTTTGA CACATCGGAC AAACTGTATT TTGAACCTCT CACAAAAGAG TGCGTGCTTG ATATAATCGA AAAGGAAAAG CCTCTTGGAG TAATTGTTCA ATTTGGAGGG CAGACTTCAA TAAATCTTGC GGGAACACTG GCAAAGGAAG GAGTAAATAT TCTCGGCACT TCGGTTGAAA GCATTGATAT TGCCGAAGAC AGGGACAGAT TCCTAAACCT TTTGGAAGAA CTGGGAATAC CATTGCCGGA AGGAGACACA GCTTTTTCCT ATGAAGAAGC AAAGGCGATA GCCCAAAGAA TCGGATATCC TGTTCTCGTA AGGCCTTCAT ATGTTCTTGG CGGCCGAGCA ATGGAAGTGG TGTACAACGA TGAGACCTTA AAAGAATACA TGCAGCTTGC TGTGGGTCTT GCCACAAATC ATCCCGTTTT GATAGACAAA TATATAGAAG GCAAGGAAGT TGAAGTGGAT GGTATTTGTG ACGGAGAGGA CGTACTTATT CCCGGCATTA TGGAACATAT TGAAAGAGCG GGTGTTCACT CCGGAGACAG TATTTCAATA TATCCTCCCC AGACTCTTGA CGATGAAACA AAGAATACCA TTGTGGATTA TACCATAAGG CTGGCAAAAG CATTAAAAAT AGTCGGTCTG TTTAATATCC AGTTTGTTAT AGACCGTCAT AGCAAAGTAT ATGTTATTGA AGTTAATCCC CGTGCAAGCC GTACAGTCCC TGTAATGAGC AAGGTAACCG GTATTCCTAT GGTTGATGTT GCCACAAAGT TTATCATGGG ATATAAAATG AGAGACTTAG GATACACTCC CGGACTTTAC AAAGAGTCTG AATTCGTAGC CGTAAAAGCT CCTGTATTCT CGTTCTCAAA ACTTACCACC GTTGACACGT TCCTTGGACC GGAAATGAAG TCCACCGGTG AGGTAATGGG AATTGCAAAG GATTATCATA TAGCACTGTA CAAAGCACTT GTCGCATCGG GAATTAAAAT CCCGTCCGGC GGAAACGTAC TTCTTTCCAT AGCGGACAGA GACAAAACTG AATGCATAGA AATAGCTCAG GCTCTTTCAG ACTTAGGTTT CAACCTTGTA GCTTCCGAGG GCACATATAC AAATCTCTCC GGTGTCGGAA TAGAAGTGGA TATGGTAACG GATGACGAAA TGATTGAGAT GATAAAGAAA GATAAAATAT CACTGGTTAT TAATACCCCA ACCCGGGGAA AAATACCTGA AAGGCATGGT TTTATACTAA GAAGAACGGC AATTGAGTAT AATATTCCAT GTATTACTTC TCTGGATACC GCCAGGTCAA TGATTTCAAT ACTGGAACAC ATGACATCCG GAGAGGAGAT TGAAATATAT TCACTTGACG AATATTCAAA ATAA
|
Protein sequence | MPKIDKIKKV LVIGSGPIVI GQAAEFDYSG TQACKALREE GIEVVLVNSN PATIMTDTQS ADRVYIEPIT LDFVKKIIRR EKPDGILASL GGQTGLNMAI QLAEDGILDE MGIELLGTSL ESIRKAEDRE LFKKTMQEIG EKVPLSTIAT DLDSAVKFAE EVGFPIIIRP AYTLGGTGGG IAHNMEEFKY ICGKGLKLSL IHQVLLEQSV AGWKEIEYEV IRDGADNSII ICNMENFDPV GVHTGDSIVV APCQTLSDVE AQMLRAASLK IIRALNIKGG CNIQYALNPN SFEYVVIEVN PRVSRSSALA SKATGYPIAR VAAKIAIGLN LDEIKNSVTH TTYACFEPSI DYVVTKVPRW PFDKFSNADR SLGTQMKATG EVMAIGRTFE ESLLKAIDSL DIKMNYQLGL SLFDNKSVEE LLDFIKTPSD ERIFAISKAL QKGVSPEEIS NITKIDIFFI KKLEKIVKVA EEIKNAGIAW LDYDLYYKAK KTGFGDSYIA NLINVPLDTI LELRNKYPIR PVYKMVDTCA GEFEAVTPYY YSTYEETDEV VVSGKKKVIV IGSGPIRIGQ GIEFDYCSVH SVKTLKEMGF EAIIINNNPE TVSTDFDTSD KLYFEPLTKE CVLDIIEKEK PLGVIVQFGG QTSINLAGTL AKEGVNILGT SVESIDIAED RDRFLNLLEE LGIPLPEGDT AFSYEEAKAI AQRIGYPVLV RPSYVLGGRA MEVVYNDETL KEYMQLAVGL ATNHPVLIDK YIEGKEVEVD GICDGEDVLI PGIMEHIERA GVHSGDSISI YPPQTLDDET KNTIVDYTIR LAKALKIVGL FNIQFVIDRH SKVYVIEVNP RASRTVPVMS KVTGIPMVDV ATKFIMGYKM RDLGYTPGLY KESEFVAVKA PVFSFSKLTT VDTFLGPEMK STGEVMGIAK DYHIALYKAL VASGIKIPSG GNVLLSIADR DKTECIEIAQ ALSDLGFNLV ASEGTYTNLS GVGIEVDMVT DDEMIEMIKK DKISLVINTP TRGKIPERHG FILRRTAIEY NIPCITSLDT ARSMISILEH MTSGEEIEIY SLDEYSK
|
| |