Gene Cthe_1868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1868 
Symbol 
ID4809199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2214537 
End bp2217740 
Gene Length3204 bp 
Protein Length1067 aa 
Translation table11 
GC content43% 
IMG OID640107287 
Productcarbamoyl-phosphate synthase large subunit 
Protein accessionYP_001038282 
Protein GI125974372 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAAA TAGATAAAAT AAAAAAGGTC CTTGTTATCG GTTCGGGACC TATAGTTATC 
GGTCAGGCTG CTGAATTCGA CTACTCAGGA ACACAGGCTT GCAAAGCTTT AAGGGAAGAA
GGAATTGAAG TCGTTCTTGT AAACAGCAAC CCTGCAACAA TAATGACCGA TACCCAATCA
GCGGACAGAG TATATATAGA ACCTATTACC TTGGATTTCG TCAAAAAAAT AATCAGAAGG
GAAAAACCCG ACGGGATTTT AGCGTCCCTG GGCGGGCAAA CCGGCCTGAA CATGGCAATA
CAGCTGGCGG AAGACGGAAT TTTGGATGAA ATGGGAATTG AGCTTCTGGG TACATCTTTG
GAATCAATCA GAAAAGCCGA AGACAGGGAG CTTTTCAAGA AAACCATGCA GGAAATAGGC
GAAAAAGTAC CCTTAAGCAC GATTGCAACC GACCTTGACA GTGCCGTTAA ATTTGCCGAA
GAAGTCGGCT TCCCCATCAT CATACGTCCG GCCTATACAT TGGGAGGCAC CGGCGGCGGT
ATCGCCCATA ACATGGAGGA ATTCAAATAT ATTTGCGGCA AAGGCTTAAA ACTCAGCCTC
ATACACCAGG TTCTTCTGGA GCAAAGCGTT GCAGGCTGGA AGGAAATTGA ATATGAAGTT
ATAAGGGACG GTGCCGACAA CAGTATCATC ATTTGCAACA TGGAAAACTT CGACCCTGTG
GGAGTCCACA CCGGCGACAG TATTGTCGTT GCCCCGTGCC AGACCTTGTC GGACGTTGAA
GCCCAGATGC TGAGAGCTGC TTCATTGAAA ATCATTCGTG CTCTTAACAT AAAAGGCGGC
TGCAATATTC AGTATGCATT AAACCCCAAC AGTTTTGAAT ATGTGGTGAT TGAAGTAAAT
CCAAGGGTAA GCCGTTCCAG CGCCCTTGCT TCCAAAGCAA CGGGATATCC TATTGCGCGT
GTTGCAGCCA AAATTGCAAT CGGTCTTAAC CTTGACGAAA TCAAGAACTC TGTTACTCAC
ACTACCTATG CCTGTTTCGA ACCTTCAATC GACTACGTGG TCACAAAAGT GCCCCGCTGG
CCTTTTGACA AATTTTCAAA TGCGGACAGG TCATTGGGCA CCCAGATGAA GGCAACCGGT
GAAGTAATGG CCATAGGCAG AACTTTTGAA GAATCACTGC TCAAAGCCAT AGACTCGTTG
GATATTAAAA TGAACTATCA GCTTGGTCTC AGCCTTTTTG ACAACAAGTC GGTGGAAGAA
CTTCTTGATT TTATCAAGAC ACCGAGCGAT GAAAGAATTT TCGCCATAAG CAAGGCACTT
CAGAAAGGAG TGTCACCGGA AGAAATCAGT AATATAACAA AAATTGACAT ATTCTTCATA
AAGAAGCTTG AAAAAATCGT CAAAGTGGCG GAAGAAATCA AAAATGCCGG TATTGCATGG
CTTGATTACG ACCTCTACTA CAAGGCAAAG AAAACCGGTT TTGGAGACTC GTATATTGCA
AATCTCATAA ACGTGCCTCT TGACACAATT TTAGAACTCA GAAACAAATA CCCCATCCGT
CCGGTGTACA AAATGGTTGA TACCTGTGCC GGAGAATTTG AAGCCGTAAC TCCGTACTAT
TATTCAACTT ATGAAGAAAC TGATGAGGTA GTTGTATCCG GTAAAAAGAA GGTTATAGTT
ATAGGTTCAG GACCTATAAG GATAGGCCAG GGGATTGAGT TTGACTACTG CAGTGTGCAC
TCGGTAAAAA CACTGAAAGA AATGGGATTT GAGGCAATTA TTATAAACAA CAACCCGGAA
ACGGTAAGTA CCGATTTTGA CACATCGGAC AAACTGTATT TTGAACCTCT CACAAAAGAG
TGCGTGCTTG ATATAATCGA AAAGGAAAAG CCTCTTGGAG TAATTGTTCA ATTTGGAGGG
CAGACTTCAA TAAATCTTGC GGGAACACTG GCAAAGGAAG GAGTAAATAT TCTCGGCACT
TCGGTTGAAA GCATTGATAT TGCCGAAGAC AGGGACAGAT TCCTAAACCT TTTGGAAGAA
CTGGGAATAC CATTGCCGGA AGGAGACACA GCTTTTTCCT ATGAAGAAGC AAAGGCGATA
GCCCAAAGAA TCGGATATCC TGTTCTCGTA AGGCCTTCAT ATGTTCTTGG CGGCCGAGCA
ATGGAAGTGG TGTACAACGA TGAGACCTTA AAAGAATACA TGCAGCTTGC TGTGGGTCTT
GCCACAAATC ATCCCGTTTT GATAGACAAA TATATAGAAG GCAAGGAAGT TGAAGTGGAT
GGTATTTGTG ACGGAGAGGA CGTACTTATT CCCGGCATTA TGGAACATAT TGAAAGAGCG
GGTGTTCACT CCGGAGACAG TATTTCAATA TATCCTCCCC AGACTCTTGA CGATGAAACA
AAGAATACCA TTGTGGATTA TACCATAAGG CTGGCAAAAG CATTAAAAAT AGTCGGTCTG
TTTAATATCC AGTTTGTTAT AGACCGTCAT AGCAAAGTAT ATGTTATTGA AGTTAATCCC
CGTGCAAGCC GTACAGTCCC TGTAATGAGC AAGGTAACCG GTATTCCTAT GGTTGATGTT
GCCACAAAGT TTATCATGGG ATATAAAATG AGAGACTTAG GATACACTCC CGGACTTTAC
AAAGAGTCTG AATTCGTAGC CGTAAAAGCT CCTGTATTCT CGTTCTCAAA ACTTACCACC
GTTGACACGT TCCTTGGACC GGAAATGAAG TCCACCGGTG AGGTAATGGG AATTGCAAAG
GATTATCATA TAGCACTGTA CAAAGCACTT GTCGCATCGG GAATTAAAAT CCCGTCCGGC
GGAAACGTAC TTCTTTCCAT AGCGGACAGA GACAAAACTG AATGCATAGA AATAGCTCAG
GCTCTTTCAG ACTTAGGTTT CAACCTTGTA GCTTCCGAGG GCACATATAC AAATCTCTCC
GGTGTCGGAA TAGAAGTGGA TATGGTAACG GATGACGAAA TGATTGAGAT GATAAAGAAA
GATAAAATAT CACTGGTTAT TAATACCCCA ACCCGGGGAA AAATACCTGA AAGGCATGGT
TTTATACTAA GAAGAACGGC AATTGAGTAT AATATTCCAT GTATTACTTC TCTGGATACC
GCCAGGTCAA TGATTTCAAT ACTGGAACAC ATGACATCCG GAGAGGAGAT TGAAATATAT
TCACTTGACG AATATTCAAA ATAA
 
Protein sequence
MPKIDKIKKV LVIGSGPIVI GQAAEFDYSG TQACKALREE GIEVVLVNSN PATIMTDTQS 
ADRVYIEPIT LDFVKKIIRR EKPDGILASL GGQTGLNMAI QLAEDGILDE MGIELLGTSL
ESIRKAEDRE LFKKTMQEIG EKVPLSTIAT DLDSAVKFAE EVGFPIIIRP AYTLGGTGGG
IAHNMEEFKY ICGKGLKLSL IHQVLLEQSV AGWKEIEYEV IRDGADNSII ICNMENFDPV
GVHTGDSIVV APCQTLSDVE AQMLRAASLK IIRALNIKGG CNIQYALNPN SFEYVVIEVN
PRVSRSSALA SKATGYPIAR VAAKIAIGLN LDEIKNSVTH TTYACFEPSI DYVVTKVPRW
PFDKFSNADR SLGTQMKATG EVMAIGRTFE ESLLKAIDSL DIKMNYQLGL SLFDNKSVEE
LLDFIKTPSD ERIFAISKAL QKGVSPEEIS NITKIDIFFI KKLEKIVKVA EEIKNAGIAW
LDYDLYYKAK KTGFGDSYIA NLINVPLDTI LELRNKYPIR PVYKMVDTCA GEFEAVTPYY
YSTYEETDEV VVSGKKKVIV IGSGPIRIGQ GIEFDYCSVH SVKTLKEMGF EAIIINNNPE
TVSTDFDTSD KLYFEPLTKE CVLDIIEKEK PLGVIVQFGG QTSINLAGTL AKEGVNILGT
SVESIDIAED RDRFLNLLEE LGIPLPEGDT AFSYEEAKAI AQRIGYPVLV RPSYVLGGRA
MEVVYNDETL KEYMQLAVGL ATNHPVLIDK YIEGKEVEVD GICDGEDVLI PGIMEHIERA
GVHSGDSISI YPPQTLDDET KNTIVDYTIR LAKALKIVGL FNIQFVIDRH SKVYVIEVNP
RASRTVPVMS KVTGIPMVDV ATKFIMGYKM RDLGYTPGLY KESEFVAVKA PVFSFSKLTT
VDTFLGPEMK STGEVMGIAK DYHIALYKAL VASGIKIPSG GNVLLSIADR DKTECIEIAQ
ALSDLGFNLV ASEGTYTNLS GVGIEVDMVT DDEMIEMIKK DKISLVINTP TRGKIPERHG
FILRRTAIEY NIPCITSLDT ARSMISILEH MTSGEEIEIY SLDEYSK