Gene Moth_0882 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0882 
Symbol 
ID3831520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp911455 
End bp914646 
Gene Length3192 bp 
Protein Length1063 aa 
Translation table11 
GC content63% 
IMG OID637828812 
Productcarbamoyl-phosphate synthase large subunit 
Protein accessionYP_429742 
Protein GI83589733 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCATCG GTTCCGGCCC CATCATCATC GGTCAGGCGG CGGAATTTGA TTATGCCGGT 
ACCCAGGCCT GCCGGGCTTT AAAGGAAGAA GGAATGGAGG TCGTCCTGGT CAATTCCAAC
CCGGCGACCA TCATGACCGA CAGGGATATG GCCGACCGCG TTTACTTGGA ACCCCTGACC
CTGGATTTTG TAGCGAAGAT TGTCCGCCAG GAGCGGCCCG ACGGTCTGAT ACCCACCCTT
GGCGGTCAAA TGGGGCTGAA CCTGGCCATG GAACTGGCCG AAGCCGGGGT TCTAGAAGAA
ACGGGAGTCG AACTCCTGGG AACACCCCTG ACGGCCATCC AGCGGGCCGA GGACCGGGAG
CAGTTTAAGG AGATGATGCT GGCCATCGAC GAACCGGTAC CTGAGAGCCG GATTGTCAAT
CGGGTGGAGG AGGCCCTGGA GTTCGCCCGG GAAATCGGCT ATCCGGTCAT CGTGAGGCCG
GCCTATACCC TGGGCGGCAC CGGGGGCGGC GTGGCCCATA ACGAGGCCGA GCTCCGTTCC
ATTGCCCTGA AAGGGCTGAA GTTGAGCCTG ATCCAGCAGG TCCTGGTGGA GCGCTCCGTT
GCCGGCTGGA AGGAGATCGA GTACGAGGTC ATCCGCGACA GTAACGATAA CTGCATTACC
GTCTGCAATA TGGAGAATAT CGACCCGGTG GGCATTCATA CCGGCGACAG CATCGTCGTC
GCCCCTTCCC AGACCCTTTC CGACCGGGAG TATCACCTGT TGCGGCGCTC GGCCCTGAAA
ATCATCCGCG CCCTGGGTAT CGAAGGGGGT TGCAACGTCC AGTTCGCCCT GGACCCCGGG
AGCATGCGCT ATTACGTTAT CGAGGTCAAC CCCAGGGTCA GCCGTTCCAG CGCCCTGGCC
TCCAAGGCCA CCGGTTATCC CATCGCCAGG GTGGCCACCA AGATCGCGTT AGGGTTGACC
CTGGACGAGA TCCCCAACGC CGTCACCCGG GAAACAAAGG CCTGCTTTGA ACCGGCCCTG
GACTACGTCG TCGTCAAGAT CCCGCGTTTC CCCTTCGATA AATTCTCCCT GGCCGAGCGT
ATCCTCGGCA CCCAGATGAA GGCCACCGGG GAGGTCATGG CCATCGACCG CACCTTCAGC
GGCGCCCTGT TAAAGGCCGT CCGCTCCCTG GAGTTGAAGC TTGACGGCCT TAAGGTGGCG
GCCTTCCAGC GTTTCAGCGA CAGCGCCCTG CGCCGCAAGA TGGCTGAGGC GGACGACGAG
AGGCTCTTTG TCGTCGCCGA AGCCCTGCGC CGCGGCTGGA CCATCGCTTC CATCCACGAA
ATTACCGGCA TCGACCCCTA TTTCCTGGGT GAAATCGAGG CCATTGTGGC CATGGAAGAG
AAACTCGTCG CCGCCGGTCC CGCCCTGGAC GCGGCCACCT TAAAGCGCGC CAAAGCCATG
GGCTTCAGCG ACGGCGAGAT TGCCAACTTT ACCGGCCTGC CGCCGGCGGA CATTACCCGG
CTGCGGCAGG AAGAAGGCAT CCGCCCCACC TTTAAGATGG TGGATACCTG CGCCGCCGAG
TTTGAGGCCG TCACGCCCTA TTATTACTCC TGCTATGACG TGGAAGATGA GGTACACCCC
CTGGAGGGCC GCAAGGTGGC CGTCCTGGGG GCGGGGCCTA TCCGCATCGG CCAGGGAATC
GAGTTCGACT ACTGTTCGGT CCATGCCGCC TGGGCCCTGC GCCGGGCCGG CGTGCACCCC
ATAATGATCA ACAACAACCC GGAGACAGTC AGCACCGATT TCGACACCTC CGACCGCCTC
TACTTTGAAC CCCTGACACC GGAAAATGTC TTGAATGTCC TGGAAAAGGA GCAGCCGGAA
GGGGTGATCG TCCAGTTCGG CGGCCAGACG GCCATCAACC TGGCGCAAAC GGTGGCCGGC
GCCGGTTTTC CGGTCCTGGG GACGGCGGTG GTCGATATCG ACCGGGCCGA AGACCGGGAG
AAGTTCGACG CTTTGCTGAA CGAACTGGGT ATACCCCGGC CGCGGGGCGG GACGGCAACC
TCCGTGGGTG AAGTGGTAAA GATCGCCAAG GAGCTCGGTT TCCCGGTACT GGTACGGCCT
TCTTACGTCC TGGGCGGCAG GGCCATGGAG ATCGTCCACA GCGAAGGCGA GCTCCTGGAG
TACGCCACCA CCGCCGTCCG GGTGGCCCCG GAACACCCCG TCCTGGTGGA CAAATACCTG
CCTGGCACCG AGGTGGAGGT AGACGCCGTG AGCGACGGCG AGACCGTCCT CATCCCCGGT
ATCATGGAGC ACGTCGAACG GGCCGGCGTC CACTCCGGCG ACAGCATCGC TATTTACCCG
GCCCACAGCC TGCCGCCGGG GGTGACGGAA AAGATCGTCG CTTACACCGA GCAGCTGGCC
CGGGCCCTCC GGGTGCGCGG CCTCCTCAAT ATTCAATTTG TCATCCACCG GGGCGAGGTC
TACGTCCTGG AGGTAAATCC CCGTTCCAGC CGCACGGTAC CCTACCTCTC CAAGATTACC
GGCGTGCCCA TGGTGGCCCT GGCCACCAAC GTCATGCTGG GCAAAAGCCT GCCGGAGCAG
GGCTACCGGG GCGGCTTAAT GCCGCCGCCG GATTTTACCG CCGTAAAGGT CCCCGTCTTT
TCCTTCGGCA AGCTGTTGCA GGTGGACACC TCCCTGGGAC CGGAGATGAA GTCCACCGGC
GAGGTAATGG GGATTGATCC CGTCTTCGAA CGCGCCCTCT ATAAAGGCCT GGTAGCCGCC
GGCTGCTCCA TCCCCCATCA CGGCACCCTG CTGGCGACCA TCGCCGATAA GGACAAGGCG
GAAGCAGTGC CCATCATCAA GGGCTTTGCC GAACTGGGCT TCCAGGTGGT GGCTACCGCC
GGCACCGCCG GCGCCCTGGC CGCAGCGGGA CTCTTCGTAG AGAGGGTGGG GAAGATCCGC
GAGGGTTCGC CCCACATTAT CGACTATATC CGGGAAGGGA AGGTCCACTT TGTCCTCAAC
ACCCTCACCA GGGGCAAGAT GCCCGGCCGG GACGGTTTTA AGATCCGCCG CGCCGCGGCC
GAACTGGGCA TCCCCTGCCT GACTTCCCTG GATACGGCCC GGGCCCTGCT CAAAGTCCTC
CAGTCCCTGA AGTCCGGCGA CGGGTTTAAC CTCAAACCCC TGCAGGAGTA TGTACCCCTT
TCCCGTCCTT AA
 
Protein sequence
MVIGSGPIII GQAAEFDYAG TQACRALKEE GMEVVLVNSN PATIMTDRDM ADRVYLEPLT 
LDFVAKIVRQ ERPDGLIPTL GGQMGLNLAM ELAEAGVLEE TGVELLGTPL TAIQRAEDRE
QFKEMMLAID EPVPESRIVN RVEEALEFAR EIGYPVIVRP AYTLGGTGGG VAHNEAELRS
IALKGLKLSL IQQVLVERSV AGWKEIEYEV IRDSNDNCIT VCNMENIDPV GIHTGDSIVV
APSQTLSDRE YHLLRRSALK IIRALGIEGG CNVQFALDPG SMRYYVIEVN PRVSRSSALA
SKATGYPIAR VATKIALGLT LDEIPNAVTR ETKACFEPAL DYVVVKIPRF PFDKFSLAER
ILGTQMKATG EVMAIDRTFS GALLKAVRSL ELKLDGLKVA AFQRFSDSAL RRKMAEADDE
RLFVVAEALR RGWTIASIHE ITGIDPYFLG EIEAIVAMEE KLVAAGPALD AATLKRAKAM
GFSDGEIANF TGLPPADITR LRQEEGIRPT FKMVDTCAAE FEAVTPYYYS CYDVEDEVHP
LEGRKVAVLG AGPIRIGQGI EFDYCSVHAA WALRRAGVHP IMINNNPETV STDFDTSDRL
YFEPLTPENV LNVLEKEQPE GVIVQFGGQT AINLAQTVAG AGFPVLGTAV VDIDRAEDRE
KFDALLNELG IPRPRGGTAT SVGEVVKIAK ELGFPVLVRP SYVLGGRAME IVHSEGELLE
YATTAVRVAP EHPVLVDKYL PGTEVEVDAV SDGETVLIPG IMEHVERAGV HSGDSIAIYP
AHSLPPGVTE KIVAYTEQLA RALRVRGLLN IQFVIHRGEV YVLEVNPRSS RTVPYLSKIT
GVPMVALATN VMLGKSLPEQ GYRGGLMPPP DFTAVKVPVF SFGKLLQVDT SLGPEMKSTG
EVMGIDPVFE RALYKGLVAA GCSIPHHGTL LATIADKDKA EAVPIIKGFA ELGFQVVATA
GTAGALAAAG LFVERVGKIR EGSPHIIDYI REGKVHFVLN TLTRGKMPGR DGFKIRRAAA
ELGIPCLTSL DTARALLKVL QSLKSGDGFN LKPLQEYVPL SRP