Gene Moth_2409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2409 
SymbolpyrG 
ID3830776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2528723 
End bp2530336 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content58% 
IMG OID637830328 
ProductCTP synthetase 
Protein accessionYP_431234 
Protein GI83591225 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0504] CTP synthase (UTP-ammonia lyase) 
TIGRFAM ID[TIGR00337] CTP synthase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGCCA AATTTATTTT TGTTACCGGC GGTGTCACTT CTTCCCTGGG GAAGGGGATA 
ACCGCCGCTT CTTTAGGTAG ACTTCTAAAA AGCCGAGGCC TCAAAGTAGC CATCCAGAAG
TTCGACCCCT ATATCAATAT CGACCCCGGC ACCATGAGCC CGTACCAGCA TGGCGAGGTC
TTCGTCACCG ATGACGGCGC CGAAACTGAT CTGGACCTGG GCCATTACGA GCGCTTTATC
GACATTAGCC TTACCAAGGC CAGCAACGTC ACCGCCGGCA AGGTATACTG GTCCGTCATC
ACGAAAGAAC GGCGCGGCGA TTTCCTCGGC GGTACAGTCC AGGTCATACC CCACATCACC
AATGAGATCA AGGCCCGCCT CCTCCGGGTG GCCGAGGAGA GCGACCCGGA CGTTGTCATT
ACCGAGATTG GCGGTACTGT GGGGGATATC GAATCTCTGC CTTTCCTGGA AGCCATCCGC
CAGATGAAGA GCGATATCGG CCGCGATCGT GTCCTCTATA TCCATGTCAC CCTGGTTCCC
TACCTGCGGG CTGCCGGCGA AGCCAAAACC AAACCTACCC AGCACAGCGT CAAAGAGTTA
CGCAGCATTG GCATCCAGCC GGATATCATC GTCTGCCGGA CTGAACGTCC CTTCTCCCGG
GAAATGGAAG AAAAAATAGC TCTCTTTTGC GATATTGACC CCGATGCCGT CATCCAGGCC
TGGGATGCCG ATTCCATCTA TGAGGTCCCC CTGATGATGC AAGAGGAAGG CCTGGACAGC
ATCGTCGTCG AGCGGCTGAA GTTAAACTGC GGTCCTGCTC AAATGGACGA TTGGCGGGCC
ATGGTAGCAA AGTTAAAGAA TATCACCAGG CACCTGGAGA TCGCCCTGGT GGGCAAATAC
GTCACCCTGC CGGACGCCTA TTTAAGCGTA GTAGAATCCC TGCGCCATGC CGGCATGTAT
CACAACGTCC AGGTGGATAT TCGCTGGATT TATTCGGCTG ACCTGGAGCG GGGGGGCCTT
GAACAACTCC AGGATGTGGC CGGCATCCTG GTACCCGGGG GCTTTGGCGA CCGGGGGGTT
GAAGGGAAGA TCATAGCCGC CCGGTATGCC CGGGAGCATG GTATACCCTT CCTGGGTATT
TGCCTGGGGA TGCAGCTGGC AGTGGTTGAG TTTGCCCGTC ACGTCTGCGG ACTGGAAGCG
GCCAACAGCT CGGAATTCAA CCCGGAAACG CCCCACCCAG TCATCGACCT TTTGCCGGAG
CAAAAGGAGA TTGAAGATAA GGGTGGTACC ATGCGCCTGG GCCTCTATCC CTGCCGCTTA
CAGCCCGGTA CCCGGGCCCA CCAGGCCTAT GGCGAAGAAA TTATCTATGA ACGCCATCGC
CATCGCTATG AATTTAATAA CAACTACCGG GCCGAACTGA CGGCCAGGGG TATGGTTATC
AGCGGCACCT CCCCGGACGA CCGCCTGGTT GAGATTATTG AGCTGGCGGA TCACCCGTGG
TTTGTGGCCT GCCAGTTCCA TCCGGAATTC AAATCCCGGC CTAACCGGCC GCATCCCCTT
TTCCGGGACT TCATCGGCGC CGCCTGCCGG CGGGCCGGGG GGGGTGCAGG CTGA
 
Protein sequence
MPAKFIFVTG GVTSSLGKGI TAASLGRLLK SRGLKVAIQK FDPYINIDPG TMSPYQHGEV 
FVTDDGAETD LDLGHYERFI DISLTKASNV TAGKVYWSVI TKERRGDFLG GTVQVIPHIT
NEIKARLLRV AEESDPDVVI TEIGGTVGDI ESLPFLEAIR QMKSDIGRDR VLYIHVTLVP
YLRAAGEAKT KPTQHSVKEL RSIGIQPDII VCRTERPFSR EMEEKIALFC DIDPDAVIQA
WDADSIYEVP LMMQEEGLDS IVVERLKLNC GPAQMDDWRA MVAKLKNITR HLEIALVGKY
VTLPDAYLSV VESLRHAGMY HNVQVDIRWI YSADLERGGL EQLQDVAGIL VPGGFGDRGV
EGKIIAARYA REHGIPFLGI CLGMQLAVVE FARHVCGLEA ANSSEFNPET PHPVIDLLPE
QKEIEDKGGT MRLGLYPCRL QPGTRAHQAY GEEIIYERHR HRYEFNNNYR AELTARGMVI
SGTSPDDRLV EIIELADHPW FVACQFHPEF KSRPNRPHPL FRDFIGAACR RAGGGAG