Gene Athe_0078 details

Gene Information

Locus tag: Athe_0078
Symbol: pyrG
ID: 7407315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 99273
End bp: 100898
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 41%
IMG OID: 643714488
Product: CTP synthetase
Protein accession: YP_002572011
Protein GI: 222528129
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0504] CTP synthase (UTP-ammonia lyase)
TIGRFAM ID: [TIGR00337] CTP synthase
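
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(99273, 100898)
print(length, protein_length(length))
```

With the values in this record, the sketch reproduces the listed gene length (1626 bp) and protein length (541 aa).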


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAAAC AAGTCAAATA CATCTTTGTC ACAGGCGGGG TAGTGTCCGG GCTTGGCAAA 
GGAATCACTG CAGCATCAAT TGGAAGGCTA TTAAAAGCGC GGGGCTTAAA AGTTACCATG
CAAAAGTTTG ACCCATATAT AAACGTTGAC CCTGGAACTA TGAGTCCTTA CCAGCACGGA
GAGGTGTTTG TCACAGACGA TGGCGCTGAG ACAGACTTAG ACCTTGGTCA CTATGAAAGG
TTTATCGATG AGAACCTCAC AAAAAACAGC AATGTCACAA CAGGAAAGAT TTACTGGTCT
GTAATCCAAA GGGAGCGACG TGGCGACTTT CTTGGTGGCA CTGTACAGGT AATCCCTCAC
ATTACAAATG AAATTAAAGA GAGAATCTAC AGGCTTGGTA AGAGCAACTC CACAGATGTT
GTTATAACAG AAATTGGTGG GACTGTTGGC GATATTGAAA GCCTTCCGTT TTTAGAGGCT
ATAAGACAGG TTGCAACAGA CATTGGAAAA GAAAATGTTC TGTATGTTCA TGTAACTCTT
GTTCCTTACC TATCAAAGTC AGGAGAACTT AAAACAAAAC CAACACAGCA CTCTGTAAAA
GAGCTTAGAT CAATAGGTAT TCAGCCGGAT ATTATTGTTT GCAGAACAGA AAAGCCACTT
TCACAGGAAC TTAAGGCAAA GATTGCCTTG TTTTGTAATT TAAAGCCTGA ATATGTAATT
CAAAATATTG ATGCAGAAAG CCTGTATGAA GTGCCTTTGA TGCTTGAAAA AGAAGGGCTT
GGGGAGATTA TATGCGAAAA GCTTGGATTT GTCTGCACAA AACCAGACTT ATCTGACTGG
ATTGAGATAG TGGAAAAAGA AAAAAATCTC AAAAAGAGTG TCAGAATTGC GCTTGTTGGA
AAGTATGTTG AGCTTCATGA TGCATATCTC TCTGTTGCGG AGGCACTAAA ACATGCGGGA
ATTGCAAATG ATAGCTATGT TGAGATTCTA TGGACAAACG CAGAGGAAGT CACTTACGAC
AATGCGCACG AAAAACTTAA AAGTGCAGAC GGAATTTTAG TGCCGGGCGG ATTTGGTGAC
AGGGGTATTG AGGGTAAGAT TGCAGCCATA AGGTATGCAA GAGAAAACAA AATTCCGTTT
TTTGGAATAT GCCTTGGAAT GCAATGTGCT GTAATTGAGT TTGCAAGAAA TGTGCTTGGG
CTTGAAAGAG CAAACTCAAC AGAGTTTGAT GAAGCAACAC CATATCCTGT GATTGACATC
ATGCCAGAGC AAAAGGACGT ATTCACAAAA GGCGGCACTA TGCGTCTTGG ACTTTATCCG
TGTAAGCTTG AAGAAGGTAC TCTTGCCCAC AGAATTTACA ACGATGAGCT TGTATATGAA
AGGCACAGAC ACAGGTATGA GTTTAATAAT GAATACAAGG AAAAATTCAA GCAAGCCGGC
ATGGTATTTT CAGGAATATC ACCGGACAGA AGGCTTGTAG AGATAATAGA GCTGAAAGAC
CATCCATGGT TTTTGGGTGT GCAATTTCAT CCTGAGTTCA AGTCGCGCCC GCAAAGACCT
CATCCAATTT TTACAGATTT TATAAGAGCA TCACTTGAGA ATAGACAGAA AAAAGAGGGG
ATTTAA
 
Protein sequence
MEKQVKYIFV TGGVVSGLGK GITAASIGRL LKARGLKVTM QKFDPYINVD PGTMSPYQHG 
EVFVTDDGAE TDLDLGHYER FIDENLTKNS NVTTGKIYWS VIQRERRGDF LGGTVQVIPH
ITNEIKERIY RLGKSNSTDV VITEIGGTVG DIESLPFLEA IRQVATDIGK ENVLYVHVTL
VPYLSKSGEL KTKPTQHSVK ELRSIGIQPD IIVCRTEKPL SQELKAKIAL FCNLKPEYVI
QNIDAESLYE VPLMLEKEGL GEIICEKLGF VCTKPDLSDW IEIVEKEKNL KKSVRIALVG
KYVELHDAYL SVAEALKHAG IANDSYVEIL WTNAEEVTYD NAHEKLKSAD GILVPGGFGD
RGIEGKIAAI RYARENKIPF FGICLGMQCA VIEFARNVLG LERANSTEFD EATPYPVIDI
MPEQKDVFTK GGTMRLGLYP CKLEEGTLAH RIYNDELVYE RHRHRYEFNN EYKEKFKQAG
MVFSGISPDR RLVEIIELKD HPWFLGVQFH PEFKSRPQRP HPIFTDFIRA SLENRQKKEG
I
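
As a rough consistency check between the gene and protein sequences above, the following sketch strips the block spacing, computes GC content, and translates the coding sequence. It is an illustration under stated assumptions rather than the database's own method: the helper names are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments NCBI translation table 11 shares (table 11 differs only in which codons may act as starts).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same assignments; only start codons differ.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGAAAAAC AAGTCAAATA CATCTTTGTC"))  # MEKQVKYIFV
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence would confirm that the two listings agree; `gc_content` can be compared with the GC content field in the same way.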