Gene TM1040_1973 details


Gene Information

Locus tag: TM1040_1973
Symbol: pyrG
ID: 4077157
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2077611
End bp: 2079254
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 61%
IMG OID: 638007288
Product: CTP synthetase
Protein accession: YP_613967
Protein GI: 99081813
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0504] CTP synthase (UTP-ammonia lyase)
TIGRFAM ID: [TIGR00337] CTP synthase
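
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon (the listed gene sequence ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2077611, 2079254)
print(length)                  # 1644
print(protein_length(length))  # 547
```

Both values agree with the Gene Length and Protein Length fields of this record.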


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.112708
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGTT TCATCTTTAT TACTGGCGGT GTCGTCTCTT CTCTGGGCAA GGGACTGGCC 
TCTGCTGCGC TGGGTGCGCT GCTGCAGGCA CGGGGCTACT CCGTGCGCCT GCGCAAACTC
GACCCCTATC TGAACGTCGA TCCGGGCACC ATGAGCCCGT TTGAACACGG CGAGGTCTTT
GTGACCGACG ACGGGGCAGA AACCGACCTC GATCTGGGCC ACTACGAGCG CTTCACCGGC
GTGCCAGCGC GTAAGACCGA CTCGATCTCC TCCGGGCGGA TCTACACCAA CGTGCTCGAG
AAAGAGCGCC GCGGCGACTA CCTCGGCAAA ACGATCCAGG TGATTCCGCA TGTCACCAAC
GAGATCAAGG ACTTCATCTC CATCGGAGAG GATGAGGTCG ATTTCATGCT CTGCGAGATC
GGCGGCACCG TGGGTGACAT CGAGGGGCTG CCCTTCTTTG AGGCCATCCG CCAGTTCAGC
CAGGACAAGC CGCGCGGTCA GTGTATCTTC ATGCACCTGA CGCTGCTTCC CTACATCAAG
GCCTCTGGCG AGTTGAAAAC CAAACCGACC CAGCACTCCG TGAAGGAGCT GCGTTCCATC
GGTCTGGCCC CCGATATTCT GGTCTGCCGC TCCGAGGGTC CGATCCCGGT GAAAGAACGC
GAAAAGCTGG CGCTCTTCTG CAACGTGCGC GCCGACAGCG TGATTGCGGC GCAGGATCTG
AAATCCATCT ACGAAGCGCC GCTGGCCTAT CACCGCGAAG GACTCGATCA GGCGGTTCTG
GACGCCTTTG GCATCGCCCC TGCCCCGCGC CCGACGCTGG ACACCTGGGA AGACGTGGCC
GACCGCATCT ACAACCCCGA GGGCGAAGTC AAAGTCGCCA TCGTCGGCAA ATACACCCAG
CTGGAAGACG CCTATAAATC CATCGCCGAG GCGCTCACCC ATGGTGGCAT GGCGAACCGG
GTAAAGGTGC GTATCGAATG GGTCGATGCA GAGCTCTTCG ACAAGGAAGA CGCAACGCCG
TATCTGCAGG GTTTTCACGC GATCCTCGTG CCCGGCGGCT TTGGCGAGCG CGGCACCGAA
GGCAAGATCA AGGCGGCGCA ATATGCCCGC GAGCACAAGG TGCCCTACCT CGGTATCTGC
CTCGGTATGC AGATGGCCGT GATCGAAGCT GCCCGCAACG TGGCCGGCAT CGAGGAAGCA
GGCTCCGAAG AGTTCGACCA TGAAGCGGGC AAAAAGCGTT TTGAGCCCGT TGTTTACCAC
CTCAAGGAAT GGGTGCAGGG CAACCACAAG GTGAGCCGCA GCGCCGATGA CGACAAAGGC
GGCACCATGC GTCTTGGCGC CTATGATGCG GCGCTTACCG AAGGTTCCAA AGTGGCCGAG
GTCTATGGCT CCAACGCGAT TGAGGAGCGT CACCGCCACC GCTATGAGGT GGACATCAAG
TACCGCGAGC AACTCGAAGC CTGTGGTCTG AAGTTCACTG GCATGAGCCC GGATGGCCGC
CTGCCCGAGA TCGTGGAATG GACCGATCAT CCGTGGTTTA TCGGCGTGCA GTTCCACCCG
GAACTCAAGT CCAAGCCCTT TGATCCGCAC CCGCTGTTCA AGGATTTTGT ACGCGCGGCC
AAGGACACAT CCCGCTTGGT CTAA
 
Protein sequence
MARFIFITGG VVSSLGKGLA SAALGALLQA RGYSVRLRKL DPYLNVDPGT MSPFEHGEVF 
VTDDGAETDL DLGHYERFTG VPARKTDSIS SGRIYTNVLE KERRGDYLGK TIQVIPHVTN
EIKDFISIGE DEVDFMLCEI GGTVGDIEGL PFFEAIRQFS QDKPRGQCIF MHLTLLPYIK
ASGELKTKPT QHSVKELRSI GLAPDILVCR SEGPIPVKER EKLALFCNVR ADSVIAAQDL
KSIYEAPLAY HREGLDQAVL DAFGIAPAPR PTLDTWEDVA DRIYNPEGEV KVAIVGKYTQ
LEDAYKSIAE ALTHGGMANR VKVRIEWVDA ELFDKEDATP YLQGFHAILV PGGFGERGTE
GKIKAAQYAR EHKVPYLGIC LGMQMAVIEA ARNVAGIEEA GSEEFDHEAG KKRFEPVVYH
LKEWVQGNHK VSRSADDDKG GTMRLGAYDA ALTEGSKVAE VYGSNAIEER HRHRYEVDIK
YREQLEACGL KFTGMSPDGR LPEIVEWTDH PWFIGVQFHP ELKSKPFDPH PLFKDFVRAA
KDTSRLV
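
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch, assuming the blocks have been copied into a plain string. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used here for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace and upper-casing."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence as printed in this record.
first_line = "ATGGCGCGTT TCATCTTTAT TACTGGCGGT GTCGTCTCTT CTCTGGGCAA GGGACTGGCC"
last_line = "AAGGACACAT CCCGCTTGGT CTAA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))               # 60
print(head.startswith("ATG"))  # True: ATG start codon
print(tail.endswith("TAA"))    # True: TAA stop codon
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field reported above.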