Gene Elen_2106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2106 
SymbolpyrG 
ID8416424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2477136 
End bp2478737 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content67% 
IMG OID645025089 
ProductCTP synthetase 
Protein accessionYP_003182458 
Protein GI257791852 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0504] CTP synthase (UTP-ammonia lyase) 
TIGRFAM ID[TIGR00337] CTP synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0473416 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.014711 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGC ACATCTTCGT TACCGGGGGC GTCGTCTCGT CGCTGGGCAA GGGCATTACG 
GCCGCGTCGC TGGGGCACTT GCTCAAGGCG CGCGGGTACC AAGTGACCAT GCAGAAGATG
GATCCGTACC TCAACGTGGA TCCGGGAACC ATGTCGCCGT TCCAGCACGG CGAAGTGTTC
GTCACCGAGG ACGGGCACGA AGGCGATCTC GACCTGGGGC ACTACGAGCG CTTCATCGAC
GAGAACCTCA CGCGCGAGTC CAACTTCACG CAGGGATCCA TCTACCAGAG CCTCATCGCG
CGCGAGCGGC GCGGCGACTA CCTGGGCGGC ACCGTGCAGG TGATTCCGCA CGTCACCGAG
GCCATCAAGG CGCGCCTGCG GCGCATCGCC GACCAAACCG GCGCCGACAT CGTCATATCC
GAGATCGGCG GCACCGTCGG CGACATCGAG TCGCTGCCGT TCATCGAGGC CGCGCGCCAG
TTCAAGAAGG AGCTGCCGTA CGGCGACGTG CTGTTCGTTC ACGTGACGCT GGTGCCTTAT
ATAGCCGCAG CGCACGAGGT GAAGACGAAG CCCACGCAGC ATTCGGTGAA GGAGCTGCGC
TCCATCGGCG TGCAGCCCGA CTTCATCGTG TGCCGCTCGG ACCACGAGAT CGAGGACGGC
GTGCGCGAGA AGATCGCGCT GTTCTGCGAC GTGCGGCCCG AGGAGGTGCT GGTCTGCACC
GACGCGCCGT CCATCTACGA GGTGCCGCTG GGCTTGCACG AGCAGCGCTT CGACGAGATG
GTGCTCGACC GGTTGTGCCT CGAGCGGCGT TCGGCCGACT TGGCGCCCCT GCGCTCGTTC
TTGGCCGCGG CGGACGCTTG CGATGAGGAG GTGGACGTGG CCGTGGTGGG CAAGTACGTG
AGCCTGCCCG ACGCGTATCT GTCGGTGATC GAGGCGCTCG GGCATGCGGG GGTGCGCTGC
GGATGCCGCG TGAACGTGCA CCTCGTGGAC GGCGAGGAGC TGTCGGACGC GAACGCGGGG
GCGGTGCTGG GCGGCATGGA CGGCATCCTC GTGCCGGGCG GTTTCGGCCA GCGCGCCTTC
GAGGGCAAGA TCGCCGCGGC TCGCTACGCG CGCAAGCAGG GCATTCCCTA CTTGGGCATC
TGCCTGGGGC TGCAGGCTGC CGTATGCATG TTCGCGCGCG ACGCGGCCGG CATGTCCGGG
GCCACGTCGG CCGAGTTCGA TGCCGAGGCG GCCTTTCCCG TGGTGGACCT CATGCCCGAG
CAGGAGGATG TGGAGGGCAA GGGCGGAACC ATGCGCCTGG GAGCTTATCC GTGCAAGGTG
GAGCCGGGCA CGAAGGCGTT CGAGGCTTAC GGCGAATCTA TTATATATGA GCGTCACCGT
CATCGTTATG AGGTGAACAA CGCCTTCCGC GAGCGCTTGG TCGAAGCGGG GCTTTCGGTG
TCGGGCGCGT CGCCGGACGG GCGGCTCGTG GAGATGGTGG AGCTGCCCGG CCATCCGTGG
TTCGTCGCCA GCCAGGGCCA TCCCGAGTTC AAGAGCCGCC CCACGCGGCC CCATCCGCTG
TTCTGCGGTT TCGTGAGCGC CGCCGCCGTG CGTCGCGCCT AG
 
Protein sequence
MAKHIFVTGG VVSSLGKGIT AASLGHLLKA RGYQVTMQKM DPYLNVDPGT MSPFQHGEVF 
VTEDGHEGDL DLGHYERFID ENLTRESNFT QGSIYQSLIA RERRGDYLGG TVQVIPHVTE
AIKARLRRIA DQTGADIVIS EIGGTVGDIE SLPFIEAARQ FKKELPYGDV LFVHVTLVPY
IAAAHEVKTK PTQHSVKELR SIGVQPDFIV CRSDHEIEDG VREKIALFCD VRPEEVLVCT
DAPSIYEVPL GLHEQRFDEM VLDRLCLERR SADLAPLRSF LAAADACDEE VDVAVVGKYV
SLPDAYLSVI EALGHAGVRC GCRVNVHLVD GEELSDANAG AVLGGMDGIL VPGGFGQRAF
EGKIAAARYA RKQGIPYLGI CLGLQAAVCM FARDAAGMSG ATSAEFDAEA AFPVVDLMPE
QEDVEGKGGT MRLGAYPCKV EPGTKAFEAY GESIIYERHR HRYEVNNAFR ERLVEAGLSV
SGASPDGRLV EMVELPGHPW FVASQGHPEF KSRPTRPHPL FCGFVSAAAV RRA