Gene Mlg_1841 details

Gene Information

Locus tag: Mlg_1841
Symbol: pyrG
ID: 4269209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2098888
End bp: 2100510
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 66%
IMG OID: 638126597
Product: CTP synthetase
Protein accession: YP_742675
Protein GI: 114320992
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0504] CTP synthase (UTP-ammonia lyase)
TIGRFAM ID: [TIGR00337] CTP synthase
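
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2098888
end_bp = 2100510
gene_length_bp = 1623
protein_length_aa = 540


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one (assumed terminal stop codon)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions the listed gene length (1623 bp) and protein length (540 aa) are consistent with the start and end coordinates.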


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.41222
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCGAT ACATTTTCAT CACCGGCGGT GTCGTATCGT CCCTGGGAAA GGGCATCACC 
GCCGCTTCCC TGGGTACCAT CCTGCAGGCC CGTGGCCTGA GTGTCTCCAT GACCAAACTG
GATCCCTACA TCAATGTGGA TCCGGGCACT ATGAGCCCTT TCCAGCACGG CGAGGTCTAC
GTCACCGACG ACGGCGCGGA GACCGACCTG GACCTGGGGC ACTACGAGCG TTTCGTGCGC
ACCACCATGA CCCGCAACAA CAACTACACC ACCGGCCGGA TCTACGAATC CGTCATCCGC
AAGGAGCGCC GGGGTGAGTA TCTGGGCGGT ACGGTGCAGG TCATCCCCCA CATCACCGAC
GAGATCAAGC GCAGCATCCA GCAGGGTGCC GACGACGCGG ACATCGCCCT GATCGAGATC
GGCGGTACGG TGGGCGATAT CGAATCGCTC CCCTTCCTGG AGGCCATCCG CCAGATGGGC
GCCGAGCTCG GCCGTGGCCG TTGCCTGTTT ATGCACCTCA CCCTGGTGCC CTTCATCGGT
GCCGCGGGCG AGATGAAGAC CAAGCCCACC CAGCACTCGG TCAAGGAACT GCGCTCCATC
GGCATCCAGC CCGATATCCT GGTCTGCCGG GCCAGTCAGC GCATCCCCGA GGAAGAGCGC
CGCAAGATCG CCCTGTTCAC CAACGTGGAG CCGCGGGCGG TGGTCTCCTG TCTGGACGTG
GACAACATCT ACAAGATCCC CGAGGTGCTG CACCGGCAGG GGCTGGACAA CATCGTTGCG
GAGAAGCTCG GTCTGGAGCT GCCGCCGGCC AGCCTGCAGG ACTGGCAGCG GGTGGTGGAG
GCCATGCAGA ACCCCGAGGG CGAGGTCACC ATCGCCATGG TGGGCAAGTA CGTGGATCTC
ACCGATGCCT ACATGTCGCT CAACGAGTCG CTGCGCCACG CGGGGATACA GACCCGGCAC
CGGGTCAATA TCCGGTACAT CGACTCCGAG GAGTTGGAAC GCGAAGGGAC CCACGCCTTG
GACGGGGTGG ACGCCGTCCT GGTGCCCGGT GGCTTCGGCG AGCGCGGCGT GGAGGGCAAG
ATCCTGGCGG CCCGCTACGC CCGGGAGCGC AAGGTGCCTT ACCTGGGCAT CTGCCTGGGG
ATGCAGGTGG CGGTCATCGA GTACGCCCGC AACGTCGCCG GGCTGGAGGG GGCCCACAGC
ACCGAATTCA CCCGCCACCC CCACCATCCG GTCATCGGCC TGATCACCGA GTGGATGACC
GACGAGGGCA CCGTGGAGCA GCGTAGCGAG GACTCCGACC TGGGCGGCAC CATGCGCCTG
GGGGCCCAGC CCTGTCGGCT GACCGAGGGC TCGCTGGCCC GCCAGGTCTA CGGCAAGGAC
GTGGTGGAGG AGCGCCACCG CCATCGCTAC GAATTCAACA ACCACTACCT GGAGGCGCTG
GAGGCGGCCG GGCTGCGGTT CTCCGGCTGG TCCCACGACC GCAAACTGGT GGAAGTGGTG
GAGCAGCCGG ACCATCCCTG GTTTCTGGCC TGCCAGTTCC ACCCGGAGTT CACCTCCACG
CCCCGCGACG GCCATCCGCT GTTTGCCGCC TTCGTGCGTG CCGCCATCGC CCACAGGGGC
TAA
 
Protein sequence
MTRYIFITGG VVSSLGKGIT AASLGTILQA RGLSVSMTKL DPYINVDPGT MSPFQHGEVY 
VTDDGAETDL DLGHYERFVR TTMTRNNNYT TGRIYESVIR KERRGEYLGG TVQVIPHITD
EIKRSIQQGA DDADIALIEI GGTVGDIESL PFLEAIRQMG AELGRGRCLF MHLTLVPFIG
AAGEMKTKPT QHSVKELRSI GIQPDILVCR ASQRIPEEER RKIALFTNVE PRAVVSCLDV
DNIYKIPEVL HRQGLDNIVA EKLGLELPPA SLQDWQRVVE AMQNPEGEVT IAMVGKYVDL
TDAYMSLNES LRHAGIQTRH RVNIRYIDSE ELEREGTHAL DGVDAVLVPG GFGERGVEGK
ILAARYARER KVPYLGICLG MQVAVIEYAR NVAGLEGAHS TEFTRHPHHP VIGLITEWMT
DEGTVEQRSE DSDLGGTMRL GAQPCRLTEG SLARQVYGKD VVEERHRHRY EFNNHYLEAL
EAAGLRFSGW SHDRKLVEVV EQPDHPWFLA CQFHPEFTST PRDGHPLFAA FVRAAIAHRG
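
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block could be cleaned up and its GC percentage computed is shown below; the helper names are hypothetical, and only the first line of the gene sequence is used as a small demonstration, so the printed GC value is for that line alone and not for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence from this record, used as a demo input.
first_line = "ATGACCCGAT ACATTTTCAT CACCGGCGGT GTCGTATCGT CCCTGGGAAA GGGCATCACC"
dna = clean_sequence(first_line)

# Prints the cleaned length and the GC percentage of this one line.
print(len(dna), round(gc_percent(dna), 1))
```

Passing the full gene sequence through the same two functions would give a value that can be compared with the GC content field in the Gene Information section.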