Gene Haur_1743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1743 
SymbolpyrG 
ID5733630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2028567 
End bp2030171 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content50% 
IMG OID641278885 
ProductCTP synthetase 
Protein accessionYP_001544514 
Protein GI159898267 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0504] CTP synthase (UTP-ammonia lyase) 
TIGRFAM ID[TIGR00337] CTP synthase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAAGT ATATTTTCGT GACGGGCGGC GTAGCTTCAT CGGTGGGCAA GGGAATCACC 
GTGGCCTCGC TGGGCCGTTT GCTGAAAAGT CGTGGCATTA GCGTTTCAAT TCAAAAACTT
GATCCATATA TCAATGTTGA CCCTGGTACG ATGTCGCCTT ACCAACATGG CGAGGTCTTT
GTGACCGAAG ATGGGGCTGA AACCGACCTC GATTTGGGCC ACTACGAACG CTTTATTGAT
GAAAATTTGT CCCGCGTTAA CAACATTACA ACCGGCCAAA TTTATTCTTC AGTGATTCAA
AAAGAGCGCC GTGGCGATTT CCTCGGCGGC ACGATTCAAG TAATTCCGCA TATCACCAAC
GAAATTAAAA GCCGCGTCGC TTTGGTAGCC AAGAATACCA ATGCTGATGT GGTGATTGTC
GAAATTGGCG GCACGGTTGG CGATATCGAA AGTTTACCCT TCCTCGAAGC CATTCGACAA
ATGAAAAAAG ATGTGGGCCG CGATAACGTG ATGTATATTC ACGTGACCTT GTTGCCTTAT
CTCCAAGCCA GTGGCGAACT CAAAACCAAG CCGACCCAAC ACTCAATCGC CGAATTACGC
CGCGTCGGTA TCTCGCCAGC CGTGGTGTTG TGTCGCTCCG ATTTGCCAGT TGATGATGAT
ATGCGTGAGA AAATCGCTCT GTTTGCTGAT TTACCCAACG AGGCTGTGGT AGCTCTGCCT
ACCGTTGATT CGATTTATGA AGTGCCGTTG GTGCTTGAGG AAGCAGGCTT AGGCGATTTA
ATTATCGAAC GCTTGGCTTT GGCTGCTCAG CCAGTCCAGC TCGATGAATG GCGCTCACTC
GTTGCACGTA TCAAACAACC CAAACGCCAT ACCACGGTGG CGATTGTTGG CAAATATGTT
GAACTGCGCG ATGCCTATAT GAGTGTGGCT GAATCGGTGC GCCACGCTGG TTGGGCCCAA
GATATTCAAG TTGATATCAA ATGGGTCTCT TCAGAAGAGC TTGAAGTGGC TGATCCTGTA
ACCATGCTTG GCGATGTCCA AGGGATTATC GTGCCTGGCG GCTTTGGCTA TCGTGGGGTC
GAGGGCAAAA TTCGGGCTGT GCGCTATGCT CGCGAAAACA AAATTCCCTT CTTGGGGCTT
TGTTTAGGCA TGCAATGTGC CACGATTGAA TTTGCTCGCT TTGCCTTGAA TGCGCCCGAT
GCCAACTCAA CCGAGTTTAA CCCGAACACC AAACTGCCAG TGATCGACTT TATGCCCGAT
CAATTGGATA TTAGTGATAA GGGTGGGACG ATGCGCTTGG GGGTTTACCC ATGTATTTTA
GCCCCCGATA CTAAGGCTGC CAAAGCCTAT GGCCGTGAAT TAGCCTTGGA ACGCCATCGC
CATCGCTTTG AGTTTAACAA CAAATATCGT AAAGCCATGG AAGCCGCTGG TTTTGTGATT
AGTGGCCACT CGCCTGATGG CCGTTTGGTC GAGATTGTCG AGTTGCGCGA CCATCCATGG
TTTGTGGCTT CGCAGTTCCA CCCTGAATTC AAATCGCGTC CAAACAACCC ACATCCATTG
TTCCGCGATT TTGTGCAGGC TGCCTTGGAA CAAATTGCTG AATAA
 
Protein sequence
MTKYIFVTGG VASSVGKGIT VASLGRLLKS RGISVSIQKL DPYINVDPGT MSPYQHGEVF 
VTEDGAETDL DLGHYERFID ENLSRVNNIT TGQIYSSVIQ KERRGDFLGG TIQVIPHITN
EIKSRVALVA KNTNADVVIV EIGGTVGDIE SLPFLEAIRQ MKKDVGRDNV MYIHVTLLPY
LQASGELKTK PTQHSIAELR RVGISPAVVL CRSDLPVDDD MREKIALFAD LPNEAVVALP
TVDSIYEVPL VLEEAGLGDL IIERLALAAQ PVQLDEWRSL VARIKQPKRH TTVAIVGKYV
ELRDAYMSVA ESVRHAGWAQ DIQVDIKWVS SEELEVADPV TMLGDVQGII VPGGFGYRGV
EGKIRAVRYA RENKIPFLGL CLGMQCATIE FARFALNAPD ANSTEFNPNT KLPVIDFMPD
QLDISDKGGT MRLGVYPCIL APDTKAAKAY GRELALERHR HRFEFNNKYR KAMEAAGFVI
SGHSPDGRLV EIVELRDHPW FVASQFHPEF KSRPNNPHPL FRDFVQAALE QIAE