Gene Haur_3478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3478 
SymbolguaA 
ID5735339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4381663 
End bp4383207 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content54% 
IMG OID641280625 
ProductGMP synthase 
Protein accessionYP_001546242 
Protein GI159899995 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0518] GMP synthase - Glutamine amidotransferase domain
[COG0519] GMP synthase, PP-ATPase domain/subunit 
TIGRFAM ID[TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit
[TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACAGA ATGAAGCAAT TGTGGTGCTT GATTATGGCT CACAATATAG TCAATTGATT 
GTGCGGCGGG TTCGTGAAGC GGGCGTTTAT AGCGAGTTAG TGCGGTTCGA TGCCGACGAG
GCCACGGTTG CTGCGCTCAA CCCCAAAGGC ATCATTCTTT CGGGTGGGCC GAATAGCGTC
TATGCCGAGG GTGCGCCGCA ACTGCCAGCA TGGGTTTTGG CCAGCAACCT GCCTGTGTTG
GGCATTTGCT ATGGTTTGCA ATTGCAGGCC CATCATTTGG GTGGCAAAGT CGAGCCTTCC
AACGACCGCG AATTTGGCCA TGCCGTGATT ACCATTACCG CTGAATCGCC ACTCTTAGCC
GATATTCCGA CCGAACATTC GGTGTGGATG AGCCACGGCG ATCGCTTGGA AACCCTGCCT
GCTGGCTGGC ATCCAATCGC GATCAGCCCC AACTCGCCCT ATGCCGCTGC TGCCGATGAA
GATCGACGCT GGTATGGCTT GCAATTTCAC CCTGAGGTCG TGCACTCACC CTATGGCAAG
CAAGTGCTGC ACAATTTTCT CTACCGCATT TGCGCATGTG CTGGCAGTTG GCAGCCTAGC
CACTTTATCC AAGAGGCGGT TGAGCGAATT CAGCAGCAAG CGCCCAGCGG CCAAGTGATT
TGTGCGCTCA GCGGTGGAGT TGATTCGGCG GTAGCAGCGT TGTTGGCCCA CCAAGCCATT
GTCGAACGCT TGACCTGTAT TTTTGTCGAT AACGGTTTGT TGCGCCGCGG CGAGGCCGAG
CAAGTTTCAC GCACCTTCCG CGATCACTTC CAAATCGAAT TGATTACGGT TGATGCCGCT
GAAGAATTTT TGGCAGCACT GGCAGGCGTG ACCGATCCTG AGCAAAAACG CAAGATCATC
GGCGAAAAAT TTATTCGCAT TTTCGAGCGC GAAGCCCGTA ACCTCGATGA CGCGGCCTAT
TTGGTACAAG GCACGCTCTA TCCCGATGTG ATCGAATCGA CAGCGCCAGA TCGCAATGTG
GCAGTCAAGA TCAAAACCCA TCATAATGTT GGCGGCTTGC CCGATGATAT GAAACTCAAG
GTGATCGAGC CATTACGCAT GCTGTTTAAG GATGAAGTGC GGGCAGCCGG TTTGGCCTTG
GGTTTGCCCG AAGATTGGGT CTGGCGACAT CCCTTCCCTG GGCCAGGTTT AGCGGTGCGG
TTGCTGGGCG CAATCAGCTT TGAGCGCTTA GAAACCTTGC GCCATGCCGA TGCAATTTTC
CTTGAAGAAT TGCGGGCAGC AGGCTTGTAT CGCGCGACGC AGCAGGCCTT TGCGGTGTTG
TTGCCAGTGC AAAGTGTCGG CGTGATGGGT GATTATCGCA CCTATGCCGA TACCATTGCG
ATTCGTGCGG TGAGCACCGA AGATTTTATG ACCGCCGATT GGGCTCGCTT GCCCTATGAA
TTGTTGGCCA AAGTCTCCAG CCGCATTGTC AACGAGGTGA CGGGAGTTAA TCGGGTGGTG
TATGATATTT CATCAAAACC ACCAGCCACA ATCGAATGGG AATAA
 
Protein sequence
MTQNEAIVVL DYGSQYSQLI VRRVREAGVY SELVRFDADE ATVAALNPKG IILSGGPNSV 
YAEGAPQLPA WVLASNLPVL GICYGLQLQA HHLGGKVEPS NDREFGHAVI TITAESPLLA
DIPTEHSVWM SHGDRLETLP AGWHPIAISP NSPYAAAADE DRRWYGLQFH PEVVHSPYGK
QVLHNFLYRI CACAGSWQPS HFIQEAVERI QQQAPSGQVI CALSGGVDSA VAALLAHQAI
VERLTCIFVD NGLLRRGEAE QVSRTFRDHF QIELITVDAA EEFLAALAGV TDPEQKRKII
GEKFIRIFER EARNLDDAAY LVQGTLYPDV IESTAPDRNV AVKIKTHHNV GGLPDDMKLK
VIEPLRMLFK DEVRAAGLAL GLPEDWVWRH PFPGPGLAVR LLGAISFERL ETLRHADAIF
LEELRAAGLY RATQQAFAVL LPVQSVGVMG DYRTYADTIA IRAVSTEDFM TADWARLPYE
LLAKVSSRIV NEVTGVNRVV YDISSKPPAT IEWE