Gene HS_0421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0421 
SymbolguaA 
ID4239897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp449845 
End bp451416 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content43% 
IMG OID638103964 
ProductGMP synthase 
Protein accessionYP_718631 
Protein GI113460567 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0518] GMP synthase - Glutamine amidotransferase domain
[COG0519] GMP synthase, PP-ATPase domain/subunit 
TIGRFAM ID[TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit
[TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAATA TTCACCATCA CAAAATTTTA ATTTTAGACT TCGGTTCACA ATATACGCAA 
CTTATCGCAC GTCGTGTACG TGAAATCGGG GTATATTGCG AACTTTGGGC TTGGGACGTT
ACGGAAGAAC AAATTCGTGA GTTTAACCCA ACAGGAATTA TTCTTTCAGG CGGACCGGAA
AGTACTACTG AGGCAAATAG TCCACGTGCA CCGGAATACG TTTTCAATGC AGGCGTACCT
GTTTTAGGTA TTTGCTACGG CATGCAAACC ATGGCAATGC AGTTGGGCGG TTTAACAGAA
ACTTCTACGC ATCGTGAGTT TGGCTATGCT GAAGTTTCTC TACGAAATCC GACCGCACTT
TTTGATCATC TCAATGATGA TGCGACCACT TCTCAGACTA CACTTGATGT TTGGATGAGC
CACGGCGATA AAGTGACTCG CCTACCTGAT AATTTCCAAA TTACAGGCAT GACCTCGACT
TGCCCGATTG CGGCTATGTC AGATGAAAGC CGTCGTTTCT ATGGCGTGCA ATTTCACCCC
GAAGTTACCC ACACAAAGTG CGGTCAAAAA TTACTGCAAA ATTTTGTGGT AGATATTTGC
GGTTGCGAAA CCAATTGGAC CGCAGAAAAT ATCATCGAAG ATGCAGTGGC TCGCATTAAA
GCACAAGTGG GCGGTGATGA AGTAATTTTA GGCTTGTCAG GTGGCGTGGA TTCATCTGTT
ACCGCACTTT TATTGCATCG TGCCATCGGT AAAAATTTAC ATTGTGTCTT TGTCGATAAC
GGCTTACTCC GTCTAAATGA AGGCGATCAG GTCATGGAAA TGTTCGGTGA TAAATTCGGC
TTGAATATTA TTCGAGTAGA AGCAGAAGAT CGCTTTTTAG AAGCATTAAA AGGAATTGAT
GAACCGGAAG CAAAACGCAA AACTATCGGT AAAGTATTCG TTGATGTATT CGATGATGAA
GCAAAAAAAT TAACTGACGT AAAATGGTTA GCTCAAGGAA CGATTTACCC TGATGTTATC
GAATCGGCAG CAAGCAAAAC CGGAAAAGCC CACGTTATCA AATCTCACCA CAATGTAGGA
GGCTTACCCG ATTATATGAA ATTAGGTTTA GTTGAGCCAT TACGTGAACT CTTCAAAGAT
GAAGTGCGTA AAATCGGCTT GGCACTTGGC TTGCCTGCAG AAATGCTTAA TCGCCACCCA
TTCCCAGGCC CTGGATTAGG TGTACGTGTA CTGGGTGAAA TCAAAAAAGA ATATTGCGAT
TTACTGCGTA AAGCCGATGC AATTTTTATC GAAGAACTGC ATAAAGCAGA TTGGTACTAC
AAAGTCAGCC AAGCGTTCAG TGTTTTCTTG CCGGTAAAAT CTGTCGGGGT AATGGGCGAC
GGTCGTAAAT ATGATTGGGT TATTAGCCTA AGAGCGGTCG AAACCATTGA CTTTATGACC
GCACATTGGG CAAACCTACC TTATGATTTA TTAGGCAAAA TCTCAAATCG CATTATCAAC
GAAGTCAACA GCATCTCCCG TGTAGTTTAT GACATCTCAG GAAAACCACC AGCAACGATT
GAGTGGGAGT AG
 
Protein sequence
MTNIHHHKIL ILDFGSQYTQ LIARRVREIG VYCELWAWDV TEEQIREFNP TGIILSGGPE 
STTEANSPRA PEYVFNAGVP VLGICYGMQT MAMQLGGLTE TSTHREFGYA EVSLRNPTAL
FDHLNDDATT SQTTLDVWMS HGDKVTRLPD NFQITGMTST CPIAAMSDES RRFYGVQFHP
EVTHTKCGQK LLQNFVVDIC GCETNWTAEN IIEDAVARIK AQVGGDEVIL GLSGGVDSSV
TALLLHRAIG KNLHCVFVDN GLLRLNEGDQ VMEMFGDKFG LNIIRVEAED RFLEALKGID
EPEAKRKTIG KVFVDVFDDE AKKLTDVKWL AQGTIYPDVI ESAASKTGKA HVIKSHHNVG
GLPDYMKLGL VEPLRELFKD EVRKIGLALG LPAEMLNRHP FPGPGLGVRV LGEIKKEYCD
LLRKADAIFI EELHKADWYY KVSQAFSVFL PVKSVGVMGD GRKYDWVISL RAVETIDFMT
AHWANLPYDL LGKISNRIIN EVNSISRVVY DISGKPPATI EWE