Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_1054 |
Symbol | guaA |
ID | 3678606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 1283341 |
End bp | 1284963 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637716390 |
Product | GMP synthase |
Protein accession | YP_321573 |
Protein GI | 75907277 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0518] GMP synthase - Glutamine amidotransferase domain [COG0519] GMP synthase, PP-ATPase domain/subunit |
TIGRFAM ID | [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0138128 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000561419 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATACAG CGGTGACTCT ACTAACCGAA CAAGCCCCTC AACCAATAGA AGAGTTTGGG CAGCTTGAGC GTCAAATCAT TATCATATTA GACTTCGGCT CTCAGTATTC AGAACTGATT GCCCGGCGCA TCCGCGAGAC TCAAGTATAC TCTGAAGTTT TGTCTTATCG CACCTCTGCT GAACATTTAC GCCAACTAAA TCCCAAGGGC ATCATTCTTT CTGGTGGTCC CAGTTCAGTT TATAGCGATC GCGCTCCCCA TTGTGACCCA GAAATTTGGA ATTTGGGAGT TCCCATTTTG GGTGTCTGCT ACGGAATGCA GCTGATGGTC AACCAACTCG GCGGCGAAGT AGCCAAAGCT GACCGGGGTG AGTATGGCAA AGCATCATTA CATATTGATG ATCCCACCGA CCTACTGACC AACGTTGAAG ATGGCACAAC GATGTGGATG AGTCATGGTG ATTCAGTCAC AAAAATGCCC CCAGGGTTTG AAGTGCTGGC ACATACAGAT AATACTCCCT GTGCTGCTGT TGCTGACCAC GACAAAAAGC TTTATGGCGT GCAGTTCCAT CCAGAAGTAG TCCATTCCAT CGGTGGTTTG GCATTAATCC GCAACTTTGT GTATCACATC TGCGAGTGTG AACCCACCTG GACAACAGCC GCTTTTGTGG AAGAAGCTAT TCGGGAAGTT CGGGCTAAAG TTGGTGACAA GCGAGTACTA TTGGCGCTTT CGGGAGGAGT CGATTCTTCT ACACTGGCTT TCTTGATGCA CAAAGCCATC GGCGACCAAT TGACCTGTGT ATTTATAGAC CAAGGCTTTA TGCGGAAGTA TGAGCCAGAA AGGTTGGTGA AACTATTTCA AGAGCAGTTT CACATCCCTG TTGAATATGT TAACGCCCGC GATCGCTTCT TAGATATCAT GGTTGGCGTG ACAGACCCAG AAGAAAAACG TCGTCGCATT GGACATGAAT TCATCCAGGT ATTTGAAGAA ACATCGAGAA ATTTGGGACC CTTTGACTAT CTAGCACAAG GGACTCTTTA CCCAGATGTC ATTGAATCAG CTGATACCAA TGTTGACCCC CAAACCGGGG AACGGGTAGC AGTAAAAATC AAAAGCCATC ACAACGTTGG TGGATTACCC AAAGACTTGA GATTTAAACT GGTTGAACCC CTGCGGAAAC TATTTAAAGA TGAAGTCCGT AAAGTAGGGC GGTCTGTTGG CTTACCAGAA GAGATTGTCC AACGCCAGCC ATTTCCTGGC CCTGGTTTAG CTATTCGGAT TTTAGGTGAA GTCACCGCCG ACAGATTGAA CATTCTGCGC GATGCTGATT TGATTGTGCG CCAAGAAATT AACCAACGTG GTTTATACAA CGAATATTGG CAAGCATTCG CCGTCTTACT ACCTATCCGT AGTGTGGGCG TAATGGGTGA TCAACGCACC TATGCTTATC CTATAGTCTT ACGCATTGTT AAAAGCGAAG ATGGCATGAC AGCAGATTGG GCGCGTGTAC CTTACGATGT TTTAGAAGCA ATTTCTAACC GCATTGTCAA CGAGGTTAAA GGCGTTAATC GCGTAGTGTT TGATATTACC TCCAAGCCAC CCGGAACCAT CGAGTGGGAA TAA
|
Protein sequence | MNTAVTLLTE QAPQPIEEFG QLERQIIIIL DFGSQYSELI ARRIRETQVY SEVLSYRTSA EHLRQLNPKG IILSGGPSSV YSDRAPHCDP EIWNLGVPIL GVCYGMQLMV NQLGGEVAKA DRGEYGKASL HIDDPTDLLT NVEDGTTMWM SHGDSVTKMP PGFEVLAHTD NTPCAAVADH DKKLYGVQFH PEVVHSIGGL ALIRNFVYHI CECEPTWTTA AFVEEAIREV RAKVGDKRVL LALSGGVDSS TLAFLMHKAI GDQLTCVFID QGFMRKYEPE RLVKLFQEQF HIPVEYVNAR DRFLDIMVGV TDPEEKRRRI GHEFIQVFEE TSRNLGPFDY LAQGTLYPDV IESADTNVDP QTGERVAVKI KSHHNVGGLP KDLRFKLVEP LRKLFKDEVR KVGRSVGLPE EIVQRQPFPG PGLAIRILGE VTADRLNILR DADLIVRQEI NQRGLYNEYW QAFAVLLPIR SVGVMGDQRT YAYPIVLRIV KSEDGMTADW ARVPYDVLEA ISNRIVNEVK GVNRVVFDIT SKPPGTIEWE
|
| |