Gene Aazo_1556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1556 
Symbol 
ID9339348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1627849 
End bp1629471 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content43% 
IMG OID 
ProductGMP synthase large subunit 
Protein accessionYP_003720867 
Protein GI298490690 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.185236 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACAG CGGTGACTCT ACCAACCAAA CAAGCACCTC AAGTACAAGA AAATTTGGGG 
GCTATTAATC GCCAAATAAT TGTTATTTTA GACTTTGGTT CTCAATATTC TGAACTAATC
GCTCGGCGTA TCCGTGAGAC TCAAGTATAT TCTGAAGTTC TCTCCTATCG CACCACAGCA
GAACATTTAC GCCAATTAAA TCCCAAGGGA ATTATCTTGT CTGGTGGGCC AAATTCAGTA
TATAGCGATT ATGCGCCCCA TTGTGACCCA GAAATCTGGA ATTTGGGAAT GCCCATCTTA
GGTGTATGCT ATGGAATGCA GTTGATGGTG AACCAACTAG GTGGGGAAGT CACCAAAGCT
GAGCGAGGTG AATACGGCAA AGCACCATTA TATATAGATG ATCCCACCGA TTTGCTAACT
AATGTTGAAG ATGGCACAAC AATGTGGATG AGTCATGGCG ATTCAGTCAC AAAAATGCCA
TCTGGATTTG AACTATTGGC ACATACAGAA AATACTCCCT GTGCTGCTAT TGCTGACCAT
GACAAGAAAC TTTATGGTGT ACAGTTCCAT CCAGAAGTGG TGCATTCCCT TGGTGGAATA
GCATTAATTC GTAACTTTGT TTACCACATC TGCGACTGTG AACCCACCTG GACAACAGCA
GCTTTTGTGG AAGAATCAAT TCGGGAAATT CGCGCTAGAG TTGGTGAGAA GCGCGTATTA
TTGGCTCTTT CTGGGGGTGT AGATTCTTCC ACTCTGGCAT TTTTGCTGTA TAAAGCCATT
GGTGAACAGC TAACTTGTGT CTTTATCGAC CAAGGCTTCA TGCGTAAGTT AGAGCCTGAA
AGATTACTCA AACTATTCCA AGAACAGTTT CATATTCCGG TGGAATATGT CAATGCTCGC
GATCGCTTTA TTAAAGCTAT TGCTGATATC ATAGACCCCG AAGAAAAACG CCGTCGCATC
GGCCATGAAT TTATACGCGT ATTTGAAGAA ACCTCCAAAA AACTCGGTCA CTTTGACTAT
TTAGCTCAAG GTACTCTCTA TCCTGATGTG ATTGAATCTG CTGATACTAA TGTTGACCCC
AAAACCGGCG AACGAGTAGC AGTAAAAATT AAGAGTCATC ACAATGTTGG TGGTTTACCC
AAAGACCTCA GATTTAAACT CGTTGAACCC TTGCGCAAAC TTTTTAAAGA TGAAGTCCGT
AAAGTAGGTC GTTCCATTGG TTTACCAGAA GAAATTGTCC AAAGACAACC CTTCCCCGGC
CCCGGTTTAG CAATTCGTAT CTTAGGCAAA GTCACAGCCG AAGGGTTAAA TATTTTACGC
GATGCTGATT TAATTGTCCG CCAAGAAATC AATCAGTGCG GCTTGTATCA TGACTATTGG
CAAGCATTTG CCGTATTATT ACCAATTCGG AGTGTAGGCG TAATGGGTGA TAAGCGTACC
TACGCTTACC CCATAGTTTT ACGGATTGTC ACCAGTGAAG ATGGGATGAC AGCAGACTGG
GCCCGTGTAC CTTACGATGT CCTAGAAGGA ATTTCTAACA GAATCGTCAA TGAGGTAAAA
GGCGTTAACC GTGTGGTTTA TGACATCACT TCCAAGCCAC CGGGAACTAT CGAGTGGGAA
TAG
 
Protein sequence
MNTAVTLPTK QAPQVQENLG AINRQIIVIL DFGSQYSELI ARRIRETQVY SEVLSYRTTA 
EHLRQLNPKG IILSGGPNSV YSDYAPHCDP EIWNLGMPIL GVCYGMQLMV NQLGGEVTKA
ERGEYGKAPL YIDDPTDLLT NVEDGTTMWM SHGDSVTKMP SGFELLAHTE NTPCAAIADH
DKKLYGVQFH PEVVHSLGGI ALIRNFVYHI CDCEPTWTTA AFVEESIREI RARVGEKRVL
LALSGGVDSS TLAFLLYKAI GEQLTCVFID QGFMRKLEPE RLLKLFQEQF HIPVEYVNAR
DRFIKAIADI IDPEEKRRRI GHEFIRVFEE TSKKLGHFDY LAQGTLYPDV IESADTNVDP
KTGERVAVKI KSHHNVGGLP KDLRFKLVEP LRKLFKDEVR KVGRSIGLPE EIVQRQPFPG
PGLAIRILGK VTAEGLNILR DADLIVRQEI NQCGLYHDYW QAFAVLLPIR SVGVMGDKRT
YAYPIVLRIV TSEDGMTADW ARVPYDVLEG ISNRIVNEVK GVNRVVYDIT SKPPGTIEWE