Gene Aazo_4831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4831 
Symbol 
ID9342638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4941514 
End bp4944450 
Gene Length2937 bp 
Protein Length978 aa 
Translation table11 
GC content40% 
IMG OID 
Productexcinuclease ABC subunit A 
Protein accessionYP_003723110 
Protein GI298492933 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGAAC ACCTAGCAAC ATCTTTAAAT CATCGTCTTC CTTACCCCGA CAACAGCCAG 
AATACTATTC GGATTCGGGG TGCTAGGCAG CATAATCTGA AAAATATTGA TTTAGAACTA
CCTCGCGATC GCTTGATTGT CTTTACCGGC GTTTCTGGTT CTGGTAAGTC GTCTTTGGCG
TTTGATACCA TCTTTGCAGA AGGTCAGCGG CGTTATGTGG AATCTCTCAG CGCCTATGCT
CGGCAATTTT TGGGACAGTT GGATAAACCT GATGTGGAAG CGATTGAAGG TTTAAGTCCC
GCGATTTCCA TTGATCAAAA ATCAACTTCC CATAATCCTC GTTCGACGGT GGGAACGGTA
ACGGAAATTT ATGATTATCT GCGCTTGTTA TATGGTCGTG CTGGTGAACC CCATTGTCCT
AAATGCGATC GCTGTATTTC TCCCCAAACC ATTGACGAGA TGTGCGATCG CATCATGGAA
CTTCCAGACC GCACCCGTTT CCAAATTCTC GCACCTGTCG TCCGGGGGAA AAAGGGAACT
CACGGCAAAT TATTATCTAG TTTAGCCGCA CAAGGATTTG GCAGGATTAA GGTCAATGGA
GAGGTTTTAG AACTATCTGA CTTCATTGAG TTAGATAAAA ATATTACACA TAATATAGAA
CTGGTTATTG ATAGGTTAGT TAAAAAAGAC GGTATTCAAG AGCGTTTAGT TGATTCTCTC
ACCACCTGCC TTAACCAATC CAACGGGATT TCTATAATCG AAGTATTAAA TCATACCTGC
CATAAAGTAA AGCAGGAGGT ACCACATAGT AAGTATATAT CAAATGGGGG AGAAAATCAA
GATATTTCCG CACAAGAATT AGCATCAGAA ATGGTATTTT CAGAAAACTT TGCTTGTCCA
GAACATGGTG CAATTATGGA AGAATTATCA CCACGTTTGT TTTCTTTTAA TTCTCCCTAT
GGTGCTTGTT CCCATTGTCA TGGCATTGGG ACTTTAAAAA GATTTTCTCC AGAACTGGTC
ATACCTAACC AAGATTCGCC CATGTATGCT GCGCTCGCAC CTTGGTCAGA AAAAGAAAAT
TCCTACTATT TAGAATTACT GTATAGTTTA GGACAAGCCT ATGGATTTGA ATTACAGACC
CAGTGGAGCA AATTGACACC AGAACAGCAG CAGATAATTT TATATGGAGA AGAAAAACCC
GCAGAAGGGA AAAAACAAGC CTTTAAAGGA GTAGTTCCCA TCTTACAAAG ACAATATGAA
GGGGGCACAG AATTAGTTAA ACAGAAACTA GAAGAATATT TAATCGATCA ACCTTGTGAA
GTTTGTGGAG GAAAGCGGTT AAAACCCGAA GCCTTAGCCG TGAAATTAGG ACAATATGGA
ATTTCAGATT TAACCAGCGT CTCCATCCGC GAATGTCGAG AAAGAATAGA CCGATTGAAA
TTGACCCCAC GACAAATACA AATTGGCGAT TTAGTTCTCA AAGAAATTAA AGCCAGATTG
CAGTTTTTAT TAGATGTCGG GTTAGATTAT CTCACGTTAG ATCGTCCCGC CATGACATTA
TCAGGTGGAG AAGGCCAACG AATTCGGTTA GCAACACAAA TTGGTTCTGG ATTAACAGGA
GTTCTCTACG TTTTAGACGA ACCAAGCATC GGTTTACATC AAAGAGATAA CGGACGTTTA
TTGAAAACTT TAACCAAATT ACGCGACTTA GGAAATACAT TAATAGTCGT CGAACATGAT
GAAGAAACCA TACGTGCAGC TAATTATATA GTAGATATTG GTCCTGGTGC AGGAATTCAT
GGCGGAAATA TTATTTCTCA GGGAAATTTA GAAAATTTAT TAAATGCAGA AGCTTCTTTA
ACCGGTGCTT ACCTATCAGG AAGAAGAGTC ATTAATACCC CAGTCTCTAG AAGAGAAGGA
AATGGCAGAA CTTTAATAAT TAGAAACGCT CATCGCAACA ACCTCAGAAA TATAGATGTC
GAAATTCCTT TAGGTAAACT TGTCGCTGTC ACCGGTGTTT CTGGTTCAGG AAAATCTACC
TTAATCAATG AACTACTTTA TCCAGCATTA CAACATCATC TTTTAAAAAG AATTCCCTTT
CCCAAAGATA TTGATGAAAT TAAAGGTTTA AACTGCGTTG ACAAAGCCAT AGTCATCGAC
CAATCACCAA TTGGTAGAAC TCCCCGTTCT AACCCAGCAA CCTACACCGG AGTCTTCGAT
ATCATTCGAG ATGTGTTTTC CCAAACTATC GAAGCCAAAA CCAGAGGCTA CAAACCGGGA
CAATTTTCTT TCAACGTTAA AGGTGGAAGA TGTGAAGCTT GTAGCGGACA AGGTGTAAAC
GTCATTGAAA TGAACTTTTT ACCTGACGTT TATGTACAGT GTGAAATTTG CAAAGGTGCG
AGATACAACC GGGAAACATT ACAAGTTAAA TATAAAGATA AATCAATTTC CGACGTTCTC
AATATGACCG TTGAAGAAGC ATTAGCATTT TGTGAAAACA TTCCCAAAGC TGTCACCAGA
TTACAAACAT TAGTGGATGT GGGTTTAGGT TATGTGCAGT TAGGACAACC AGCCACAACT
TTATCTGGTG GAGAAGCACA ACGGGTTAAA TTAGCAACAG AATTATCTCG ACGTGCAACA
GGGAAGACTC TTTATTTAAT AGATGAACCA ACAACAGGAT TGTCTTTTTA CGACGTACAT
AAATTGTTGG ATGTTTTACA AAAATTGGTG GATAAAGGAA ATTCGATTTT AGTCATTGAA
CATAATTTAG ATGTGATTCG TTGTTCTGAT TGGGTAATAG ATTTGGGACC AGAAGGAGGA
GATCAAGGAG GACAAATTAT TATTGCAGGT ACACCGGAAG ATGTGGCGGA AAATCAACGT
TCTTATACTG GGGAATATTT AAGGCAGGTG TTGAAGCAAT ATTCGGCAGT TGTCTAA
 
Protein sequence
MSEHLATSLN HRLPYPDNSQ NTIRIRGARQ HNLKNIDLEL PRDRLIVFTG VSGSGKSSLA 
FDTIFAEGQR RYVESLSAYA RQFLGQLDKP DVEAIEGLSP AISIDQKSTS HNPRSTVGTV
TEIYDYLRLL YGRAGEPHCP KCDRCISPQT IDEMCDRIME LPDRTRFQIL APVVRGKKGT
HGKLLSSLAA QGFGRIKVNG EVLELSDFIE LDKNITHNIE LVIDRLVKKD GIQERLVDSL
TTCLNQSNGI SIIEVLNHTC HKVKQEVPHS KYISNGGENQ DISAQELASE MVFSENFACP
EHGAIMEELS PRLFSFNSPY GACSHCHGIG TLKRFSPELV IPNQDSPMYA ALAPWSEKEN
SYYLELLYSL GQAYGFELQT QWSKLTPEQQ QIILYGEEKP AEGKKQAFKG VVPILQRQYE
GGTELVKQKL EEYLIDQPCE VCGGKRLKPE ALAVKLGQYG ISDLTSVSIR ECRERIDRLK
LTPRQIQIGD LVLKEIKARL QFLLDVGLDY LTLDRPAMTL SGGEGQRIRL ATQIGSGLTG
VLYVLDEPSI GLHQRDNGRL LKTLTKLRDL GNTLIVVEHD EETIRAANYI VDIGPGAGIH
GGNIISQGNL ENLLNAEASL TGAYLSGRRV INTPVSRREG NGRTLIIRNA HRNNLRNIDV
EIPLGKLVAV TGVSGSGKST LINELLYPAL QHHLLKRIPF PKDIDEIKGL NCVDKAIVID
QSPIGRTPRS NPATYTGVFD IIRDVFSQTI EAKTRGYKPG QFSFNVKGGR CEACSGQGVN
VIEMNFLPDV YVQCEICKGA RYNRETLQVK YKDKSISDVL NMTVEEALAF CENIPKAVTR
LQTLVDVGLG YVQLGQPATT LSGGEAQRVK LATELSRRAT GKTLYLIDEP TTGLSFYDVH
KLLDVLQKLV DKGNSILVIE HNLDVIRCSD WVIDLGPEGG DQGGQIIIAG TPEDVAENQR
SYTGEYLRQV LKQYSAVV