Gene Aazo_2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_2072 
Symbol 
ID9339866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2155543 
End bp2157282 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content46% 
IMG OID 
Productsingle-stranded nucleic acid-binding R3H domain-containing protein 
Protein accessionYP_003721243 
Protein GI298491066 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATTA CAGACGATCT CCAAAAGTTA CTGGACATTT TGCCCCAAGA CCTGCGGCAA 
GAACTAGAAA ATCATCCCAA AAGAGATTAT CTAGTGGAAG TGGTCTTGGA CTTAGGTCGT
CGCCCAGAAG CTCGGTTTCC CCATGCAGCT GAGTATCTGA GCGAAACCCC CGTCACTCAA
GCACAAATAG ATGATTGCAT TCAACGAGTC GGAACCTTTG GTGGAGATAA TCGGGCAGGA
ATTGAGCAAA CTTTACATCG CATCAGTGCT ATCCGTAACC GCACAGGCAA AATTATTGGT
TTAACTTGTC GTGTCGGCCG GGCGGTATTT GGCACAATTG GCATGATCCG CGATTTGGTA
GAAACCGGCA AATCCATTCT CATGTTAGGT TGTCCTGGTG TGGGTAAAAC CACTGCTTTA
CGGGAAATTG CCCGTGTATT GGCAGATGAT TTAAATAAAC GAGTGGTAAT TATTGACACC
TCTAATGAAA TCGCTGGTGA TGGTGATATA GCTCACTCTG CCATTGGCCG CGCTCGCCGG
ATGCAAGTAG CAAAGCCAGA ATTACAACAT CAAGTGATGA TTGAGGCAGT GGAAAACCAT
ATGCCGGAGG TGATTGTCAT TGATGAAATC GGCACAGAAC TGGAAGCTTT AGCAGCCCGT
ACAATTGCAG AGAGAGGTGT ACAATTAGTT GGTACTGCTC ACGGAAACCA AATTGAAAAC
CTCATCAAAA ATCCCACGCT GTCTGATTTG GTGGGGGGTA TTCAAGCAGT GACGCTGGGA
GATGACGAAG CGAGACGACG CGGTTCACAG AAGACCGTTT TGGAACGTAA AGCTCCACCT
ACCTTTGAGA TTGCTGTGGA AATGATGGAA CGGCAACGCT GGGTAGTGCA TGAAAGCGTA
GCTGATACAG TGGATACTCT GTTAAGAGGT CGTCAAGCTA ACCCACAAAC ACGAAGTATG
GATGAAAATG GCAAAGTGGC GATTACCAGA CAATTATCTG TGGTTCAGGG TCGTGGAGGT
AACTTAGCTA GTGAGGAAGA ATCTTTACCG GCTGTGAAGC AGGTTAATGG CTGGCGTAGT
TCTGGTCAGA TGGTGGCTCT ACCACCTTTA TCTATAGAAC GGGAACGGGT GACAGGACGC
AGTGAGTTTG ACCGCTTGCT GGATGAGTCT TTCAATTATT CTTCTGACAG TATTGATTTT
AGTCATACTA AATCAGCCGG TCCAAACGGG GAAGATTTAC CTTTGCATAT TTACCCATAT
GGGGTGAGTC GTCACCAACT GGAACAGGTG ATTAGTGTGT TAACTTTGCC TGTTGTATTG
ACAAAAGACA TAGATAGTGG TGATGCAATT TTGGCGTTGC GATCGCACGT GAAAAACCAC
GCTAAATTAA AGCAAATAGC CAAGGCTCGT CATTTACCAA TTCATGTGAT TAAGTCTAGC
ACCATACCCC AAATTACTCG CGGTTTGCGG CGGTTGCTGA ATATTGATGA TCCAGAAATC
GGTGATGACC GAGAATTACA ACTATTACTC CATAGTGGGA GTGATGACGA AATAGACGCT
TTGGAAGAAG CCAGACTTGC AGTTGAGCAA ATTGTCATTC CTAAAGGTCA ACCTGTGGAG
TTATTACCCC GTTCTCCGCA AGTAAGAAGA ATGCAGCATG AGTTGGTAGA ACATTATAGG
CTCAAGTCCA ATAGTTTTGG GGAAGAACCA AACCGACGGT TAAGAATTTA TCCGGCATAA
 
Protein sequence
MTITDDLQKL LDILPQDLRQ ELENHPKRDY LVEVVLDLGR RPEARFPHAA EYLSETPVTQ 
AQIDDCIQRV GTFGGDNRAG IEQTLHRISA IRNRTGKIIG LTCRVGRAVF GTIGMIRDLV
ETGKSILMLG CPGVGKTTAL REIARVLADD LNKRVVIIDT SNEIAGDGDI AHSAIGRARR
MQVAKPELQH QVMIEAVENH MPEVIVIDEI GTELEALAAR TIAERGVQLV GTAHGNQIEN
LIKNPTLSDL VGGIQAVTLG DDEARRRGSQ KTVLERKAPP TFEIAVEMME RQRWVVHESV
ADTVDTLLRG RQANPQTRSM DENGKVAITR QLSVVQGRGG NLASEEESLP AVKQVNGWRS
SGQMVALPPL SIERERVTGR SEFDRLLDES FNYSSDSIDF SHTKSAGPNG EDLPLHIYPY
GVSRHQLEQV ISVLTLPVVL TKDIDSGDAI LALRSHVKNH AKLKQIAKAR HLPIHVIKSS
TIPQITRGLR RLLNIDDPEI GDDRELQLLL HSGSDDEIDA LEEARLAVEQ IVIPKGQPVE
LLPRSPQVRR MQHELVEHYR LKSNSFGEEP NRRLRIYPA