Gene Aazo_1046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1046 
Symbol 
ID9338842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1116720 
End bp1119932 
Gene Length3213 bp 
Protein Length1070 aa 
Translation table11 
GC content39% 
IMG OID 
Productacriflavin resistance protein 
Protein accessionYP_003720531 
Protein GI298490354 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00124975 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAATAA CAAACAACAA CGGCTTTAGT ATCAGCGCCA TTTCTATCCG TCAGCATATC 
GCTACTCTCA TGCTGACCTT GGCTGTAATT GTTATGGGTG TATTTTTCAT TATTCATTTA
CCCGTGGATT TACTTCCATC TATTACTTAT CCCCGAATTG GTGTGAGGAT AGAAGCACCA
GGAATTTCTC CAGAAGTAGC AATTAATGAA GTTACCAAAC CTCTAGAAGA AGCTTTTTCT
GCAACAGAAG GTGTAATTCA AGTTTTTTCC CGCACTCGTG AAGGACAAGT GAGTTTAGAT
TTGTATTTTC AACCAGGAGG AAATATTGAC CAAGCCTTAA ATAATGCTAC AGCAACCTTT
AATCGATCCA GAAACAGATT ACCAGACACT ATTACAGAAC CACGTTTATT TAAAGTAGAT
CCTTCCCAAT CACCTGTTTA TGAATTCGCA CTTACTTCAC CTACTCTTAA AGCTGTTGAC
TTGCGGGTTT TTGCAGAAGA AGAATTAGCT CGTGAATTGG GTGTAGTACC AGGAGTAGCA
ATAGTAGATG TATCAGGAGG AGTTAAGGAA GAAGTTAGGG TAAATGTTGA TTTAGATCGT
CTCCAATCTA TTGGTGTTGG TTTGACAGAT GTGTTAAATC AACTGCAAGA CCGTAACCAA
GATATTTCTG GAGGAAGGAT TCTAGGTCCA AATTCTGAAC CTTTAATCCG CACCATGGGA
CGCTTCCAGA ATGCTGAGGA AATTAACAAT ATTTCTTTTG AAGTATCTGC ACGTAATTCT
AATCTTAAAA ATCGTGTTTA TTTGCGTGAC TTTGCCCAAG TTATCGATGG TTCAGAACAA
CAACGGGTTT ATGTATTACT GAATGGTCAA GAAGCAGTTA AAGTCAGTAT TCAAAAACAG
CCAGATGCTA ATACTGTAAA TGTTGTTGAT GGTGTAAAAA AACGTTTAGA AAAACTTAAG
GAAGCTGGTG TAATTCCTGC TGAAGCAACC CTCACATCTA CTTTAGATGA ATCAATACTT
ATCCGCAGTT CTCTGGCTAA TCTTACTTCT TCTGGATTGA TTGGTTCAGG TTTAGCAGCT
ATAGCCGTTT TTCTGTTTCT AGGTTCTCTA AGACAAACTT TAATTATTAT TCTGGCTATT
CCCCTAGCAT TTTTAACAGC TATTATTTTC ATGGGAATAT TCGGTTTATC CCTGAATATT
TTTAGTTTGG GTGGTTTAGC CTTGGGTGTG GGAATTGTCG TTGATAATTC CATCGTCATG
TTAGAAAATA TTGCCGAGGG AATTAATCAA TCAAAATTAC AAAATCAAAC ATCAAAATTA
CCAACTGCTT TAATTATTCA ACAGGCAGAA AAAAGCAGTC GAGAAGTAGA ATCAGCTTTA
GTAGCTTCTA CAAGTACAAA CTTGGTAGCA GTATTACCAT TTTTGCTGAT TGGTGGTTTT
ATCTCATTAC TCTTTAATGA GTTAATTCTC ACCATCAGTT TTGCGGTAGC AGCTTCAGTT
CTTATTGCTG TTACTGTTGT CCCCATGCTG ACATCTCGAC TGTTAGCTTT ACCAGTTTCT
AGTTCTCTAG GTAATTTTTG GTTTTTCCGC GAGTTTAATC ACCGTTTTGA AGCCACTACA
AGATTATATG GTGGTTTGTT AGCTAGGATC TTACGCTGGC GGTTATTAAC TATTGCGATC
ACTACTATCA TATTAGGTGC TGGTAGTTGG TGGATGGCTC CTCAACTTCC CCAAGAAATT
CTCCCCCCCA TCAACACCGG ACAAGTTAGC TTAATAGCTC AATTTCCCCC CGGTACACCT
CTAGAAACTA ATCAAAAAGT CATGAGGGCT GTAGATCACA TTCTCCGTCA GCAACCAGAA
ACAGAATATG CGTTCTCTAC AGTAGGTGGT TTTCTCTTTG GTAGCAATAG TACTGCTAAT
CCCCTGAGAA GTTCCAGCAC CATTACCCTC AAACCAAAAA CAGATATAGA GACTTATGTT
GAGCGTGTCA CCAAAGAATT TACTAAGCTA AACCTAGTAG ATATTCGCCT CGGTTTAGCC
CCTGGTCAAG TGCGGGGTTT AATTCTTAAC AACTCCCCTA CTCGCGGTGC TGACGTTGAC
ATAATTCTCC AAGGAAATAA CTCAGACACT TTAGAGAAAG CAGGTCATGC TTTATTAAAA
ACTTTAGAAG AAAAAGTTAC CTCAGTTAGA TTCCGTCCTG ATGCAGATGT AACTCAACCA
GAAATCCAGA TTTTACCTGA CTGGGAACGA GTGGCTAATG TCGGTTTAAA TACTAAAGAT
ATTGGGGAAA CAATTCAGAC CGCCATTATA GGTAGCGTAG CCACCCAACT ACAACGCAAT
AACCGTTTGG TAGATGTGCG AGTTGAGTTA AATGAAGCTT CAATACAAAC AACTTCCCAG
TTAGAGAGAT TACCTTTATT TGTAGAAGGT AATCAACAAA TTCGCTTGAG TGATGTAGCG
ACAATTGCCG AAGGTAAAGC ACCAGGAGAA ATTCAACGGA TTAATCAGCG TCAAGTTTTT
TTAATAGCTG GTAATTTGGC AGAAGGAGCA AGTCTTAGTC AAGCATTAGA TCAAGTTGAT
CTAGTTCTCA AAAATGCAAA TTTCCCTCAA GGTGTCAGTC TTTTACCCAG TGCAACAGCG
GAATCTAATC AAGAACTACA AAAATCAATA CAACTGTTGG GAGGTTTAGC AATCTTTTTA
GTCTTTGTGG TGATGGCTGT ACAATATAAT TCCCTCATTG ATCCTTTAGT AATTCTGTTG
ACCATTCCTC TGGCATTAGC AGGAGGTATT TTTGGGCTTT ATATCACCGG AACTGCTATT
GGTGCTACTG TCGTCGTCGG TGCAGTTTTG TTAGTAGGTA TTGTGGTTAA TAACGCCATT
ATTATGGTGG AGTTAGCAAA TCAAATTCGA GAACGAGAAA GAATTGACCG TCGTACCGCC
ATTTTAAAAG CAGCACCTCA ACGTTTACGC CCTATTTTCA TGACAACAAT TACCACAGTT
CTGGGTATGT TTCCCCTGGC TTTAGGAATT GGTGAAGGTT CAGAGTTTCT TCAACCTTTG
GGTGTAGTTG TATTTTCTGG GTTGTCTTTA GCAACAATGC TTACTCTATT TATTATCCCC
TGTTTTTATA CTCTGCTCCA TGATTTAATT ACTGGACGTT GGGCAAAACC TGTACTTATC
AAAATCTGGA AGAGAAATTT AAACAAAGCA TAA
 
Protein sequence
MQITNNNGFS ISAISIRQHI ATLMLTLAVI VMGVFFIIHL PVDLLPSITY PRIGVRIEAP 
GISPEVAINE VTKPLEEAFS ATEGVIQVFS RTREGQVSLD LYFQPGGNID QALNNATATF
NRSRNRLPDT ITEPRLFKVD PSQSPVYEFA LTSPTLKAVD LRVFAEEELA RELGVVPGVA
IVDVSGGVKE EVRVNVDLDR LQSIGVGLTD VLNQLQDRNQ DISGGRILGP NSEPLIRTMG
RFQNAEEINN ISFEVSARNS NLKNRVYLRD FAQVIDGSEQ QRVYVLLNGQ EAVKVSIQKQ
PDANTVNVVD GVKKRLEKLK EAGVIPAEAT LTSTLDESIL IRSSLANLTS SGLIGSGLAA
IAVFLFLGSL RQTLIIILAI PLAFLTAIIF MGIFGLSLNI FSLGGLALGV GIVVDNSIVM
LENIAEGINQ SKLQNQTSKL PTALIIQQAE KSSREVESAL VASTSTNLVA VLPFLLIGGF
ISLLFNELIL TISFAVAASV LIAVTVVPML TSRLLALPVS SSLGNFWFFR EFNHRFEATT
RLYGGLLARI LRWRLLTIAI TTIILGAGSW WMAPQLPQEI LPPINTGQVS LIAQFPPGTP
LETNQKVMRA VDHILRQQPE TEYAFSTVGG FLFGSNSTAN PLRSSSTITL KPKTDIETYV
ERVTKEFTKL NLVDIRLGLA PGQVRGLILN NSPTRGADVD IILQGNNSDT LEKAGHALLK
TLEEKVTSVR FRPDADVTQP EIQILPDWER VANVGLNTKD IGETIQTAII GSVATQLQRN
NRLVDVRVEL NEASIQTTSQ LERLPLFVEG NQQIRLSDVA TIAEGKAPGE IQRINQRQVF
LIAGNLAEGA SLSQALDQVD LVLKNANFPQ GVSLLPSATA ESNQELQKSI QLLGGLAIFL
VFVVMAVQYN SLIDPLVILL TIPLALAGGI FGLYITGTAI GATVVVGAVL LVGIVVNNAI
IMVELANQIR ERERIDRRTA ILKAAPQRLR PIFMTTITTV LGMFPLALGI GEGSEFLQPL
GVVVFSGLSL ATMLTLFIIP CFYTLLHDLI TGRWAKPVLI KIWKRNLNKA