Gene Aazo_2084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_2084 
Symbol 
ID9339878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2169752 
End bp2170939 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content40% 
IMG OID 
Productarsenite-activated ATPase ArsA 
Protein accessionYP_003721252 
Protein GI298491075 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0397137 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAGTAA TTTTAATGAC AGGTAAGGGT GGCGTAGGTA AAACCTCTGT TGCCGCAGCC 
ACTGGACTTC GGTCTGCAGA ACTCGGCTAT CGGACATTGG TTTTAAGTAC AGATCCTGCT
CACTCCTTAG CAGATAGTTT TGATATAGAA TTGGGACATG ATGCCAAACA AGTGCGCCCA
AATTTGTGGG GTGCAGAACT CGATGCACTG CAAGAATTAG AAGGTAACTG GGGTGCTGTA
AAGCGTTATA TTACCCAAGT CTTACAGGCA CGGGGTTTAG ACGGGATACA AGCGGAAGAA
TTGGCAATTT TACCAGGCAT GGATGAGATT TTCGGCTTGG TCAGAATGAA ACGTCACTAT
GATGAAGGGG AATTTGACGT TTTGATTATT GATTCTGCCC CAACTGGTAC TGCACTGCGT
TTGCTAAGTT TACCAGAAGT TGGTGGCTGG TATATGCGGC GTTTTTACAA ACCTTTTCAA
AATATCTCAG TGGCACTCAG ACCTTTAGTA GAACCGCTGT TTAGACCCAT TGCTGGTTTT
TCTTTACCAG ATAAAGAAGT AATGGATGCG CCTTATGAGT TTTATGAACA AATAGAAGCA
CTGGAAAAAG TATTGACTGA CAATAATCAA ACATCGGTTC GACTTGTCAC GAACCCAGAA
AAAATGGTGA TTAAAGAATC TCTTCGGGCT CATGCTTATC TGAGCTTGTA TAATGTAGCG
ACAGATTTAG TCGTAGCTAA TCGCATTATT CCTAAAGAAG TTGAAGATCC CTTTTTCCAA
CGTTGGAAAG AAAATCAAGA GCAATATCGC CAAGAAATTC ATGAAAACTT TCACCCCTTA
CCTGTGAAAG AAATTCCTCT TTATTCTGAG GAAATGTGTG GTTTAGCAGC ATTAGATAGA
CTGAAAGAAA CTCTCTACTC AGATGAAGAC CCAACTCAGA TTTATTACAA AGAAACTACT
ATGAGAATTG TGACGGAAAA TAACCAATAC AGCTTGGAAC TTTATTTACC TAATATTCCT
AAAAGCCAGA TTCAACTCAG TAAAACTGGT GACGAATTAA ACATTACTAT TGGTAATCAT
CGCCGTAACT TGATTTTACC CCAAGCTTTA GCCGCACTGC AACCATCAGG GGCAAAAATG
GATGATGATT ATCTAAAAAT TCGTTTTGCT GACAATGTAA GAGTCTAG
 
Protein sequence
MRVILMTGKG GVGKTSVAAA TGLRSAELGY RTLVLSTDPA HSLADSFDIE LGHDAKQVRP 
NLWGAELDAL QELEGNWGAV KRYITQVLQA RGLDGIQAEE LAILPGMDEI FGLVRMKRHY
DEGEFDVLII DSAPTGTALR LLSLPEVGGW YMRRFYKPFQ NISVALRPLV EPLFRPIAGF
SLPDKEVMDA PYEFYEQIEA LEKVLTDNNQ TSVRLVTNPE KMVIKESLRA HAYLSLYNVA
TDLVVANRII PKEVEDPFFQ RWKENQEQYR QEIHENFHPL PVKEIPLYSE EMCGLAALDR
LKETLYSDED PTQIYYKETT MRIVTENNQY SLELYLPNIP KSQIQLSKTG DELNITIGNH
RRNLILPQAL AALQPSGAKM DDDYLKIRFA DNVRV