Gene Aazo_3553 details

Gene Information

Locus tag: Aazo_3553
Symbol:
ID: 9341359
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 3622436
End bp: 3623665
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 45%
IMG OID:
Product: aluminum resistance family protein
Protein accession: YP_003722273
Protein GI: 298492096
COG category:
COG ID:
TIGRFAM ID:
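
The length fields in this record agree with its coordinates, which a little arithmetic can confirm. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is a stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(3622436, 3623665)
print(length, protein_length(length))  # 1230 409
```

Under these assumptions the result matches the Gene Length (1230 bp) and Protein Length (409 aa) fields.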


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.916774
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAGCA TAGAGCAGCT GCAAGAAGCA GAACAGGCGC TATTAGAGAT TTTTTCTGGA 
ATTGACGCAC AGGTCAAGCA TAATCTTCAA AGAGTGCTAA CAGCATTTCG TAATCATCGA
GTAGGGGCGC ACCACTTTGC GAGTGTAAGC GGTTATGGAC ATGATGATTT AGGGAGAGAA
ACCTTAGATC AAGTTTTTGC CCAAGTAATG GGAGCAGAAG CTGCGTTGGT GCGAGTGCAG
ATTGTATCAG GAACTCATGC GATCACCTGT GCGCTTTATG GAGTGCTTCG GCCTGGGGAT
GAAATGTTAG CAGTGATTGG TTCTCCCTAC GATACTTTGG AAGAGGTAAT TGGCTTGCGT
GGTCAAGGCC AAGGGTCTCT TATTGATTTT GGCATAAAAT ACCGCCAACT AGAACTAAAC
GAAGAAGGAA AAATAGATTG GCAAGCATTA CAGCACGGAA TTCAAGAAAA TACCAAATTA
GTATTAATTC AACGTTCCTG TGGATATTTA TGGAGGCCAA GCCTCTCTAT ACAAGAGATT
GAAAAAATCA TTCACATAGT CAAACAGCAA AACCCCAACA CTGTATGTTT CGTAGATAAC
TGTTATGGCG AATTTATTGA TATTAAAGAA CCTACTCATG TAGGTGCTGA CTTAATGGCC
GGGTCATTAA TTAAAAATCC TGGCGGTACA TTAGTTACAG CAGGGGGATA TATAGCAGGA
AGAGCAGACC TAGTAGAAGC TGCAGCTTGT AGACTAACAG CCCCCGGAAT AGGTAGTGCT
GGAGGAGCGA CCTTCGACCA AAACCGCCTC TTATTCCAGG GATTATTTTT AGCACCGCAG
ATGGTTGGGG AAGCTATGAA AGGAACATAC CTAACAGGAT ACGTATTTGA CAAACTTGGA
TATCCAGTTA ACCCCCCACC CTTAGCACCA CGAGGAGATG TCATCCAAGC GATTAAACTG
GGTTCAGCCA AAAAGCTGAT CGCCTTTTGT AAAGCCATCC AACAGTCTTC ACCCATCGGG
TCTTATCTCG ACCCTATACC CGACGATATG CCAGGCTATG AAAGCGAAGT AGTCATGGCT
GGAGGCACAT TTATTGAAGG CAGCACCTTG GAATTATCAG CTGATGGCCC ATTACGTGAG
CCTTATGTTG TGTATTGTCA AGGGGGTACA CATTGGACTC ATGTAGCAAT CGCTTTACAG
GCAGCTATTG AGGCTGTAGG AGAAGCTTAG
 
Protein sequence
MNSIEQLQEA EQALLEIFSG IDAQVKHNLQ RVLTAFRNHR VGAHHFASVS GYGHDDLGRE 
TLDQVFAQVM GAEAALVRVQ IVSGTHAITC ALYGVLRPGD EMLAVIGSPY DTLEEVIGLR
GQGQGSLIDF GIKYRQLELN EEGKIDWQAL QHGIQENTKL VLIQRSCGYL WRPSLSIQEI
EKIIHIVKQQ NPNTVCFVDN CYGEFIDIKE PTHVGADLMA GSLIKNPGGT LVTAGGYIAG
RADLVEAAAC RLTAPGIGSA GGATFDQNRL LFQGLFLAPQ MVGEAMKGTY LTGYVFDKLG
YPVNPPPLAP RGDVIQAIKL GSAKKLIAFC KAIQQSSPIG SYLDPIPDDM PGYESEVVMA
GGTFIEGSTL ELSADGPLRE PYVVYCQGGT HWTHVAIALQ AAIEAVGEA
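
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and computing its GC fraction. The following is a minimal sketch in plain Python with hypothetical helper names. It assumes that the amino-acid assignments of translation table 11 are the same as those of the standard code (the two tables differ in which start codons they allow), and it removes the spaces and line breaks used in the listing above before processing.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO_ACIDS
    )
}


def clean(seq: str) -> str:
    """Remove whitespace (spaces between 10-base blocks, newlines) and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAACAGCA TAGAGCAGCT GCAAGAAGCA"))  # MNSIEQLQEA
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, or passing it to `gc_content` and comparing with the GC content field, is one way to check the record for internal consistency.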