Gene Aazo_0306 details


Gene Information

Locus tag: Aazo_0306
Symbol:
ID: 9338090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 305321
End bp: 306526
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 42%
IMG OID:
Product: HtrA2 peptidase
Protein accession: YP_003720013
Protein GI: 298489836
COG category:
COG ID:
TIGRFAM ID:
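
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name cds_lengths is hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and one terminal stop codon that is
    not counted in the protein length.
    """
    span = end_bp - start_bp + 1          # inclusive coordinate span
    if span % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return span, span // 3 - 1            # drop the stop codon


gene_len, protein_len = cds_lengths(305321, 306526)
print(gene_len, protein_len)  # 1206 401
```

Under these assumptions the result agrees with the Gene Length (1206 bp) and Protein Length (401 aa) fields of the record.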


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00230132
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTTAT CTGTGAAGCA ACTAGTCGTT TACTTGTTTT TAGTAGCTGT TGGTGGAGGT 
GGAGGTGTAT TTGGCAGTCG CTATTTTTTG CCCCAGCATC ACTCATTTCA AGAGTTAGAA
AATGTCACAG TGGCTTTACC TCCAGAAGCA GTTGTTCCCT ATCCTATTGA TGGAGCAACT
AACTCTACTA AGAGTGATAA TGTCAACTTT ATTGCTACTG CTGTACAAAA AGTAGGATCG
GCAGTTGTAC GAATTAATGC TACTCGTAAA GTAGCAAATC CAATTTTTGG CGCATTTGAC
AACTCTATGT TAAAGCGTTT TTTTGGGGAA GATGAAGAAC CAATTCCTTC GGAACGAATT
GAGCGTGGTA CAGGATCGGG GTTCATTTTA AGCGCCAATG GTCAGTTACT AACGAATGCT
CATGTAGTAG ATAATACTGA TACCGTACAA GTTACGCTCA AGGACGGGCG AACTTTTGAT
GGTAAGGTGG TAGGAATTGA TACTATAACC GACGTCGCAG TGGTCAAAAT TGCCGCTGAT
AATTTACCGA CGGTGAAATT AGGGAATTCG CAAAACTTAA TTCCTGGACA GTGGGCAATC
GCTATTGGTA ATCCTTTAGG TTTAGATAAT ACTGTTACTA TTGGTATCAT TAGCGCCACC
GACCGTACTA GTGCCCAAGT TGGTGTTCCT GATAAGCGGG TAAGTTTTAT CCAAACCGAT
GCAGCAATAA ACCCTGGTAA CTCTGGCGGC CCTCTCTTAA ACACCCAAGG AGAAGTTATT
GGCATTAATA CCGCCATCCG CACCGACGCT CAAGGACTTG GTTTTGCTAT TCCCATTGAA
ACTGCTGCCC GCATAGCTCA TGAGTTATTT ACCAAAGGAA AAGCAGAACA CCCCTTTTCA
GGAATTGAAA TGGCAGAGCT TTCACCTGCC AAAAAACAAG AATTGAATCA AAAAAAGCAA
CTCAACATTC AGCTTGATGT CAGTTTTGCC ATTAAAGGAA TTGTGGCAAA TTCCCCAGCA
CAAAAGGCTG GTTTACTCAT AGGCGATGTG ATTCAAAAAA TCAATGGCAA ACCAATTAAA
AGTTTAGCCC AAGCACAGAA AATTATTGAG TTTAGTACAG TCGGTGACAT TCTGACAATT
GAAGTCCACC GCAACGGCAA AACTCAAATC TTCAAAATAC GCTCAGGAAC TTACCCTCAC
AAATAG
 
Protein sequence
MKLSVKQLVV YLFLVAVGGG GGVFGSRYFL PQHHSFQELE NVTVALPPEA VVPYPIDGAT 
NSTKSDNVNF IATAVQKVGS AVVRINATRK VANPIFGAFD NSMLKRFFGE DEEPIPSERI
ERGTGSGFIL SANGQLLTNA HVVDNTDTVQ VTLKDGRTFD GKVVGIDTIT DVAVVKIAAD
NLPTVKLGNS QNLIPGQWAI AIGNPLGLDN TVTIGIISAT DRTSAQVGVP DKRVSFIQTD
AAINPGNSGG PLLNTQGEVI GINTAIRTDA QGLGFAIPIE TAARIAHELF TKGKAEHPFS
GIEMAELSPA KKQELNQKKQ LNIQLDVSFA IKGIVANSPA QKAGLLIGDV IQKINGKPIK
SLAQAQKIIE FSTVGDILTI EVHRNGKTQI FKIRSGTYPH K
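
The sequences above are printed in space-separated blocks of ten. A minimal sketch of how such a block could be cleaned, checked for GC content, and translated is given below. The helper names (clean, gc_fraction, translate) are hypothetical and not part of the source database. The codon table is built from the standard amino-acid assignments, which translation table 11 shares for internal codons; alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (also used by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGAAGTTAT CTGTGAAGCA ACTAGTCGTT TACTTGTTTT TAGTAGCTGT TGGTGGAGGT"
print(translate(first_line))  # MKLSVKQLVVYLFLVAVGGG
```

The translated fragment matches the first 20 residues of the protein sequence listed in the record. Passing the full gene sequence to gc_fraction would give a value to compare against the GC content field.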