Gene Aazo_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_2079 
Symbol 
ID9339873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2162024 
End bp2163286 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content41% 
IMG OID 
ProductHtrA2 peptidase 
Protein accessionYP_003721250 
Protein GI298491073 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.376096 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAGA TTTATAATCA TCACCTAGCA TCCAAAAATC TTAATACCAG GTTTTTGCTC 
CCCTGTATAA GAGCTAAAAG AAAAGGGTGG ATGTTAATGA CTGGGTTAGC AGCGGTGGTT
CTTAGTGGCT GTTCTAACCT GAATAGCAGA ACCTTAGAAT CAGAACAGAG CCTTGCGGAA
GTCCAAAGAA CTACTGCTCC CACTTCGGTA ATTATGCCTT CTGCCATCAT CGCATCCTCC
GGCGATCCTA ACTTTGTAGT CGAAGTAGTA CAAAAAGTGG GAGGTGCTGT TGTTCGCATT
GACTCTGCCA GAACCGTTAC ATCTCAAGTA CCAGATGAAT TTTCCGATCC ATTTTTCCGC
AGATTTTTTG GTGACAGAAT TTCCCAAGGA AGACAAAGAG TAGAAAGGGG TAGCGGTTCT
GGATTTATTA TTAATTCCTC TGGGCAAATT CTGACTAATT CTCATGTCGT AGATGGCGCA
GATCAAGTTA CAGTTACACT CAAAGATGGA CGGACTTTTG ACGGTAAAGT CTTGGGAGAA
GACCCAGTTA CAGATGTAGC AGTAATTCAA ATTAATGCTA ATAATTTACC AATTTTAGCT
CTAGGGAATT CCAATACCTT GCAACCAGGA GAAGCCGTAA TTGCAATTGG TAATCCCTTG
GGTTTAAACA ATACTGTAAC TTCAGGAATT CTTAGTGCTA CAGACCGTTC TAGTAGTGCT
ATTGGTGCTA GTGATAAGCG GGTTGATTAT TTGCAAACAG ATGCAGCAAT TAATCCTGGA
AACTCTGGTG GTCCACTGCT AAATTCTGGT GGTAAAGTGA TAGGAATGAA CACAGCTATT
ATCCAAGGCG CTCAAGGTTT AGGCTTTGCA ATTCCTATTA ATACAGTGCA AAAAATTTCC
CAAGAATTAA TTAGCAAAGG TAGGGTAGAT CATCCTTACT TGGGTGTAGA AATGGTGACG
CTAACACCAG AACTCAAGGA AAGAATAATT CGTAGATCTG GTAATAGAGT AAATTGGGTT
GCAGATCAAG GCGTTTTGTT AGTGAGAATT GTTTCTGAGT CACCAGCCGC AATTGGTGGA
CTCAAACCAG GGGATGTGAT GAAAACCATT AATAATCAAC CTGTTACCAA GGTCGATGAA
GTACAAAAAC TAGTGGAAAA TAGTCAGATT GGTACTCCTC TTCAAGTCCA AGTAGACCGT
CAGGGCAGAA CTGTTCAACT AACAGTCAGT CCTGCTCCTT TACCAGTGCC TTCTGAAAAT
TGA
 
Protein sequence
MMKIYNHHLA SKNLNTRFLL PCIRAKRKGW MLMTGLAAVV LSGCSNLNSR TLESEQSLAE 
VQRTTAPTSV IMPSAIIASS GDPNFVVEVV QKVGGAVVRI DSARTVTSQV PDEFSDPFFR
RFFGDRISQG RQRVERGSGS GFIINSSGQI LTNSHVVDGA DQVTVTLKDG RTFDGKVLGE
DPVTDVAVIQ INANNLPILA LGNSNTLQPG EAVIAIGNPL GLNNTVTSGI LSATDRSSSA
IGASDKRVDY LQTDAAINPG NSGGPLLNSG GKVIGMNTAI IQGAQGLGFA IPINTVQKIS
QELISKGRVD HPYLGVEMVT LTPELKERII RRSGNRVNWV ADQGVLLVRI VSESPAAIGG
LKPGDVMKTI NNQPVTKVDE VQKLVENSQI GTPLQVQVDR QGRTVQLTVS PAPLPVPSEN