Gene Aazo_3690 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3690 
Symbol 
ID9341495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3754034 
End bp3755224 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content43% 
IMG OID 
ProductHtrA2 peptidase 
Protein accessionYP_003722368 
Protein GI298492191 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.104241 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAATTT TCAAATTACC CCGGTCTATA CGCCAAGTTA GTACCCATGT TTTAGCGATT 
TTCCTAGGAG TTTTGCTGAC GGTTACTTCT TTGGAAGTTT TGCCATCTCA AGCTGAACCT
GCTCCTCCTC TGGTTCCTCA AAAATCATCT CCTGTGGCTG CGGCTATTGG TAATAGTAGT
TTTGTGACGG CTGCAGTGAA TCGTGTTGGA CCGGCTGTGG TCAGGATTGA TACAGAACGG
ACGATTACTC GTCCTACTGA TCCTTTGATG GATGACCCGT TTTTGCGGAG GTTTTTTGGT
GATGGGTTTC CTCAACAGTC ACCGACTGAA CAATTACGCG GTCTGGGTTC TGGGTTTATT
TTCGATAAAA GCGGGATAGT TCTGACTAAT GCTCATGTAG TTGATCAAGC TGATAAGGTG
ACAGTCCGCC TCAAAGATGG CCGCACTTTT GAAGGTAAAG TTAAAGGTAT TGATGAAGTG
ACTGATTTGG CGGTGGTTAA GATTAATGCT GGTAATGATT TACCTGTTGC TTCTTTAGGT
TCTTCTCAGA ATGTCCAGGT GGGAGACTGG GCGATCGCAG TGGGAAATCC TTTAGGATTT
GATAATACTG TAACTTTAGG TATTATTAGT ACCTTAAAAC GTTCTAGTGC CCAAGTGGGA
ATTAGTGATA AACGTTTAGA TTTCATTCAA ACTGATGCTG CTATTAACCC TGGTAACTCC
GGTGGTCCTT TATTAAATGC TGAAGGAGAA GTGATTGGGA TAAATACAGC GATTCGTGCA
GATGCTATGG GGATTGGGTT TGCAATTCCT ATTGATAAAG CTAAGGCCAT CGCAACTCAA
TTGCAACGAG ATGGTAAAGT TGCTCACCCC TATTTGGGCG TACAAATGGT GACATTGACA
CCCCAGTTAG CGCAACAAAA TAACATTGAT CCAAATTCCA TGTTTGAGAT TCCAGAAGTG
CGCGGCGTTT TGGTGATGAG GGTTGTACCC GGTTCTCCTG CTGCTACTGC GGGTATCCGT
CGCGGAGATG TCATTGTTAA AATTGATGAT CAAGTAATTA CAAGCGCCGA TCAATTGCAA
AGGGTAGTAG AAGATAGTCG TCTAGGTCAA ACTTTCCAGC TAAAAGTGCA ACGAGGTAAT
CAAACACAAA TACTTTCAGT GCGTACTGCT GAGTTGAAAG ATATTCAGTA G
 
Protein sequence
MQIFKLPRSI RQVSTHVLAI FLGVLLTVTS LEVLPSQAEP APPLVPQKSS PVAAAIGNSS 
FVTAAVNRVG PAVVRIDTER TITRPTDPLM DDPFLRRFFG DGFPQQSPTE QLRGLGSGFI
FDKSGIVLTN AHVVDQADKV TVRLKDGRTF EGKVKGIDEV TDLAVVKINA GNDLPVASLG
SSQNVQVGDW AIAVGNPLGF DNTVTLGIIS TLKRSSAQVG ISDKRLDFIQ TDAAINPGNS
GGPLLNAEGE VIGINTAIRA DAMGIGFAIP IDKAKAIATQ LQRDGKVAHP YLGVQMVTLT
PQLAQQNNID PNSMFEIPEV RGVLVMRVVP GSPAATAGIR RGDVIVKIDD QVITSADQLQ
RVVEDSRLGQ TFQLKVQRGN QTQILSVRTA ELKDIQ