Gene Aazo_1705 details


Gene Information

Locus tag: Aazo_1705
Symbol:
ID: 9339498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 1770465
End bp: 1771706
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 44%
IMG OID:
Product: carboxyl-terminal protease
Protein accession: YP_003720978
Protein GI: 298490801
COG category:
COG ID:
TIGRFAM ID:
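
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1770465
end_bp = 1771706
gene_length_bp = 1242
protein_length_aa = 413

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(gene_length_bp % 3 == 0)                       # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1242 bp is 414 codons, of which 413 encode amino acids.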


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.880099
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGTTCA TGAACAAACA GGTTTTTCGG TTGGGATTTT CATTACTTTT GGCTTTTTGT 
TTGGGTTTTG GCTCGCTTGT TTCACCTGCA ATGGCTTTAA CACAGGAGCA AAAGCTAGTT
TCTGAGGTTT GGCGAATTGT TAATCGCTCT TATCTGGATG AAACATTTAA TCATCAAAAC
TGGGCTGATG TACGTCAACA GGCGCTAAGG AAACCACTGC CAAATGACCA AGCAGCATAC
AGGGCTATTC AGAAGATGCT AAAAAGCCTT GATGACCCTT TTACCAGGTT TTTAGACCCA
GAGCAATACC GCAGTTTGCA AGTTAATACT TCTGGAGAAC TGACCGGAGT GGGTTTACAA
ATTGCGCTCA ATCCCCAGAC GGGTGGATTG GAGGTAATTA CACCTATAGA GGGTTCACCG
GCTGAGAAAG CAGGGTTAAG ACCTCGCGAT CGCATCTTGA AAATCGAAGG ATTATCTACA
GAAAATCTGA CTCTTGATGA AGCTGCTAAA CGGATGCGCG GTCCCGTTGG TAGTGTTGTA
ACTCTCTTGA TTGCACGAGA GGGAAAGGAA TACAAAGAAG TGATATTAGT GCGCGATCGC
ATAGAACTTA ATCCTGTAGT AGCTGAATTG CGTTTATCCC CCGAAGGAAA ACCCATTGGC
TACTTACGCC TAACTCAATT TAATGCTAAT GTGGTAATCA GGTTGGCAGA CGCTCTTAAT
AGCCTAGAAA AAAAAGGCGC AGTTGCCTAC ATTCTTGATT TGCGAAATAA TCCTGGTGGG
CTATTACAAG CCGGAATTGA AGTTGCCCGT CAGTGGTTAG ATTCAGGCAC AATTGTCTAC
ACTGTCAATC GTCAAGGTAT TCAGGGCAAT TTTGAAGCCC TTGGCCCAGC GTTAACACAA
GATCCCTTGG TGATTTTGGT GAATGAAGGA ACTGCTAGTG CTAGTGAAAT CCTTGCTGGT
GCCCTACAAG ACAATAAACG CGCCCAGTTA GTAGGTGAAA CGACCTTTGG TAAAGGTCTA
ATTCAATCTT TGTTTGAATT ATCAGATGGT TCAGGTTTAG CAGTGACAAT TGCCAAGTAT
GAAACTCCCA AGCACAGAGA CATTAACAAG TTAGGTATTA AACCAGACAA ACTAATTCCC
CAACAACCAA TTACACGGGA GCAAATTGGG ACGGAAGGGG ATAGTCAATA TCAAGCTGCA
ATGGAACTGC TAACCAAAGA TTTGGTTGTA GCTGGTTCGT AG
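
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to strip the block spacing, compute GC content, and translate codons. It is a minimal illustration, not the pipeline that produced this record: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). Only the first and last blocks of the sequence are used as input.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same ones.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq_text.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in a cleaned sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at a stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


head = clean("ATGGGGTTCA TGAACAAACA GGTTTTTCGG")  # first three blocks
tail = clean("GCTGGTTCGT AG")                     # last two blocks

print(translate(head))           # MGFMNKQVFR
print(translate(tail))           # AGS
print(gc_content("ATGGGGTTCA"))  # 50.0
```

The translated head and tail match the first ten and last three residues of the protein sequence listed in this record. Passing the full cleaned gene sequence to `gc_content` would give the value to compare against the GC content field.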
 
Protein sequence
MGFMNKQVFR LGFSLLLAFC LGFGSLVSPA MALTQEQKLV SEVWRIVNRS YLDETFNHQN 
WADVRQQALR KPLPNDQAAY RAIQKMLKSL DDPFTRFLDP EQYRSLQVNT SGELTGVGLQ
IALNPQTGGL EVITPIEGSP AEKAGLRPRD RILKIEGLST ENLTLDEAAK RMRGPVGSVV
TLLIAREGKE YKEVILVRDR IELNPVVAEL RLSPEGKPIG YLRLTQFNAN VVIRLADALN
SLEKKGAVAY ILDLRNNPGG LLQAGIEVAR QWLDSGTIVY TVNRQGIQGN FEALGPALTQ
DPLVILVNEG TASASEILAG ALQDNKRAQL VGETTFGKGL IQSLFELSDG SGLAVTIAKY
ETPKHRDINK LGIKPDKLIP QQPITREQIG TEGDSQYQAA MELLTKDLVV AGS
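
The protein sequence above uses the same 10-residue block layout. As a rough sketch, the listing can be joined into a single string, checked against the Protein Length field, and re-wrapped as a FASTA record. The `to_fasta` helper and the choice of the protein accession as the header are assumptions made for illustration.

```python
# Protein sequence copied from the listing above, block spacing included.
PROTEIN_TEXT = """
MGFMNKQVFR LGFSLLLAFC LGFGSLVSPA MALTQEQKLV SEVWRIVNRS YLDETFNHQN
WADVRQQALR KPLPNDQAAY RAIQKMLKSL DDPFTRFLDP EQYRSLQVNT SGELTGVGLQ
IALNPQTGGL EVITPIEGSP AEKAGLRPRD RILKIEGLST ENLTLDEAAK RMRGPVGSVV
TLLIAREGKE YKEVILVRDR IELNPVVAEL RLSPEGKPIG YLRLTQFNAN VVIRLADALN
SLEKKGAVAY ILDLRNNPGG LLQAGIEVAR QWLDSGTIVY TVNRQGIQGN FEALGPALTQ
DPLVILVNEG TASASEILAG ALQDNKRAQL VGETTFGKGL IQSLFELSDG SGLAVTIAKY
ETPKHRDINK LGIKPDKLIP QQPITREQIG TEGDSQYQAA MELLTKDLVV AGS
"""

# Joining on whitespace removes both the block spacing and the line breaks.
protein = "".join(PROTEIN_TEXT.split())


def to_fasta(header, seq, width=60):
    """Format a sequence as FASTA text with fixed-width sequence lines."""
    lines = [seq[i:i + width] for i in range(0, len(seq), width)]
    return ">" + header + "\n" + "\n".join(lines) + "\n"


print(len(protein))  # 413
print(to_fasta("YP_003720978", protein))
```

The residue count agrees with the 413 aa given in the Gene Information fields.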