Gene Aazo_4085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4085 
Symbol 
ID9341890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4150961 
End bp4152808 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content44% 
IMG OID 
Productpeptidase S8 and S53 subtilisin kexin sedolisin 
Protein accessionYP_003722657 
Protein GI298492480 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAGAT TTATATTACT GTGCTTGTTT CTTATCGGGT TAGTTACTGT CGTGTTCGGT 
TTCCTGAACG TCCAGGGATT GGCGAGCAAA GGTGAATTTG AGACAATTTT GCTAGATTTT
CGGGAAGATA TTCCAGCATT GGTGATTAAT CAGGATTTGC AACTGATCGC TCAACAATAT
CATATTACAC CCCGACTGGA TAACCAATTC TCAGCGGCTG ATCATGTGTA TATTATCAAA
GGAGATCGCC AAGGGCTGCA AGATTTAAGA AAATCTCCCT TTGCTGAAGC CACAGAGTTC
ATCGAACCAA ATTACATTTA CAGGAAAGTT CCGGAAGGGA AGACTACAGC ACTGGGAGAA
CAGTTCCTAC CCCAAAACAA TCAAAATCCT AAACCTTCAT TAATTGGCCC CAACGACCAA
TATTACAGCA AACAGTGGAA CCTCCACAAA ATTGGCATAG AAGGCGCATG GACTCGCACT
AAAGGGAGTG GCATAACAGT TGCAGTCATT GACACAGGTA TCACTCAGGT GCGCGACTTA
GCAGAAACAA AATTTGTTAA AGGCTACGAC TTCGTAAACG ACACAGAAAT AGTCAAAGAC
GACAACGGAC ATGGCACCCA TGTAGCCGGC ACAGTCGCCC AAACCACTAA TAATCAATAT
GGTGTAGCTG GAGTCGCCTA CGAAGCTAGT CTCATGCCCT TAAAAGTGTT AAATGCAGAT
GGTAGTGGTA CAGTTGCCGA CATCGCCGAA GCCATCAAAT TTGCCGCAGA TAAAGGCGCA
GATGTTATTA ATATGAGCTT AGGTGGTGGT GGTGAAAGTA AACTCATGCA AGATGCCATT
GAGTACGCCT ACAAAAAAGG TGTAGTTATT ATTGCCGCAG CCGGAAATGA AAGTACAGAT
GGGGCGAGTT ATCCAGCCCG TTATCCTCAT GTAATTGGCG TTTCTGCCTT TGGCCCAGAC
GGAGAAAAAG CATCCTACTC TAACTTTGGT GCTGGTGTAG ATATCTCCGC CCCTGGTGGT
AGTGAAACAG GAACAATTCT CCAGGAGACC ATTGACGAAA ACGGCCAAGG GCTATTTTTG
GGACTCCAAG GCACAAGTAT GGCCTCTTCA CACGTTGCAG GTGTGGCAGC TTTAATTAAA
GCATCTGGAG TCACAGAACC TGATCAAATT TTAAAAGTCC TCCAACAGTC AGCCAGAGTT
ATCCAAGACG ACGCTTTAAA TTATTACGGT GCTGGACAAC TTAACGCCGA AGCAGCAGTC
AAACTAGCCA GCGAAGGACA AATTAGTTTT CCAGACTTCT TTCGGTGGTT GCGGGATAAC
GGCTATATCA ACCCTGGTTT TTGGATTGAT GGCGGTGCGA TCGCGCTAGT ACCTAAAATA
TTAATGGTAG TAGGTTCCTA TCTCCTCGCT TGGTTTCTAC GGGTTTACTT CCCCTTCGCT
TGGAGTTGGT CTTTATCTAG TGGCTTAATT TTTGGTAGTT CTGGACTCTT CTTCCTGAAG
GGATTTTATA TCTTTGACCT TCCCCAGTGG CCTTTCCGAG TTTTGGGCAG TTCTCTTCCC
GAACTAGGTA ACAGCTTACA GGGAACAGGC ACTTTAAATC CTCTTTTTGC CAGTGTGCTA
ATTCCTGTTG TGTTGATAGT ATTCCTCCTA GGACATCCCA ATTGGAAGTG GTTTGCTGTT
GGTTCTACCC TTGGCATAGC GGCTTGTTTA ACAATCAGTG CCATTTATGA CCCTGCTGTT
TGGGGACTAG GAGATGGTAA CATAGCCCGT ATTTTTCTCA TCGTTAATGC TTTACTTTGT
TATGGATTGG TACGTTTAGC ATTAAAAGAA GACAAACAAA CAGCTTAA
 
Protein sequence
MRRFILLCLF LIGLVTVVFG FLNVQGLASK GEFETILLDF REDIPALVIN QDLQLIAQQY 
HITPRLDNQF SAADHVYIIK GDRQGLQDLR KSPFAEATEF IEPNYIYRKV PEGKTTALGE
QFLPQNNQNP KPSLIGPNDQ YYSKQWNLHK IGIEGAWTRT KGSGITVAVI DTGITQVRDL
AETKFVKGYD FVNDTEIVKD DNGHGTHVAG TVAQTTNNQY GVAGVAYEAS LMPLKVLNAD
GSGTVADIAE AIKFAADKGA DVINMSLGGG GESKLMQDAI EYAYKKGVVI IAAAGNESTD
GASYPARYPH VIGVSAFGPD GEKASYSNFG AGVDISAPGG SETGTILQET IDENGQGLFL
GLQGTSMASS HVAGVAALIK ASGVTEPDQI LKVLQQSARV IQDDALNYYG AGQLNAEAAV
KLASEGQISF PDFFRWLRDN GYINPGFWID GGAIALVPKI LMVVGSYLLA WFLRVYFPFA
WSWSLSSGLI FGSSGLFFLK GFYIFDLPQW PFRVLGSSLP ELGNSLQGTG TLNPLFASVL
IPVVLIVFLL GHPNWKWFAV GSTLGIAACL TISAIYDPAV WGLGDGNIAR IFLIVNALLC
YGLVRLALKE DKQTA