Gene Aazo_0500 details

Gene Information

Locus tag: Aazo_0500
Symbol:
ID: 9338286
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 514514
End bp: 516619
Gene Length: 2106 bp
Protein Length: 701 aa
Translation table: 11
GC content: 42%
IMG OID:
Product: Oligopeptidase A
Protein accession: YP_003720142
Protein GI: 298489965
COG category:
COG ID:
TIGRFAM ID:
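
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS is complete and ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 514514
end_bp = 516619
gene_length_bp = 2106
protein_length_aa = 701


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record: 2106 bp and 701 aa.
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Under these assumptions the three fields agree with each other: 516619 - 514514 + 1 = 2106 bp, and 2106 / 3 - 1 = 701 aa.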


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.452158
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGCAA CTGCTATTAT TTCTCAAAAT CCTCTACTCC AAGGCTTTGG TTTACCTGCA 
TTTGCAGAGA TTACACCAGA ACAAGTAGAA CCAGCTTTCA GGCATCTGTT AGCAGATTTG
GAACAACAGC TAAATATTTT AGAAGCTAAT GTCCAACCTA CTTGGGATGG TTTAGTAGAA
CCTCTAGAAA AGCTAACAGA AAGGCTGCAT TGGAGTTGGG GCATCCTGAA CCATCTAATG
GGTGTTCAAA ATAGCCCAGA GCTACGGATA GCTTATCAAA AGGTTCAGCC ACTAGTAGTA
CAGTTTATTA ATACCCTTGG ACAAAGTAAA CCTATATACA AGGCTTTTAA AGCACTCCGC
GCCAGCGATA CTTGGGAAAC CTTAGAATCA GCCCAGCAGC GGATTATTGA AGCGGCTATT
CGAGATGCAA AATTGTCTGG TGTCGGCTTA GAAGGACAAG CGCGAGAGCG TTTTAATGCT
ATTCAGATGG AGTTAGCGGA ACTGGCTACC AAGTTTTCTA ATCATCTTTT GGATGCGACT
ACAGCTTTTA GCTTAATTCT CACTACCAAA GCAGAAATAG AGGGTTTACC TAGCAGTTTA
CTAAGTTTAG CTGCCCAAGC TGCTCGCATT GCTGGAGAAG AATATGCGAC TCCAGAAACA
GGTCCTTGGC ATATTACGTT GGATTTTCCC AGTTATTTCC CGTTTATGCA GCACAGCACT
AGGCGGGATT TGCGCGAAAA ACTTTACAAG GCTTATATTA CCCGTGCTTC TTTCGGGGAA
TTGGATAATA ATCCTTTGAT TGAACGTATT TTAGAACTGC GGCAAGAATT GTCAGAGTTA
CTTAGCTTTG ATAATTTTGC TGAATTGAGT TTGGCTAGTA AGATGGCGAA GAATGTCCCC
GCAGTGGAAA AGCTGTTAGA AGAACTACGC CAAGCTAGTT ATGATGCGGC TGTTAAAGAT
TTAGAAGCAC TTAAAGCTTT TGCTCAATCC AAGGGAGCAG CAGAAGCAGA TAATTTACAA
CATTGGGATA TCAGCTTTTG GGCTGAACGT CAAAGAGAAG AAAAATTTGC TTTTACTGAA
GAGGAATTAC GTCCTTATTT CCCTCTTCCC CAGGTGCTAG ATGGTTTATT TGGGCTAATT
AAACGCCTGT TTGGTGTAAC GGTGACACCA GCTGATGGTC AAGCCCCTGT TTGGCATGAA
GATGTACGCT ATTTTAAAAT CTCTGATAAA TATGCTAATG CGATCGCCTA CTTTTATCTA
GACCCCTACA GCCGTCCCGC TGAAAAACGT GGTGGTGCTT GGATGGATGC CTGTATTCAT
CGCCGCAAAA TCACAGAACT TGGTATAACT AGCATCCGCT TACCTGTAGC TTATTTGATT
TGCAACCAAA CTCCTCCAGT TGATCACAAG CCTAGTTTAA TGACTTTTGA TGAAGTTGAA
ACCTTATTCC ACGAATTTGG ACATGGTTTA CACCACATGC TCACCAAGGT TAATTATACT
GGAGCTGCAG GCATCAATAA CGTTGAATGG GATGCAGTAG AATTACCTAG TCAATTCATG
GAAAACTGGT GTTATGACCG TCCCACTTTG TTTGATATGG CTAAACATTA TGAAACAGGT
AAACCTCTAC CAGAACATTA TTATCAAAAG CTCTTAGCAT CACGTAATTA TATGAGTGGT
TCGGCAATGT TGCGTCAAAT TCACCTCAGC AGCGTGGACT TAGAACTACA CTACCGCTAT
CGTCCTAGTG GTGACGAAAC TCCCATAGAT GTGCGTCAAC GCATTGCTAA AACCACTACC
GTTTTACCGC CACTACCTGA AGATGGTTTT CTATGTGCAT TCGGTCACAT TTTTGAAGGA
GGTTATGCAG CCGGTTACTA CAGCTACAAA TGGGCTGAAG TTCTCAGTGC GGATGCTTTT
GCTGCTTTTG AAGAAGCTGG ACTAGAAGAT GAAGAAGCAA TTTATGTTAC TGGTAGACTT
TACAGAGACA CAGTATTAGC ACTGGGCGGT AGTATGCACC CAATGGAAGT TTTTCAAGCC
TTCCGACATC GAGAACCGAG TACCACGGCT TTACTCAAAC ATAATGGTTT GTTGCCTACT
ATCTAA
 
Protein sequence
MSATAIISQN PLLQGFGLPA FAEITPEQVE PAFRHLLADL EQQLNILEAN VQPTWDGLVE 
PLEKLTERLH WSWGILNHLM GVQNSPELRI AYQKVQPLVV QFINTLGQSK PIYKAFKALR
ASDTWETLES AQQRIIEAAI RDAKLSGVGL EGQARERFNA IQMELAELAT KFSNHLLDAT
TAFSLILTTK AEIEGLPSSL LSLAAQAARI AGEEYATPET GPWHITLDFP SYFPFMQHST
RRDLREKLYK AYITRASFGE LDNNPLIERI LELRQELSEL LSFDNFAELS LASKMAKNVP
AVEKLLEELR QASYDAAVKD LEALKAFAQS KGAAEADNLQ HWDISFWAER QREEKFAFTE
EELRPYFPLP QVLDGLFGLI KRLFGVTVTP ADGQAPVWHE DVRYFKISDK YANAIAYFYL
DPYSRPAEKR GGAWMDACIH RRKITELGIT SIRLPVAYLI CNQTPPVDHK PSLMTFDEVE
TLFHEFGHGL HHMLTKVNYT GAAGINNVEW DAVELPSQFM ENWCYDRPTL FDMAKHYETG
KPLPEHYYQK LLASRNYMSG SAMLRQIHLS SVDLELHYRY RPSGDETPID VRQRIAKTTT
VLPPLPEDGF LCAFGHIFEG GYAAGYYSYK WAEVLSADAF AAFEEAGLED EEAIYVTGRL
YRDTVLALGG SMHPMEVFQA FRHREPSTTA LLKHNGLLPT I
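
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the sequence is read 5' to 3' in frame from the first base, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names are hypothetical; the example uses only the first row of the gene sequence.

```python
# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments are
# shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text):
    """Remove the spaces and line breaks used to lay out the sequence in blocks of 10."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = join_blocks(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence listed above (60 bases, 20 codons).
first_row = "ATGAGTGCAA CTGCTATTAT TTCTCAAAAT CCTCTACTCC AAGGCTTTGG TTTACCTGCA"
print(translate(first_row))  # MSATAIISQNPLLQGFGLPA
```

The output matches the first two blocks of the protein sequence (MSATAIISQN PLLQGFGLPA). Passing the full gene sequence to the same function would be the natural way to check the complete 701-residue translation.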