Gene Aazo_4771 details

Gene Information

Locus tag: Aazo_4771
Symbol:
ID: 9342578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4871120
End bp: 4872220
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 38%
IMG OID:
Product: PfpI family intracellular protease
Protein accession: YP_003723072
Protein GI: 298492895
COG category:
COG ID:
TIGRFAM ID:
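
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database tooling.

```python
# Values copied from the Gene Information table.
START_BP = 4871120
END_BP = 4872220
GENE_LENGTH_BP = 1101
PROTEIN_LENGTH_AA = 366


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The check does not depend on the strand, which is left blank in this record.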


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.102673
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAATA ATAATCATAT TATTGGAGAT CAAAAAGTTG CTATCCTGAT TGAACAAGGA 
GTAGAAGACG TAGAATTTAT TGTTCCTTTT AATGGTTTGA AACAAGCAGG AATAGAGGTA
ATAGTGCTTG GTTCACGGAT GAATGAAAAA TATAAAGGTA AACGAGGCAA ACTCAGCATC
CAAGCTGATG CAACGACAAC AGAAGTTGTG GCTGATGAAT TTGCAGCAGT GGTAATTCCT
GGTGGTATGG CTCCTGATAA AATGCGCCGC AATTGTAATA CAGTTTGGTT TGTAATGGAG
GCTATGAAGC AAGGTAAATT AATAGCCGCA GTATGCCACG GTCCACAGGT TTTAATTGAA
GGTGATTTAC TGAAAGGTAA ACAAGTAACA GGATTTGCTG CTATTTGCAA AGACATAACT
AATGCTGGTG CCAATTATCT AGATGAACCA GTAGTTGTGG ATGGTAATTT GATTACATCT
CGTGAACCTG GAGACTTGGC AATTTTTACA ACGGTACTGT TAAATCGTTT AGGTTATGGT
GGTAAAGATG CTGTTTTACC TAATGAAAAA GATACTGGTG CTGAATGGTG GAAATTAGCT
GATGCTTGGG GTGGTTCAAC AAAAAACGAA ATTGTTAAAG GTTTGAATAC TGCTTTAGGT
GGTGAGCGTT ATTCGTTAGA AGCTTTAGAA AAATATTTAG AGAAAGAATC AGATGAGGAA
GTGAAAAATC TGTTTCAAGA GATGATAACT AATAAAAATC AGCACATTAA AAAGCTGGAA
AGTTATCTTC ATCGTTTCCA TGAAAAACCG TCTTTGACTG CAAATATTGC TAATCAATAT
GCTAAGCTTA AAACAGCTTT AACGGGTAGT GAGAGTATCT ATCAAATTCG TTGTGCATTG
GGTGATATAC AAACAGCAAT TGGTGATATT ACCAACTTGT CTGCAATGCT TACTGACCCA
GTAGCAACGG CAATTTTTAA ACAAATTCAC AACGATTTGG GTAAATATGA ACAGCGATTG
ATTGAGCTTT ATCGAGGGCG GATTGCTGCT GGTGTGAGGC CTCCTAAACC AACTTCTAGG
GCGGCTGTAA CTCAAGTTTA A
 
Protein sequence
MRNNNHIIGD QKVAILIEQG VEDVEFIVPF NGLKQAGIEV IVLGSRMNEK YKGKRGKLSI 
QADATTTEVV ADEFAAVVIP GGMAPDKMRR NCNTVWFVME AMKQGKLIAA VCHGPQVLIE
GDLLKGKQVT GFAAICKDIT NAGANYLDEP VVVDGNLITS REPGDLAIFT TVLLNRLGYG
GKDAVLPNEK DTGAEWWKLA DAWGGSTKNE IVKGLNTALG GERYSLEALE KYLEKESDEE
VKNLFQEMIT NKNQHIKKLE SYLHRFHEKP SLTANIANQY AKLKTALTGS ESIYQIRCAL
GDIQTAIGDI TNLSAMLTDP VATAIFKQIH NDLGKYEQRL IELYRGRIAA GVRPPKPTSR
AAVTQV
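
The sequences above are printed in blocks of ten characters, so they need cleaning before use. The following is a rough sketch, not the database's own method, of how the gene sequence could be joined, summarised by GC fraction, and translated with the amino-acid assignments of NCBI translation table 11. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical. The sketch assumes the sequence is given in coding orientation and contains only A, C, G, and T, and it ignores the alternative start codons that table 11 allows, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence, copied from this record.
first_line = "ATGAGAAATA ATAATCATAT TATTGGAGAT CAAAAAGTTG CTATCCTGAT TGAACAAGGA"
dna = clean_sequence(first_line)
print(translate(dna))  # MRNNNHIIGDQKVAILIEQG
```

The printed residues match the first 20 residues of the protein sequence listed in this record. Passing the full gene sequence to `gc_content` would give a value to compare with the GC content field.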