Gene Aazo_5174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_5174 
Symbol 
ID9342981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5297703 
End bp5299754 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content46% 
IMG OID 
Productexoribonuclease II 
Protein accessionYP_003723348 
Protein GI298493171 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAGAAGG GGACGCTAGT TGAATTTAGG GTTCAAGGCG ATCGCCGTCT GGGAGTGGTA 
GATCGTCCAG ACGGAAAGAC CCGTTGGTTT GTAGTAGATG AACGAGGTCA ATCCCACAGC
CTCGCGCCTC GACAACTAAC CTATACAGTA AACGGGGAAA CTTACAAACC CTCAGATATT
GCCCAATTTT CAGAACAAGT CAAACCGTAC CTAGATCCAT CTAGCTTAGA AGTGGCTTGG
GAATTATTGG TGGAAGATGG AGAAACAGTC ACCCCTAGCC AAATGGCTAA TTTGCTGTTT
TCACAATCTG AACCGCCATA CTGTTACGCT GCTCATTGCT TGTTATCAGA AGACAAACTC
TATTTCAAGC AAAAAGGTGA AGCTTATGAA CCCCGCAGCG CAGCACAAGT AGCAGAACGT
AAGCATCAAA TAGAAGTAGA AGCGCAAAAA GCTAAGGGAC AGCAGGAATT TTTAGCGCGT
GTAGAACAGG CACTCAAGGG TGAAGCAGTA GAATGGCAAC GGCAAGACCG TCAGCGTTTG
GAAGCATTAG AAAAGTATGC ATCCTTAGTG GCGGATATTG TGCGGACGGG GGTAAACTCC
GATTCTTTAG CCCGCGCTTA CCCACCTCCA GCCCCAGTCT TAGAAACCAT GAATATGCTG
GGACGTTCTG GTACACCTCT AGCAGCCTTT CAACTGTTGA TCGACTTGGG TTGGTGGGGT
CCGCATGAGA ACCTGTTCCT GCGTCGTTCT TCAATTCCCG TCCAGTTTCC CAACAAGGTA
TTAGAAGTGG CGCAACAACG CTTGGATTTT CCACCAACTG ACTTAGATAC AAATCGACTG
GATCTAACTC ATCTCAAGGT ATACACAATT GATGATGAAA GTACCACGGA AATAGATGAT
GGTCTAAGTT GGGAAGTATT ACTAGATGGA CAGGAACGGC TATGGGTGCA TATTGCTGAC
CCTACGCGGT GGTTAATGCC AGAAGATGAA TTAGATTTAG AAGCCAGAAA GCGGGGAAGC
ACTGTTTATT TACCGACGGG GATGATTCCC ATGTTCCCGG AGGTACTAGC AACTGGTCCG
ATGAGTTTGG TACAGGGGAA AATTTGTTAC TCCCTCAGCT TTGGCATAGT TTTGGACGAA
ACTGGGGCTG TGGAAGATTA CTGTATTCAT GCCAGCTTGA TGAAGCCTAC CTATCGTCTC
ACCTATGAAG ATGTAGATGA GATGCTGGAA TTAGGGGTAG AAGCAGAACC AGAAATTGCT
GCGATCGCAA ATTGGGCAAA AAAGCGTAAA ACCTGGAGAT ATAACCAAGG AGCCATCAGC
ATCAATATGC CAGAGGCAAT GATTAAAGTC AAAGGCGATG ATGTCACCAT AGATATTTTA
GATGATTCCT CCTCCCGGCA ATTAGTTGCC GAAATGATGA TTCTTGCGGG AGAAGTAGCC
GCACGTTACG GTCAAACCCA TAACATTCCC CTACCCTTCC GTGGTCAACC CCAACCAGAA
CTACCACCAG AAGAAGAATT ACTCCTACTT CCCGCAGGCT TTGTTCGTGC CTGTGCCATG
CGTCGGTGTA TGCCCAAGAG CGAAATGAGT ATTACTCCTG TGCGCCATGC TGGTTTGGGA
CTAGATACCT ACACCCAAGC AACTTCACCA ATTCGTCGTT ACAGCGACCT ATTAACCCAC
TTCCAACTCA AGGCACACCT GCGGGGTGAA GATTTGCCCT TTTCAGCCGA ACAACTCAAA
GAAGTGATGA TGACCGTCAC CACTACCACC CAAGAAGTGA CAATGGTGGA ACGACAAACT
AACAGATATT ATGCTCTAGA ATATTTGCGT CGTCATCCTG AACAGATATG GCAAATCACA
GTTTTGATGT GGTTACGAGA AGATAGCAAT TTAGCATTAA TTCTGTTAGA AGATTTAGGT
TTACAATTGC CAATGGCCTT TAGAAGGACG GTCAATTTAG GAGAACAATT ATTAGTGAAA
GTGAGCCTTG CTGATCCACA GAAAGATATG ATTCAGTTTC AAGAAATAAT TTATCAAGAA
GCTGCTCTTT AA
 
Protein sequence
MEKGTLVEFR VQGDRRLGVV DRPDGKTRWF VVDERGQSHS LAPRQLTYTV NGETYKPSDI 
AQFSEQVKPY LDPSSLEVAW ELLVEDGETV TPSQMANLLF SQSEPPYCYA AHCLLSEDKL
YFKQKGEAYE PRSAAQVAER KHQIEVEAQK AKGQQEFLAR VEQALKGEAV EWQRQDRQRL
EALEKYASLV ADIVRTGVNS DSLARAYPPP APVLETMNML GRSGTPLAAF QLLIDLGWWG
PHENLFLRRS SIPVQFPNKV LEVAQQRLDF PPTDLDTNRL DLTHLKVYTI DDESTTEIDD
GLSWEVLLDG QERLWVHIAD PTRWLMPEDE LDLEARKRGS TVYLPTGMIP MFPEVLATGP
MSLVQGKICY SLSFGIVLDE TGAVEDYCIH ASLMKPTYRL TYEDVDEMLE LGVEAEPEIA
AIANWAKKRK TWRYNQGAIS INMPEAMIKV KGDDVTIDIL DDSSSRQLVA EMMILAGEVA
ARYGQTHNIP LPFRGQPQPE LPPEEELLLL PAGFVRACAM RRCMPKSEMS ITPVRHAGLG
LDTYTQATSP IRRYSDLLTH FQLKAHLRGE DLPFSAEQLK EVMMTVTTTT QEVTMVERQT
NRYYALEYLR RHPEQIWQIT VLMWLREDSN LALILLEDLG LQLPMAFRRT VNLGEQLLVK
VSLADPQKDM IQFQEIIYQE AAL