Gene Aazo_1016 details


Gene Information

Locus tag: Aazo_1016
Symbol:
ID: 9338811
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 1083601
End bp: 1086969
Gene Length: 3369 bp
Protein Length: 1122 aa
Translation table: 11
GC content: 48%
IMG OID:
Product: DNA-directed RNA polymerase subunit beta
Protein accession: YP_003720506
Protein GI: 298490329
COG category:
COG ID:
TIGRFAM ID:
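
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length (the gene sequence on this page ends in TAA). The function names are illustrative only and do not come from any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS that includes one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(1083601, 1086969)
print(length)                     # 3369
print(protein_length_aa(length))  # 1122
```

Both results agree with the Gene Length and Protein Length values listed in the record.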


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTAACG AAACGTATAT GGAATCCGCC TTTCTGTTAC CCGACTTGAT TGAAATCCAG 
CGTTCAAGCT TTCGCTGGTT TTTGGAAGAA GGGCTAATAG AAGAGTTGAA CTCCTTTAGT
CCAATTACAG ACTATACAGG GAAACTAGAA CTGCACTTTT TAGGTGATAA GTACAAACTT
AAAGAACCAA AGTACAGCGT CGAAGAATCG AAAAGGCGGG ATAGCACTTA TGCAGTTCAA
ATGTATGTCC CCACAAGGCT TTTAAACAAA GAAACAGGGG ATATCAAAGA ACTAGAAGTA
TTTATAGGTG ATCTGCCTTT GATGACAGAT CGGGGTACGT TTATCATTAA CGGAGCTGAG
CGAGTCATTG TCAACCAGAT AGTGCGATCA CCAGGAGTTT ATTACAAATC AGAACTTGAT
AAAAATGGAC GACGTACCTA TTCTGCCAGC TTAATACCCA ACCGGGGGGC ATGGCTGAAA
TTTGAAACAG ACCGTAACGA CCTAGTGTGG GTACGCATCG ACAAAACCCG GAAACTTTCA
GCCCAGGTAC TCCTCAAAGC CTTAGGATTA TCAGATAACG AAATCTTTGA TGCCCTACGC
CACCCCGAAT ACTTCCAAAA AACCATCGAA AAAGAAGGGC AATTTTCCGA AGAAGAAGCC
CTAATGGAGT TATACCGTAA ACTACGTCCA GGTGAACCAC CCACCGTATT AGGCGGACAA
CAACTCCTAG ACTCACGCTT CTTCGACCCG AAACGTTATG ACCTCGGTCG TGTCGGTCGC
TACAAACTCA ACAAAAAACT CCGCCTTTCT GTCCCCGATA CCATGCGCAT CCTAACCCCT
GGGGACATCT TAGCCGCAGT AGATTACCTA ATCAACCTAG AATATGACAT TGGTAGTATA
GATGATATTG ATCACCTCGG AAATCGCCGG GTTAGAAGCG TTGGAGAACT GCTGCAAAAC
CAAGTCAGAG TAGGGCTAAA CCGTTTAGAA CGGATAATTC GAGAACGGAT GACAGTATCT
GATGCCGAAG TTTTGACACC AGCATCTTTG GTTAACCCTA AACCCTTAGT AGCAGCAATT
AAAGAATTCT TTGGTTCTAG CCAACTAAGT CAGTTTATGG ATCAAACCAA TCCATTGGCA
GAACTAACCC ACAAACGCCG TCTCAGCGCC TTGGGTCCTG GTGGTCTAAC AAGAGAACGG
GCAGGTTTTG CAGTGCGAGA TATCCACCCC AGTCACTACG GACGGATATG CCCCATAGAA
ACACCAGAAG GGCCAAACGC AGGATTAATA GGTTCATTAG CCACCCATGC CCGTGTTAAC
CAATATGGAT TTTTAGAAAC CCCCTTTAGA CCAGTAGAAA ATGGCAAAGT GCGGTTTGAT
GTGCAGCCAA TTTACATGAC TGCTGACGAA GAAGACGACC TGCGGACAGC AACAGGTGAT
ATTCCCGTAG ATGAAAACGG CTACATCAAA GGGCCACAAG TACCAGTACG TTATCGTCAA
GACTGGGCGA CAACAACACC AGAACAAGTG GATTATGTAG CAGTATCACC AGTGCAAATT
GTGTCTGTTG CCACCAGCAT GATTCCCTTT CTGGAACATG ATGATGCTAA CCGGGCGCTG
ATGGGTTCCA ATATGCAACG GCAAGCTGTA CCCCTACTCA AGCCAGAACG TCCCCTAGTG
GGAACAGGCC TAGAAGCTCA AGGCGCGAGA GACTCCGGAA TGGTGATTGT TTCCCGTACC
GATGGCGACG TTGTGTATGT AGATGCAGCA CAAATTCGGG TGAGAGTCCG GGGTGATTTG
TCAGGAGTCA CGGGTAACTT GATAGGCGGA AAACAACAGA CAAACGAACA AGGACAACAG
GTAACAGACA AAGAAATTAG ATATGTATTG TCCAAATACC AACGTTCCAA CCAAGACACC
TGCCTGAACC AAAAACCCCT GGTTCGCATT GGCGAAAAAG TCGTAGCCGG ACAAGTGTTG
GCAGATGGGT CTTCCACCGA AGGAGGAGAA CTGGCACTAG GACAAAACAT TGTCGTGGCT
TATATGCCCT GGGAAGGCTA TAACTACGAA GACGCGATTT TGATTTCTGA GCGCCTGGTG
CAAGATGACA TTTATACCTC AATTCACATC GAAAAATATG AAATTGAGGC TAGACAGACA
AAGCTGGGAC CAGAAGAAAT CACCAGAGAA ATTCCCAACG TGGGGGAAGA TGCCCTCAGA
CAACTAGATG AACGGGGAAT CATCCGCATT GGTGCCTGGG TAGAAGCTGG AGACATTCTG
GTAGGAAAAG TCACACCAAA GGGTGAATCT GACCAACCAC CAGAAGAAAA ACTGCTGCGG
GCTATCTTTG GTGAAAAAGC GCGAGATGTG CGAGACAACT CCCTGCGAGT TCCCAATGGA
GAAAAAGGCC GGGTCGTGTA TGTGCGGTTG TTCACCAGAG AACAAGGGGA CGAACTACCA
CCAGGAGCAA ATATGGTGGT GCGGGTGTAT GTTGCCCAAA AACGCAAAAT CCAAGTCGGC
GACAAAATGG CAGGTAGACA CGGCAACAAG GGGATTATTT CCCGCATTCT CCCCGCTGAA
GATATGCCCT ACCTGCCTGA TGGTTCACCA GTGGATATTG TCCTCAATCC CTTGGGTGTA
CCCAGCCGGA TGAATGTTGG TCAAGTATTT GAATGTTTAT TAGGTTGGGC GGGTCACAAT
TTGGGAGTCA GATTTAAGAT CACTCCCTTT GACGAAATGT ATGGGGAAGA ATCTTCCCGC
AGGATTGTGC ATGGCAAGCT GCAAGAAGCT AGGGACGAAA CTGGCAAAAA CTGGGTATAT
AATCCCCATG ATCCCGGCAA AATTATGGTC TATGACGGAC GCACTGGCGA ACCCTTTGAC
CGTCCAGTGA CAGTGGGTGT AGCTTATATG CTCAAGCTGG TGCATTTGGT GGATGACAAA
ATTCACGCCC GTTCCACAGG ACCATACTCC CTTGTTACCC AACAACCTTT AGGTGGTAAG
GCGCAACAAG GTGGTCAAAG ATTTGGAGAA ATGGAAGTGT GGGCGCTGGA GGCCTTTGGT
GCGGCTTACA CCTTGCAGGA ATTATTAACT GTCAAATCCG ATGATATGCA AGGTCGGAAC
GAAGCCCTCA ATGCCATTGT AAAAGGTAAG GCCATTCCCC GTCCTGGTAC TCCTGAATCC
TTCAAGGTAT TGATGAGGGA GTTGCAGTCC TTGGGCTTAG ATATTGCTGT GCATAAGGTG
GAAACCCAAG CCGATGGCAG TTCCTTGGAT GTGGAAGTGG ACTTGATGGC AGACCAAGCC
TCCCGCCGTA CTCCACCGCG CCCCACCTAC GAGTCCCTTT CTCGTGAGTC ACTGGAAGAA
GACGAATAA
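
The GC content field (48%) can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as printed here (blocks of 10 bases separated by spaces and line breaks); `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Small illustration on a toy string, not on the gene itself.
print(gc_percent("GGCC AATT\nGC"))  # 60.0
```

Passing the full gene sequence to `gc_percent` and rounding the result gives a value to compare against the GC content field.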
 
Protein sequence
MTNETYMESA FLLPDLIEIQ RSSFRWFLEE GLIEELNSFS PITDYTGKLE LHFLGDKYKL 
KEPKYSVEES KRRDSTYAVQ MYVPTRLLNK ETGDIKELEV FIGDLPLMTD RGTFIINGAE
RVIVNQIVRS PGVYYKSELD KNGRRTYSAS LIPNRGAWLK FETDRNDLVW VRIDKTRKLS
AQVLLKALGL SDNEIFDALR HPEYFQKTIE KEGQFSEEEA LMELYRKLRP GEPPTVLGGQ
QLLDSRFFDP KRYDLGRVGR YKLNKKLRLS VPDTMRILTP GDILAAVDYL INLEYDIGSI
DDIDHLGNRR VRSVGELLQN QVRVGLNRLE RIIRERMTVS DAEVLTPASL VNPKPLVAAI
KEFFGSSQLS QFMDQTNPLA ELTHKRRLSA LGPGGLTRER AGFAVRDIHP SHYGRICPIE
TPEGPNAGLI GSLATHARVN QYGFLETPFR PVENGKVRFD VQPIYMTADE EDDLRTATGD
IPVDENGYIK GPQVPVRYRQ DWATTTPEQV DYVAVSPVQI VSVATSMIPF LEHDDANRAL
MGSNMQRQAV PLLKPERPLV GTGLEAQGAR DSGMVIVSRT DGDVVYVDAA QIRVRVRGDL
SGVTGNLIGG KQQTNEQGQQ VTDKEIRYVL SKYQRSNQDT CLNQKPLVRI GEKVVAGQVL
ADGSSTEGGE LALGQNIVVA YMPWEGYNYE DAILISERLV QDDIYTSIHI EKYEIEARQT
KLGPEEITRE IPNVGEDALR QLDERGIIRI GAWVEAGDIL VGKVTPKGES DQPPEEKLLR
AIFGEKARDV RDNSLRVPNG EKGRVVYVRL FTREQGDELP PGANMVVRVY VAQKRKIQVG
DKMAGRHGNK GIISRILPAE DMPYLPDGSP VDIVLNPLGV PSRMNVGQVF ECLLGWAGHN
LGVRFKITPF DEMYGEESSR RIVHGKLQEA RDETGKNWVY NPHDPGKIMV YDGRTGEPFD
RPVTVGVAYM LKLVHLVDDK IHARSTGPYS LVTQQPLGGK AQQGGQRFGE MEVWALEAFG
AAYTLQELLT VKSDDMQGRN EALNAIVKGK AIPRPGTPES FKVLMRELQS LGLDIAVHKV
ETQADGSSLD VEVDLMADQA SRRTPPRPTY ESLSRESLEE DE
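
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is one way to reproduce it, under the assumption that translation table 11 assigns the same amino acids to codons as the standard genetic code (to my knowledge the two differ only in the permitted start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw_dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored; any codon containing a non-ACGT character
    raises KeyError.
    """
    seq = "".join(raw_dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGACTAACG AAACGTATAT GGAATCCGCC TTTCTGTTAC CCGACTTGAT TGAAATCCAG"
print(translate(first_line))  # MTNETYMESAFLLPDLIEIQ
```

The output matches the first 20 residues of the protein sequence, and the final codons of the gene (GAA GAA GAC GAA TAA) translate to the closing residues EEDE followed by a stop.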