Gene Aazo_3985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3985 
Symbol 
ID9341789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4049087 
End bp4050256 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content42% 
IMG OID 
ProductRpoD subfamily RNA polymerase sigma 70 subunit 
Protein accessionYP_003722597 
Protein GI298492420 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACCAAA CAAAGCAACA ATCCCGAAAG GAAACTATGA ATCTTGCTGA ATTGGGAACA 
ATGGAAATAC TAGAGACTGC TGCTGATCAT GAAGAACCAT CACTTGATAG TTTAGAAGCA
GTAGTATTTG AAGACTCTTC AATCATAGAA AATTTGGAGT TAGATGAACG CGATGGCGAT
GAAATGGCCG CGGCTCGTCC TTCCGGATAC AATAAAACCG AACATGACGA TGCTGTAGGC
GCTTTTTTCA AAGAAATGGC GCGTTATCCC CTCCTTAAAC CTGATGAAGA AGTAGAATTA
GCACGACGAG TTAGGTTTTT AGAAGAAGTA AAAGACTTAC AAGCGGCTTT AGAAGAAGAA
CTAGGACAGC AACCAAGCAG AAGCGAAGTA GCTGCTAAGT TTGAGATGAC AGAAAAACAA
CTAGAAAGCC GCTTATATCA AGGACGGGTA GCCAAGCGAA AAATGATTCG CTCCAATTTA
AGGCTAGTAG TATCTATTGC TAAACGATAT CTTAACCGGG GAGTTCCTTT TCTAGATTTG
ATTCAGGAAG GAGCAATGGG TTTAAACCGC GCTACAGAAA AGTTTGACCC CGATAAAGGA
TATAAATTCT CAACCTACGC TTATTGGTGG ATTAGACAGG CAATTACAAG AGCGATCGCT
AATGATGCCC GCACCATTCG CTTACCGATA CATATTGTTG AAAAACTTAA CAAACTCAAA
AAAGCTCAAC GCGAACTAAA GCAAAAACTA GCTCGTAACC CCTCGGAAGC AGAAATGGCC
ACAGCCTTAG AAATTAGCAT CCAACAACTG CGTCAACTCC AACAACTGCG TCGTCAAGCA
CTCTCCCTTA ACCACCGTGT CGGTAAAGAA GAAGACACCG AATTAATGGA CTTACTAGAA
GACGAAGATA ACCAATCTCC AGAAGCAAAA ATGAACGAAA ACATGATGCG TCAGGAGATT
TGGGAAGTGT TAGGAGATGT CCTCACCCCA CGAGAAAAAG ACGTAATCTC TCTGCGCTAT
GGACTAACAA CCAGCGAACC CTGCACCCTA GAAGAAGTTG GTAATATGTT CAACCTTTCC
CGTGAACGAG TACGCCAAAT TCAAAGTAAA GCCATGCGAA AATTACGCCG TCCCCACATA
GCTAAACGTT TAAAAGGGTG GTTGATATGA
 
Protein sequence
MYQTKQQSRK ETMNLAELGT MEILETAADH EEPSLDSLEA VVFEDSSIIE NLELDERDGD 
EMAAARPSGY NKTEHDDAVG AFFKEMARYP LLKPDEEVEL ARRVRFLEEV KDLQAALEEE
LGQQPSRSEV AAKFEMTEKQ LESRLYQGRV AKRKMIRSNL RLVVSIAKRY LNRGVPFLDL
IQEGAMGLNR ATEKFDPDKG YKFSTYAYWW IRQAITRAIA NDARTIRLPI HIVEKLNKLK
KAQRELKQKL ARNPSEAEMA TALEISIQQL RQLQQLRRQA LSLNHRVGKE EDTELMDLLE
DEDNQSPEAK MNENMMRQEI WEVLGDVLTP REKDVISLRY GLTTSEPCTL EEVGNMFNLS
RERVRQIQSK AMRKLRRPHI AKRLKGWLI