Gene Aazo_0565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_0565 
Symbol 
ID9338351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp591624 
End bp592877 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content43% 
IMG OID 
ProductRpoD subfamily RNA polymerase sigma 70 subunit 
Protein accessionYP_003720184 
Protein GI298490007 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGCAA CATCTTTTCA TACAGATACC GCTTACGATT CCCCAAGGTC TAATCCAAAA 
TTAGAGCCTG ATTTGGGTAT TGATGATGGT GATTTTTCCC TAGATGATCT ACAGGATTTG
GATATAGCTG CTGCTGTTGA TTCTCATAAT CTAGCTGCAA ACACTAACCG TCGCAGCACA
GACTTAGTAC GTTTATACCT GCAAGAAATT GGTCGGGTGC GTTTGTTGGG GCGGGATGAA
GAAGTTTCCG AAGCTCAAAA AGTGCAGCGG TATTTGCGGT TGCGGATAGT GCTTGCTAAT
GCTGCCAAGC AAGGTGATAC TGTGATTGTA CCCTATCAGC GGTTAATAGA AGTTCAGGAG
CGTTTGTCAT CTGAACTGGG ACATCGTCCA TCATTGGAAA GATGGGCTAA GACTGCTGGT
ATAGACTTAG CGGATTTGAA GCCAACTTTG TTAGAGGGTA AACGTCGTTG GGCTGAGATT
GCCAAATTGA AAGTAGAAGA ACTGGAAAAT GTTCAATCCC AAGGACTCCA AGCCAAGGAA
CACATGATTA AGGCGAATCT TCGTCTGGTG GTTTCTGTGG CCAAGAAATA TCAAAATCGT
GGTTTGGAAT TGTTAGATTT AGTCCAAGAA GGGACTCTGG GTTTAGAGCG AGCTGTGGAA
AAATTTGACC CAACTAAGGG TTATCGTTTT AGTACCTATG CTTATTGGTG GATTCGTCAG
GGAATTACCA GAGCGATTGC TACCTCTAGC CGGACAATTC GCCTCCCTGT TCATATTACA
GAAAAACTGA ACAAAATTAA AAAAGCACAA CGTAAAATCT CTCAAGAGAA AGGTCGTACT
CCCACTTTAG AAGATCTAGC AATTGAATTA GACATGACAC CTACTCAAGT TCGGGAAGTG
TTGTTGAGAG TACCCCGTTC TGTTTCTTTA GAAACCAAAG TCGGAAAAGA TAAAGATACC
GAGTTAGGGG AATTGCTAGA GACTGATAGT ATCACCCCAG AAGAAATGTT AATGCGGGAA
TCTTTACAAA AAGATTTGCA CCATTTACTG GCAGATTTAA CCAGTCGAGA ACGGGATGTG
ATCCTGATGC GGTTCGGTTT ATCTGATGGT CATCCTTACT CCTTGGCCGA AATTGGTCGC
GCTCTAGATT TATCACGGGA ACGGGTACGA CAAATTGAAT CCAAAGCTTT GCAAAAGCTT
CGTCAACCTA AGCGCCGTAA CCTGATTCGG GACTATTTGG AATCTTTGAG TTAG
 
Protein sequence
MPATSFHTDT AYDSPRSNPK LEPDLGIDDG DFSLDDLQDL DIAAAVDSHN LAANTNRRST 
DLVRLYLQEI GRVRLLGRDE EVSEAQKVQR YLRLRIVLAN AAKQGDTVIV PYQRLIEVQE
RLSSELGHRP SLERWAKTAG IDLADLKPTL LEGKRRWAEI AKLKVEELEN VQSQGLQAKE
HMIKANLRLV VSVAKKYQNR GLELLDLVQE GTLGLERAVE KFDPTKGYRF STYAYWWIRQ
GITRAIATSS RTIRLPVHIT EKLNKIKKAQ RKISQEKGRT PTLEDLAIEL DMTPTQVREV
LLRVPRSVSL ETKVGKDKDT ELGELLETDS ITPEEMLMRE SLQKDLHHLL ADLTSRERDV
ILMRFGLSDG HPYSLAEIGR ALDLSRERVR QIESKALQKL RQPKRRNLIR DYLESLS