Gene Avin_12790 details


Gene Information

Locus tag: Avin_12790
Symbol: rpoN
ID: 7760221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 1245473
End bp: 1246939
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 60%
IMG OID: 643804181
Product: RNA polymerase factor sigma-54
Protein accession: YP_002798480
Protein GI: 226943407
COG category: [K] Transcription
COG ID: [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog
TIGRFAM ID: [TIGR02395] RNA polymerase sigma-54 factor
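
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes a single terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information entries above.
START_BP = 1245473
END_BP = 1246939
GENE_LENGTH_BP = 1467
PROTEIN_LENGTH_AA = 488


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon.

    Assumes the CDS ends with exactly one stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the ones reported in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: 1246939 - 1245473 + 1 = 1467 bp, and 1467 / 3 - 1 = 488 aa.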


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCCGC AGCTGCAACA AGCCATCCGT CTGCTTCAAC TGTCGACCCT GGATCTGCAG 
CAGGAAATCC AGGAGGCCCT CGACTCCAAT CCCATGCTGG AACGTCAGGA GGACGCCGAG
GACTACGACA GCCCGGATAT GCTGGGCGAG CATGGAGACC AGTCGACGCT CGACACCACG
CCCGGCTCTT ACCAGGAAGG CTACGAGAGC GGGGCGGCCA GCGAGGATGG CGGTACCCTC
GAAGAGGGCG ACTGGCACGA GCGGATTCCC AGCGAGCTGC CGGTGGATAC CGCCTGGGAA
GACATCTACC AGACCAGTGC CAGCAACCTG CCGAGCACCG ATGAAGACGA GTGGGACTTC
ACCACCCGCA CCTCCACGGG CGAGAGCCTG CAGAGCCATC TGCTCTGGCA GTTGAACCTG
ACCCCGATGT CGGATACCGA TCGCCTGATC GCCGTCACTC TGATCGACAG CATCAACAGC
GACGGCTATC TGGAGGCCGC CCTGGAGGAA ATCCTCGCCT CTCTGGACCC GGAACTGGGA
GTCGAACTCG ACGAAGTGGA AATGGTGCTG CGCCGCATCC AGCAATTCGA ACCGGCCGGG
ATCGCTGCCC GCGACCTCAG CGAATCGCTG CTGCTGCAAC TGCGCCAGCT ACCGCCCGAT
ACCCCCTGGC TGGAAGAGGC GAAACGACTG GCCAAGGACT ATCTCGACCT GCTGGGTAAC
CGCGACTTCA CCCAGTTGAT GCGACGCATG AAACTCAAGG AAGAAGAATT GCGTCCGGTG
ATCGAGCTGA TCCAGAGCCT CAACCCTCGT CCCGGGGCCC AGATCGAGAG CAGCGAGCCC
GAATATGTCG TGCCTGACGT CATCGTGCGC AAGCACAACG ACCGCTGGCT GGTGGAGCTC
AATCAGGAGG CGGTGCCGCG CCTGCGCATC AACCCGCATT ACGCTGGCTT CATCAGACGC
GCCGACGCCA GCGCCGACAA CACCTTCATG CGCAACCAAC TGCAGGAGGC GCGCTGGTTC
ATCAAGAGCC TGCAAAGTCG CAACGAAACC CTGATGAAGG TTTCGACCCA AATCGTCGAG
CACCAGCGCG GCTTTCTCGA CTACGGCGAA GAGGCCATGA AACCGCTGGT GCTGCACGAT
ATCGCCGAGG CTGTCGGCAT GCACGAATCG ACCATCTCCA GGGTCACCAC CCAGAAATAC
ATGCACACTC CACGCGGTAT TTACGAGCTG AAGTACTTCT TTTCCAGTCA CGTCAGTACC
GCCGAAGGCG GTGAGTGCTC GTCCACGGCC ATCCGCGCCA TCATCAAGAA ATTGATTGCG
GCGGAAAATC CGAAAAAGCC ATTGAGCGAC AGCAAGATCG CTGGTTTACT GGAAGAACAA
GGCATACAGG TGGCTCGCCG TACAGTTGCC AAATACCGGG AATCGCTCAG TATTGCGCCT
TCCAGCGAAC GCAAGCGGCT TATGTAA
 
Protein sequence
MTPQLQQAIR LLQLSTLDLQ QEIQEALDSN PMLERQEDAE DYDSPDMLGE HGDQSTLDTT 
PGSYQEGYES GAASEDGGTL EEGDWHERIP SELPVDTAWE DIYQTSASNL PSTDEDEWDF
TTRTSTGESL QSHLLWQLNL TPMSDTDRLI AVTLIDSINS DGYLEAALEE ILASLDPELG
VELDEVEMVL RRIQQFEPAG IAARDLSESL LLQLRQLPPD TPWLEEAKRL AKDYLDLLGN
RDFTQLMRRM KLKEEELRPV IELIQSLNPR PGAQIESSEP EYVVPDVIVR KHNDRWLVEL
NQEAVPRLRI NPHYAGFIRR ADASADNTFM RNQLQEARWF IKSLQSRNET LMKVSTQIVE
HQRGFLDYGE EAMKPLVLHD IAEAVGMHES TISRVTTQKY MHTPRGIYEL KYFFSSHVST
AEGGECSSTA IRAIIKKLIA AENPKKPLSD SKIAGLLEEQ GIQVARRTVA KYRESLSIAP
SSERKRLM