Gene Avin_47000 details


Gene Information

Locus tag: Avin_47000
Symbol: rpoD
ID: 7763563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 4770914
End bp: 4772776
Gene Length: 1863 bp
Protein Length: 620 aa
Translation table: 11
GC content: 64%
IMG OID: 643807544
Product: RNA polymerase sigma factor RpoD
Protein accession: YP_002801780
Protein GI: 226946707
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain; [TIGR02937] RNA polymerase sigma factor, sigma-70 family
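
The length fields in this record agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption, since the record does not state it. A minimal Python sketch of the arithmetic follows; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(4770914, 4772776)
print(length, protein_length_aa(length))  # prints "1863 620"
```

Under that assumption, both derived values match the Gene Length and Protein Length fields listed for Avin_47000.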


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGTAA AAGCTCAACA GCAGTCTCGC ATCAAAGAGT TGATCGCTCG CGGCCGCGAA 
CAGGGTTACC TGACCTATGC CGAGGTCAAC GACCACCTGC CCGAGGATAT TTCCGATCCG
GAACAGGTGG AAGACATCAT CCGCATGATC AACGACATGG GGATCAACGT ATTCGAAAGT
GCCCCGGATA CGGATGCCCT GTTGTTGGCC GAAGCCGACA CCGACGAAGC TGCCGCCGAG
GAAGCCGCCG CGGCCCTGGC CGCCGTGGAA ACCGACATCG GTCGTACCAC CGACCCGGTG
CGCATGTACA TGCGCGAAAT GGGTACCGTG GAACTGCTCA CCCGCGAAGG CGAAATCGAG
ATCGCCAAGC GCATCGAGGA AGGCATCCGC GAGGTGATGA GCGCCATCGC CCACTTCCCG
GGCACCGTGG ACGGCATCCT CGCGGAATAC CAGCGCGTCA CCAGCGAAGG CGGCCGCCTG
TCCGACATCC TCAGCGGCTA CATCGATCCC GACGACGATT CGGCGGTGCC TGCCGAGGCC
GAGGTTCCCG TCGATCTCAA GAGCAAGAGC GCCGCGACCC CCGCCGCATC CGAGGACGAG
GACGAGGACG AGAGCGAGGA AAGCGACAGC GACGACGAGG AAGGCGACGG TGGTCCCGAC
CCGGAGATCG CCCGCCAGCG CTTCGGCGCT GTCGCCGAGC AGTTGGGCAG GACCCGTGTG
GCTCTCGAAC GGCACGGGCG CCACAGCGTC GAGGGCCTTG AGGCGCTGCA GGCGCTGGCC
AGCCTGTTCA TGCCGATCAA GCTGGTGCCC AAACAGTACG ACACCCTGGT CGAGCAGGTG
CGTGACGCCC TGACCCGCGT GCGCGCCCAG GAACGCGCGA TCATGCAGTT GTGCGTGCGC
GATGCGCGCA TGCCGCGCGC CGACTTCCTG CGCCAGTTTC CCGGCAACGA GACCGATCAG
GACTGGGTCG ACCTCCTCGC CAAGGGCAAG GCCAAGTACG CCGAAGCCCT GGGCAATCTG
GCCGACGACA TCAAGCAGTG CCAGCAGAAG CTGATCGACC TCGAGCAGAA GGTCGGCCTG
ACCATCGCCG AAATCAAGGA CATCAACCGC CAGATGTCCA TCGGCGAAGC CAAGGCCCGC
CGCGCCAAGA AGGAAATGGT CGAGGCCAAC CTGCGCCTGG TGATCTCCAT CGCCAAGAAA
TACACCAACC GCGGCCTGCA GTTCCTCGAT CTGATCCAGG AAGGCAACAT CGGCCTGATG
AAGGCGGTGG ACAAGTTCGA ATACCGCCGC GGCTACAAGT TCTCGACCTA CGCCACCTGG
TGGATCCGCC AGGCGATCAC CCGCTCGATC GCCGACCAGG CGCGCACCAT CCGCATCCCG
GTGCACATGA TCGAGACCAT CAACAAGCTC AACCGCATCT CCCGCCAGAT GCTCCAGGAG
ATGGGCCGCG AACCCACGCC GGAAGAACTC GGCGAGCGCA TGGACATGCC CGAGGACAAG
ATCCGCAAGG TGCTGAAGAT CGCCAAGGAA CCGATCTCCA TGGAAACCCC GATCGGCGAC
GACGAGGACT CGCATCTGGG CGACTTCATC GAGGACTCGA CCATGCAGTC GCCGATCGAG
GTGGCGACCG TGGAAAGCCT CAAGGAGGCC ACCCGCGAAG TCCTCGCCGG CCTCACCGCC
CGGGAAGCCA AGGTGCTGCG CATGCGCTTC GGCATCGACA TGAACACCGA CCACACCCTC
GAGGAGGTCG GCAAGCAGTT CGACGTGACC CGCGAGCGCA TCCGTCAGAT CGAGGCCAAG
GCCCTGCGCA AGCTGCGCCA CCCCTCGCGA AGCGAGCACC TGCGCTCCTT CCTTGACGAG
TGA
 
Protein sequence
MSVKAQQQSR IKELIARGRE QGYLTYAEVN DHLPEDISDP EQVEDIIRMI NDMGINVFES 
APDTDALLLA EADTDEAAAE EAAAALAAVE TDIGRTTDPV RMYMREMGTV ELLTREGEIE
IAKRIEEGIR EVMSAIAHFP GTVDGILAEY QRVTSEGGRL SDILSGYIDP DDDSAVPAEA
EVPVDLKSKS AATPAASEDE DEDESEESDS DDEEGDGGPD PEIARQRFGA VAEQLGRTRV
ALERHGRHSV EGLEALQALA SLFMPIKLVP KQYDTLVEQV RDALTRVRAQ ERAIMQLCVR
DARMPRADFL RQFPGNETDQ DWVDLLAKGK AKYAEALGNL ADDIKQCQQK LIDLEQKVGL
TIAEIKDINR QMSIGEAKAR RAKKEMVEAN LRLVISIAKK YTNRGLQFLD LIQEGNIGLM
KAVDKFEYRR GYKFSTYATW WIRQAITRSI ADQARTIRIP VHMIETINKL NRISRQMLQE
MGREPTPEEL GERMDMPEDK IRKVLKIAKE PISMETPIGD DEDSHLGDFI EDSTMQSPIE
VATVESLKEA TREVLAGLTA REAKVLRMRF GIDMNTDHTL EEVGKQFDVT RERIRQIEAK
ALRKLRHPSR SEHLRSFLDE
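
The gene and protein sequences above can be checked against each other with a short script. The sketch below is only an illustration: it strips the 10-base grouping, computes a GC fraction, and translates codons until the first stop. It assumes the sequence starts with ATG (as this one does), so the alternative start codons allowed by translation table 11 are not handled. The helper names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order. Translation table 11 assigns
# the same residue to each codon as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record (60 bases, 20 codons).
first_line = "ATGTCCGTAA AAGCTCAACA GCAGTCTCGC ATCAAAGAGT TGATCGCTCG CGGCCGCGAA"
print(translate(first_line))  # prints "MSVKAQQQSRIKELIARGRE"
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` and `translate` would let a reader compare the results with the GC content and Protein Length fields.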