Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_0733 |
Symbol | |
ID | 5693568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 852995 |
End bp | 854014 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641263330 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_001528620 |
Protein GI | 158520750 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.305815 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAATG AATTGATGTA TATGAACTGG CAGGAAATGA TTCAGCCGGA CAAGATTCAG GTTGAGGCTG CCACCCCTTT TTATGGAAAG TTTATCTGCG AGCCGTTGGG CCGGGGATTT GGCATTACCA TCGGCAACGC GTTGCGGCGC ATCATCATCT CCTCGCTGCA CGGCGCCGCG ATTACGTCCG TGAAGATCGA CAATGTGATG CATGAATACT CCACGGTTGA AGGTGTACTG GAGGATGTGT CCGAGATCAT ACTGAACCTC AAAGAGGTGC GGCTGAAGAC CAGCACGGCG GCGGCCAAGA CCATTCGTAT CGACGCCGCG GGACCCGGGG TGGTGACCGC CGGTGACATC GCCAGCCCCG ACGGGCGGGT GGAAATCCTG AACCCGGAGT CTCACATCGC CACCCTTTCC GAAGGCGCCA CCCTGAAGAT GGAGATGACG GTAAAGGTGG GCCGGGGATA CGCGCTGGCC GAGGCCAACA AGGATGAGGA GACGCCGGTC AACACCATTC CCATTGATGC CATGTTCTCC CCGATTCGCC GGGTCAATTA CGTGGTGGGC AACTCCCGGG TCAAGCAGAA GACCGATTTT GACAAGCTGA CCCTGGAAGT GTGGACCGAC GGCAGCGTGC TGCCCGAGGA CGCGGTGGCC TTTGCCGCCA AGATCATGAA AGAGCAGATG AATGTTTTCA TCAATTTTGA TGAAAGCGCC GAGCCCGAGC ATGCCGGCCG GAAAGACGAC AGCGGCGGCA AGGTGTTCAA CGAAAACCTG TACCGCAGCG TCAACGAGCT CGAACTTTCC GTCCGCAGCT CCAACTGCCT GAAGAACGCC GAAATCGACA AGCTTTATCA ACTGGTCCAG AAGACCGAGT CGGAGATGCT GAAGACCAAA AACTTCGGCA GAAAGTCATT GAACGAAATC AAGGAACTGC TGGCCGAGAT GGGCCTTTCC CTTGGCATGG ACCTGGAAGG GTTTGTGCCG CCGGCGGAAG ACAATAAGGA AGGGGAGTAG
|
Protein sequence | MENELMYMNW QEMIQPDKIQ VEAATPFYGK FICEPLGRGF GITIGNALRR IIISSLHGAA ITSVKIDNVM HEYSTVEGVL EDVSEIILNL KEVRLKTSTA AAKTIRIDAA GPGVVTAGDI ASPDGRVEIL NPESHIATLS EGATLKMEMT VKVGRGYALA EANKDEETPV NTIPIDAMFS PIRRVNYVVG NSRVKQKTDF DKLTLEVWTD GSVLPEDAVA FAAKIMKEQM NVFINFDESA EPEHAGRKDD SGGKVFNENL YRSVNELELS VRSSNCLKNA EIDKLYQLVQ KTESEMLKTK NFGRKSLNEI KELLAEMGLS LGMDLEGFVP PAEDNKEGE
|
| |