Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_3035 |
Symbol | |
ID | 7315963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 3182021 |
End bp | 3183880 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643617932 |
Product | RNA polymerase, sigma 70 subunit, RpoD |
Protein accession | YP_002515091 |
Protein GI | 220936192 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCACG CAGAACAGCA GTCACGCCTC AAAGAGCTCA TCGCCAAGGG CAAGGAGCAG GGCTTCCTGA CCTACGCCGA GGTCAACGAT CACCTGCCCG ACACCATCGT CGATCCGGAG CAGATCGAAG ACATCATCGC CATGATCAAC GACATGGGGA TCGCGGTGCA TGAGCAGGCG CCGGACAGCG ACAGCCTGAT CCTCTCCGAC AGCACCGTCT CCACCGACGA GGACGCCGCC GAGGAGGCCG CCGCCGCGCT GGCCTCCGTG GACAGCGAGT TCGGCCGCAC CACCGACCCG GTGCGCATGT ACATGCGCGA GATGGGTACC GTGGAGCTGC TGACCCGTGA AGGCGAAATC AAGATCGCCA AGCGCATCGA GGACGGCCTC GGCCAGGTGC TGTTCGCGCT CGCCCACTAC CCCCAGTCCA TCAGCACCCT GCTGGCCGAG TTCGACAAGG TCGAGGCCGG CGAGATGAAG CTCACCGACC TGGTGGTCAG CTTCATCGAC CCCAACGCCG ATGACGTGCC CGCCGTTGCC GCCAATGAGG CCGACAGCGC GGACACCGAC GCCAGTGCCA GCGACGACGA CAGCGATTCC GACAGCGACG ACGAAGACGA CGACGACACC GCCAGCGAGC CCGAGGACAC CGGCCCCGAT CCGGAAGAGG CCCGGGAGCG CTTTACCAAA CTGCGCGCCA TGTACGAAGA GGCCATGGGC TTCGCCGCCA AGAAGCACAC CGCCAAGTAC CGCAAGGTGC GCGAGGAGAT GGCCAGCTAC TTCATGGAGT TCAAGCTGCT GCCCCGCTTT GTGGACCAGT TGACCGCCGA CCTGCACGAG ATGGTGGAGC GCATCCGCAT GCAGGAGCGC ACCGTGATGG ACATCTGCGT CAACCGCTGC AACATGCCGC GCAAGGAATT CATCACCGCG TTCCCCGAGA ACGAGACCAA CCTCAAGTGG CTGGACACCC AGACCAAGGG CAAGTCCAAG CACGCCAAGC TGCTGGCCGA GCACAAGGAC GAGATCCTGC GCATCCAGAA GAAGCTGGTG GCTATCGAGG ACGAGTCCAA CCTCACCATC AGTGAGATCA AGGACATCAA CCGCCGCATG TCCATCGGCG AGGCCAAGGC ACGCCGCGCC AAGAAGGAGA TGGTGGAGGC CAATCTGCGC CTGGTGATCT CCATCGCCAA GAAGTACACC AACCGCGGCC TGCAGTTCCT GGACCTGATC CAGGAGGGCA ACATCGGCCT GATGAAGGCG GTGGACAAGT TCGAATACCG TCGCGGCTAC AAGTTCTCCA CCTACGCCAC CTGGTGGATC CGGCAGGCCA TCACCCGCTC CATCGCGGAC CAGGCCCGCA CCATCCGCAT CCCGGTGCAC ATGATCGAGA CCATCAACAA GCTCAACCGC ATCAGCCGGC AGATGCTCCA GGAGATGGGT CGCGAGGCCA CCCCCGAGGA ACTCTCGGAA CGCATGGAAA TGCCCGAGGA CAAGGTGCGC AAGGTGCTCA AGATCGCCAA GGAGCCCATC TCCATGGAGA CCCCCATTGG CGATGACGAG GACTCCCATC TGGGCGACTT CATCGAGGAC GTGAACGTCA TGTCCCCCAT CGAGGCCGCC ACCCGCGAGG GCCTGTCCGA GGCCACCCGC GACGTGCTCG CCAGCCTCAC CCCCCGGGAG GCCAAGGTGC TGCGCATGCG CTTCGGCATC GACATGAACA CCGACCACAC CCTGGAAGAA GTGGGCAAGC AGTTCGACGT CACCCGCGAA CGCATCCGCC AGATCGAAGC CAAGGCCCTG CGCAAACTGC GCCACCCGAG CCGCTCCGAG ATGCTCAGAA GCTTCCTGGA TCTGGAGTAA
|
Protein sequence | MDHAEQQSRL KELIAKGKEQ GFLTYAEVND HLPDTIVDPE QIEDIIAMIN DMGIAVHEQA PDSDSLILSD STVSTDEDAA EEAAAALASV DSEFGRTTDP VRMYMREMGT VELLTREGEI KIAKRIEDGL GQVLFALAHY PQSISTLLAE FDKVEAGEMK LTDLVVSFID PNADDVPAVA ANEADSADTD ASASDDDSDS DSDDEDDDDT ASEPEDTGPD PEEARERFTK LRAMYEEAMG FAAKKHTAKY RKVREEMASY FMEFKLLPRF VDQLTADLHE MVERIRMQER TVMDICVNRC NMPRKEFITA FPENETNLKW LDTQTKGKSK HAKLLAEHKD EILRIQKKLV AIEDESNLTI SEIKDINRRM SIGEAKARRA KKEMVEANLR LVISIAKKYT NRGLQFLDLI QEGNIGLMKA VDKFEYRRGY KFSTYATWWI RQAITRSIAD QARTIRIPVH MIETINKLNR ISRQMLQEMG REATPEELSE RMEMPEDKVR KVLKIAKEPI SMETPIGDDE DSHLGDFIED VNVMSPIEAA TREGLSEATR DVLASLTPRE AKVLRMRFGI DMNTDHTLEE VGKQFDVTRE RIRQIEAKAL RKLRHPSRSE MLRSFLDLE
|
| |