Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_3473 |
Symbol | |
ID | 5112977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 3780777 |
End bp | 3782672 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640493677 |
Product | RNA polymerase sigma factor RpoD |
Protein accession | YP_001178183 |
Protein GI | 146313109 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0442977 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.143212 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGGTAG CTGCACCGCC GACACCAATC AAACGAAATA AGTGTGGATA CCGTCTTATG GAGCAAAACC CGCAGTCACA GCTGAAACTT CTTGTCCAAC GCGGTAAGGA GCAAGGCTAT CTGACCTATG CCGAGGTCAA TGACCATCTG CCGGAAGATA TCGTCGATTC AGATCAAATC GAAGACATCA TCCAAATGAT CAATGACATG GGTATTCAGG TGATGGAAGA AGCACCTGAT GCCGATGATC TGTTGCTGGC TGAAAACTCC AACAACACCG ATGAAGATGC TGAAGAAGCC GCTGCACAGG TTCTGTCCAG CGTTGAATCT GAAATCGGTC GTACCACTGA CCCGGTGCGC ATGTACATGC GCGAAATGGG AACCGTTGAA CTGCTGACCC GCGAAGGCGA AATCGACATC GCGAAACGCA TCGAAGACGG GATCAACCAG GTTCAGTGTT CTGTTGCCGA GTACCCGGAA GCGATCACCT ATCTGCTGGA ACAGTACGAT CGCGTAGAAG CAGAAGAGGC GCGTCTTTCC GACATCATTA CCGGTTTCGT CGATCCTAAC GCTGAAGAAG AAGTCGCTCC GACTGCTACT CACGTTGGTT CTGAGCTCAC GAAAGAAGAG CGTGAAGAGA ACGAGGAAGA AGACGAAGAA GACGAAGAAG AAGAAGACGA CAACAGCATC GATCCTGAGC TGGCTCGCGA GAAGTTTGGC GAACTGCGTA CGCAGTACGA ACTGGCCCGC GACACCATCA AAGCAAAAGG CCGTAGTCAC GCCGCTGCGA AGGAAGAGAT CCAGAAGCTG TCTGACGTGT TCAAGCAGTT CCGCCTGGTA CCAAAGCAGT TTGATTACCT GGTCAACAGT ATGCGCGTCA TGATGGATCG CGTGCGAACT CAGGAACGCA TCATCATGAA ACTGTGCGTT GAACAGTGCA AAATGCCGAA GAAAAACTTC ATCACACTGT TCACCGGCAA CGAAACCAGC GAAACCTGGT TCAACGCTGC TATCGCCATG AACAAACCGT GGTCTGAAAA GCTGCACGAC GTCTCTGATG ACGTTCAGCG CGGCTTGCAG AAACTGCGTC AGATTGAAGA AGAGACCGGC CTGACCATCG AGCAGGTGAA AGACATCAAC CGTCGTATGT CTATCGGCGA AGCGAAAGCC CGTCGTGCGA AGAAAGAGAT GGTTGAGGCG AACTTGCGTC TGGTTATTTC TATCGCGAAG AAATACACCA ACCGCGGTCT GCAGTTCCTG GATCTGATTC AGGAAGGCAA CATCGGTCTG ATGAAAGCGG TAGATAAGTT TGAATACCGT CGTGGTTATA AATTCTCCAC TTACGCAACC TGGTGGATTC GTCAGGCGAT CACCCGCTCT ATCGCGGATC AGGCGCGCAC CATCCGTATT CCGGTGCATA TGATTGAGAC GATTAACAAG CTCAACCGTA TTTCTCGCCA GATGCTGCAA GAGATGGGCC GCGAGCCAAC GCCGGAAGAA CTGGCTGAAC GCATGTTGAT GCCGGAAGAC AAGATCCGTA AAGTGCTGAA AATCGCGAAA GAGCCAATCT CCATGGAAAC GCCAATCGGC GACGATGAAG ATTCGCATCT GGGTGATTTC ATCGAGGATA CTACCCTCGA GCTGCCGCTG GACTCTGCGA CGACCGAGAG CCTGCGTGCT GCCACTCACG ACGTTCTGGC CGGCCTGACC GCCCGCGAAG CGAAAGTCCT GCGTATGCGT TTCGGTATCG ACATGAATAC CGACCACACG CTGGAAGAAG TGGGTAAACA GTTCGACGTA ACCCGCGAAC GTATTCGTCA GATCGAAGCG AAGGCACTGC GCAAACTGCG CCACCCTAGC CGCTCTGAAG TTCTGCGTAG CTTCCTGGAC GATTAA
|
Protein sequence | MLVAAPPTPI KRNKCGYRLM EQNPQSQLKL LVQRGKEQGY LTYAEVNDHL PEDIVDSDQI EDIIQMINDM GIQVMEEAPD ADDLLLAENS NNTDEDAEEA AAQVLSSVES EIGRTTDPVR MYMREMGTVE LLTREGEIDI AKRIEDGINQ VQCSVAEYPE AITYLLEQYD RVEAEEARLS DIITGFVDPN AEEEVAPTAT HVGSELTKEE REENEEEDEE DEEEEDDNSI DPELAREKFG ELRTQYELAR DTIKAKGRSH AAAKEEIQKL SDVFKQFRLV PKQFDYLVNS MRVMMDRVRT QERIIMKLCV EQCKMPKKNF ITLFTGNETS ETWFNAAIAM NKPWSEKLHD VSDDVQRGLQ KLRQIEEETG LTIEQVKDIN RRMSIGEAKA RRAKKEMVEA NLRLVISIAK KYTNRGLQFL DLIQEGNIGL MKAVDKFEYR RGYKFSTYAT WWIRQAITRS IADQARTIRI PVHMIETINK LNRISRQMLQ EMGREPTPEE LAERMLMPED KIRKVLKIAK EPISMETPIG DDEDSHLGDF IEDTTLELPL DSATTESLRA ATHDVLAGLT AREAKVLRMR FGIDMNTDHT LEEVGKQFDV TRERIRQIEA KALRKLRHPS RSEVLRSFLD D
|
| |