Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_3726 |
Symbol | |
ID | 5112290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 4040179 |
End bp | 4041168 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640493937 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_001178434 |
Protein GI | 146313360 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0956706 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00604869 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGGGTT CTGTGACAGA GTTTCTAAAA CCGCGCCTGG TAGATATCGA GCAAGTGAGT TCGACGCACG CCAAGGTGAC CCTTGAGCCT TTAGAGCGTG GCTTTGGCCA TACTCTGGGT AACGCACTGC GCCGTATTCT GCTCTCGTCG ATGCCGGGTT GTGCGGTAAC CGAAGTTGAG ATTGATGGTG TACTCCATGA GTACAGCACC AAAGAAGGCG TTCAGGAAGA TATCCTTGAA ATCCTGCTCA ACCTGAAAGG GCTGGCGGTG AGAGTTCAGG GGAAAGATGA AGTTATCCTT ACTCTGAATA AATCTGGCAT TGGCCCTGTG ACTGCAGCCG ATATCACCCA TGATGGTGAT GTTGAAATCG TCAAGCCGCA GCACGTGATC TGCCACCTGA CCGATGAGAA CGCAGCGATT AGCATGCGTA TCAAAGTTCA GCGCGGTCGT GGTTATGTGC CGGCTTCTGC CCGAATTCAT TCGGAAGAAG ATGAGCGCCC AATTGGCCGT CTGCTCGTCG ATGCATGTTA CAGCCCTGTA GAGCGTATCG CCTACAATGT TGAAGCAGCT CGTGTCGAAC AGCGTACCGA CCTGGACAAG CTGGTCATCG AAATGGAAAC CAACGGCACA ATCGATCCTG AAGAGGCGAT TCGTCGTGCG GCAACCATTC TGGCAGAACA ACTTGAAGCT TTTGTTGACC TACGAGATGT TCGTCAGCCA GAAGTTAAAG AAGAGAAACC AGAGTTCGAT CCGATCTTGC TGCGCCCTGT TGACGATCTG GAATTGACTG TCCGCTCTGC TAACTGCCTT AAGGCAGAAG CTATCCACTA TATCGGTGAT CTGGTACAGC GTACCGAGGT TGAGTTGCTG AAAACGCCGA ACCTGGGTAA AAAATCTCTT ACTGAGATTA AAGACGTGCT GGCTTCCCGT GGTCTGTCTC TGGGCATGCG CCTGGAAAAC TGGCCACCAG CAAGCATTGC TGACGAGTAA
|
Protein sequence | MQGSVTEFLK PRLVDIEQVS STHAKVTLEP LERGFGHTLG NALRRILLSS MPGCAVTEVE IDGVLHEYST KEGVQEDILE ILLNLKGLAV RVQGKDEVIL TLNKSGIGPV TAADITHDGD VEIVKPQHVI CHLTDENAAI SMRIKVQRGR GYVPASARIH SEEDERPIGR LLVDACYSPV ERIAYNVEAA RVEQRTDLDK LVIEMETNGT IDPEEAIRRA ATILAEQLEA FVDLRDVRQP EVKEEKPEFD PILLRPVDDL ELTVRSANCL KAEAIHYIGD LVQRTEVELL KTPNLGKKSL TEIKDVLASR GLSLGMRLEN WPPASIADE
|
| |