Gene Information |
Locus tag | Spro_4519 |
Symbol | |
ID | 5606666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | - |
Start bp | 5010036 |
End bp | 5011025 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640940086 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_001480741 |
Protein GI | 157372752 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
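The coordinate and length fields above agree with each other, which a short Python sketch can confirm. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that Gene Length counts the stop codon; both are assumptions about the database's conventions, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5010036, 5011025)
print(length, protein_length(length))  # 990 329
```

The strand does not affect this arithmetic: for a gene on the minus strand, the span between Start bp and End bp is the same, only the reading direction is reversed.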
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00027981 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000064627 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCAGGGTT CTGTGACAGA GTTTCTAAAA CCGCGCCTGG TAGATATCGA GCAAGTCAGT TCGACGCACG CCAAGGTGAC CCTTGAGCCG TTAGAGCGTG GCTTTGGCCA TACTCTCGGC AATGCACTGC GCCGTATTCT GCTTTCGTCT ATGCCGGGTT GCGCGGTGAC CGAGGTTGAG ATTGATGGTG TACTGCATGA GTACAGCACC AAAGAAGGCG TACAGGAAGA TATCCTGGAG ATCCTGCTCA ACCTGAAAGG GCTGGCGGTG AGAGTTCAAG GGAAAGATGA AGTTATCCTT ACCCTGAATA AATCTGGCAT TGGCCCTGTG ACCGCTGCCG ACATTACCCA TGATGGTGAT GTCGAAATCG TCAAGCCTCA GCACGTGATC TGCCACCTGA CCGATGAAAA CGCTGCTATC AGCATGCGTA TCAAAGTTCA ACGTGGTCGT GGTTATGTGC CGGCTTCTGC CCGAATTCAT TCGGAAGAAG ATGAGCGCCC GATCGGTCGT CTGTTGGTTG ACGCCTGCTA TAGCCCTGTA GAGCGTATTG CCTACAATGT TGAAGCAGCG CGTGTAGAAC AGCGTACTGA CCTGGACAAG CTGGTCATCG AAATGGAAAC CAATGGCACG ATCGATCCTG AAGAGGCGAT CCGCCGTGCG GCTACCATCC TGGCTGAACA ACTTGAAGCT TTTGTTGACT TACGTGATGT TCGTCAACCA GAAGTTAAAG AAGAGAAACC AGAATTCGAT CCGATTCTGC TGCGCCCTGT TGACGATCTG GAATTGACTG TCCGCTCTGC TAACTGCCTT AAGGCAGAAG CTATCCACTA CATCGGTGAT CTGGTACAGC GTACCGAGGT TGAGTTGCTG AAAACGCCGA ACCTGGGTAA AAAATCTCTT ACTGAGATTA AAGACGTGCT GGCCTCCCGT GGACTGTCAC TGGGCATGCG CCTGGAAAAC TGGCCACCGG CAAGCATTGC TGACGAGTAA
|
Protein sequence | MQGSVTEFLK PRLVDIEQVS STHAKVTLEP LERGFGHTLG NALRRILLSS MPGCAVTEVE IDGVLHEYST KEGVQEDILE ILLNLKGLAV RVQGKDEVIL TLNKSGIGPV TAADITHDGD VEIVKPQHVI CHLTDENAAI SMRIKVQRGR GYVPASARIH SEEDERPIGR LLVDACYSPV ERIAYNVEAA RVEQRTDLDK LVIEMETNGT IDPEEAIRRA ATILAEQLEA FVDLRDVRQP EVKEEKPEFD PILLRPVDDL ELTVRSANCL KAEAIHYIGD LVQRTEVELL KTPNLGKKSL TEIKDVLASR GLSLGMRLEN WPPASIADE
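The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. Under that assumption, the following minimal sketch shows how the sequence fields could be checked against the Translation table and GC content entries. The helper names are hypothetical, the codon table is written out from the amino acid assignments of NCBI translation table 11 (which are the same as the standard code), and only the first 30 bases of the record are used here; the full sequence can be pasted in the same spaced format.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence in this record.
prefix = "ATGCAGGGTT CTGTGACAGA GTTTCTAAAA"
print(translate(prefix))              # MQGSVTEFLK
print(round(gc_fraction(prefix), 2))  # 0.4
```

The translated prefix matches the first ten residues of the listed protein sequence. The GC value printed here applies only to the 30-base prefix, not to the whole gene.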