Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3683 |
Symbol | rpoA |
ID | 6484432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 3565318 |
End bp | 3566307 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642738955 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_002042666 |
Protein GI | 194442652 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.903884 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 95 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGGTT CTGTGACAGA GTTTCTAAAA CCGCGCCTGG TAGATATCGA GCAAGTGAGT TCGACGCACG CCAAGGTGAC CCTTGAGCCT TTAGAGCGTG GCTTCGGCCA TACTCTGGGT AACGCACTGC GCCGTATTCT GCTCTCATCG ATGCCGGGTT GCGCGGTGAC CGAGGTTGAG ATTGATGGTG TACTACATGA GTACAGCACC AAAGAAGGCG TTCAGGAAGA CATCCTGGAA ATCCTGCTCA ACCTGAAAGG GCTGGCGGTG AGAGTTCAGG GTAAAGATGA AGTTATTCTT ACCCTGAATA AATCTGGCAT TGGCCCTGTG ACTGCAGCCG ATATCACCCA TGATGGGGAT GTCGAAATCG TCAAGCCGCA GCACGTGATC TGCCACCTGA CCGATGAAAA CGCGTCTATT AGTATGCGTA TCAAAGTTCA GCGCGGTCGT GGTTATGTGC CGGCTTCTAC CCGAATTCAT TCGGAAGAAG ATGAGCGCCC AATCGGCCGT CTGCTGGTCG ACGCCTGCTA CAGCCCTGTA GAGCGTATTG CCTACAATGT TGAAGCAGCG CGTGTAGAAC AGCGTACCGA CCTGGACAAG CTGGTCATCG AAATGGAAAC CAACGGCACA ATCGATCCTG AAGAGGCGAT TCGTCGTGCG GCAACCATCC TGGCTGAACA ACTGGAAGCT TTCGTTGATT TACGTGATGT ACGTCAACCG GAAGTGAAAG AAGAGAAACC AGAATTCGAT CCGATCCTGC TGCGCCCTGT TGACGATCTG GAATTGACTG TCCGCTCTGC TAACTGCCTC AAGGCAGAAG CTATCCACTA TATCGGTGAT CTGGTACAGC GTACCGAGGT TGAGCTTCTT AAGACGCCTA ACCTGGGTAA AAAATCTCTT ACCGAGATTA AAGACGTGCT GGCTTCCCGT GGACTGTCTC TGGGTATGCG CCTGGAAAAC TGGCCACCGG CAAGCATCGC TGACGAGTAA
|
Protein sequence | MQGSVTEFLK PRLVDIEQVS STHAKVTLEP LERGFGHTLG NALRRILLSS MPGCAVTEVE IDGVLHEYST KEGVQEDILE ILLNLKGLAV RVQGKDEVIL TLNKSGIGPV TAADITHDGD VEIVKPQHVI CHLTDENASI SMRIKVQRGR GYVPASTRIH SEEDERPIGR LLVDACYSPV ERIAYNVEAA RVEQRTDLDK LVIEMETNGT IDPEEAIRRA ATILAEQLEA FVDLRDVRQP EVKEEKPEFD PILLRPVDDL ELTVRSANCL KAEAIHYIGD LVQRTEVELL KTPNLGKKSL TEIKDVLASR GLSLGMRLEN WPPASIADE
|
| |