Gene Information |
Locus tag | Rcas_3999 |
Symbol | |
ID | 5541509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5210954 |
End bp | 5211925 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640896111 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_001434050 |
Protein GI | 156743921 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
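
The length fields above can be cross-checked against the coordinates. This is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 5210954
end_bp = 5211925

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 972 0 323
```

Under these assumptions the result agrees with the Gene Length (972 bp) and Protein Length (323 aa) fields.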
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0332129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000757235 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCTGGACA TCGCCATGCC AAAGATTGAA GTCGTTACTG CTGCCGAAAA CTATGGGCGG TTCAAAATCG AGCCGCTCGA TCCCGGGTAT GGGCATACCC TGGGGAATGC GTTACGCCGC GTGCTCCTGT CGTCTATCCC CGGCGCAGCG ATTACGAAGA TCAAAATTGA TGGGGTGTTT CACGAGTTCT CGACTATTTC GGGGATCAAA GAAGACGTCA CTGAAATTGT CTTGAACATC AAAGGTGTTC GTCTGCGTTC CTATGCCGAA CGTCCGGTGA AAATCTCGTT GTCGAAGCGC GGATCGGGCA TTGTGCGCGC TGCGGATATC GACGCTCCCA GCAATGTCGA GATTGTCAAT CCTTTCCACT ATATCTGTAC GATTGATCGC GACGACGCTA TGCTGGAAAT GGAGATGACG GTCGAACGCG GGCGCGGCTA TCTGCCCGCC GATCAGCGTG ACGCACTGCC CATCGGTGAG ATCCCGATTG ATGCCATTTT CACACCGGTG CCGAAAGTTA ACTATGTGGT CGAGAATATT CGCGTCGGGC AGGCGACCGA CTTCGATAGT CTGCTGATCG AAATCTGGAC GGATGGCACG ATCAAACCGG GGGACGCCCT GAGCCATGCG GCACAGGTGC TTGTGCAATA TTCTCAGACG ATCGCTGATT TCAATCGCCT CTCGACCGAA ACAGAGTCAA CGGCGGCGCC AAATGGGCTG GCCATCCCGG CGGATATTTA TGATACGCCG ATTGAAGAGC TTGATCTCTC GACACGCACC TACAATTGCC TCAAGCGCGC CGACATTACG AAGGTCGGTC AGGTGCTCGA AATGGACGAG AAGGCGCTGC TTTCGGTGCG CAATCTGGGG CAAAAATCAA TGGAGGAGAT TCGCGACAAA CTGATCGAAC GCGGGTATAT CCCCCGGATT GGTCAGACGA CGAACAGCTC TCCCGCAGGA ATCGAGAGTT GA
|
Protein sequence | MLDIAMPKIE VVTAAENYGR FKIEPLDPGY GHTLGNALRR VLLSSIPGAA ITKIKIDGVF HEFSTISGIK EDVTEIVLNI KGVRLRSYAE RPVKISLSKR GSGIVRAADI DAPSNVEIVN PFHYICTIDR DDAMLEMEMT VERGRGYLPA DQRDALPIGE IPIDAIFTPV PKVNYVVENI RVGQATDFDS LLIEIWTDGT IKPGDALSHA AQVLVQYSQT IADFNRLSTE TESTAAPNGL AIPADIYDTP IEELDLSTRT YNCLKRADIT KVGQVLEMDE KALLSVRNLG QKSMEEIRDK LIERGYIPRI GQTTNSSPAG IES
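
The space-separated 10-base blocks of the gene sequence can be joined and sanity-checked before downstream use. The following is a rough Python sketch, not part of the original record. `check_cds` is a hypothetical helper, and the start and stop codon sets are those of NCBI translation table 11, the table listed for this gene. The toy input reuses only the first three codons and the final stop codon of the gene sequence.

```python
# Start and stop codons of NCBI translation table 11
# (bacterial, archaeal and plant plastid code).
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE11_STOPS = {"TAA", "TAG", "TGA"}


def check_cds(blocked_seq):
    """Join whitespace-separated blocks and run basic CDS checks (hypothetical helper)."""
    seq = "".join(blocked_seq.split()).upper()
    return {
        "length": len(seq),
        "multiple_of_three": len(seq) % 3 == 0,
        "valid_start": seq[:3] in TABLE11_STARTS,
        "valid_stop": seq[-3:] in TABLE11_STOPS,
    }


# Toy input: first three codons of the record (GTG CTG GAC) plus its stop codon (TGA).
result = check_cds("GTGCTGGAC TGA")
print(result)
```

Passing the full Gene sequence field instead of the toy input should report a length of 972 if the field matches the Gene Length entry.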