Gene Rcas_3999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3999 
Symbol 
ID5541509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5210954 
End bp5211925 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content55% 
IMG OID640896111 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_001434050 
Protein GI156743921 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0332129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000757235 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCTGGACA TCGCCATGCC AAAGATTGAA GTCGTTACTG CTGCCGAAAA CTATGGGCGG 
TTCAAAATCG AGCCGCTCGA TCCCGGGTAT GGGCATACCC TGGGGAATGC GTTACGCCGC
GTGCTCCTGT CGTCTATCCC CGGCGCAGCG ATTACGAAGA TCAAAATTGA TGGGGTGTTT
CACGAGTTCT CGACTATTTC GGGGATCAAA GAAGACGTCA CTGAAATTGT CTTGAACATC
AAAGGTGTTC GTCTGCGTTC CTATGCCGAA CGTCCGGTGA AAATCTCGTT GTCGAAGCGC
GGATCGGGCA TTGTGCGCGC TGCGGATATC GACGCTCCCA GCAATGTCGA GATTGTCAAT
CCTTTCCACT ATATCTGTAC GATTGATCGC GACGACGCTA TGCTGGAAAT GGAGATGACG
GTCGAACGCG GGCGCGGCTA TCTGCCCGCC GATCAGCGTG ACGCACTGCC CATCGGTGAG
ATCCCGATTG ATGCCATTTT CACACCGGTG CCGAAAGTTA ACTATGTGGT CGAGAATATT
CGCGTCGGGC AGGCGACCGA CTTCGATAGT CTGCTGATCG AAATCTGGAC GGATGGCACG
ATCAAACCGG GGGACGCCCT GAGCCATGCG GCACAGGTGC TTGTGCAATA TTCTCAGACG
ATCGCTGATT TCAATCGCCT CTCGACCGAA ACAGAGTCAA CGGCGGCGCC AAATGGGCTG
GCCATCCCGG CGGATATTTA TGATACGCCG ATTGAAGAGC TTGATCTCTC GACACGCACC
TACAATTGCC TCAAGCGCGC CGACATTACG AAGGTCGGTC AGGTGCTCGA AATGGACGAG
AAGGCGCTGC TTTCGGTGCG CAATCTGGGG CAAAAATCAA TGGAGGAGAT TCGCGACAAA
CTGATCGAAC GCGGGTATAT CCCCCGGATT GGTCAGACGA CGAACAGCTC TCCCGCAGGA
ATCGAGAGTT GA
 
Protein sequence
MLDIAMPKIE VVTAAENYGR FKIEPLDPGY GHTLGNALRR VLLSSIPGAA ITKIKIDGVF 
HEFSTISGIK EDVTEIVLNI KGVRLRSYAE RPVKISLSKR GSGIVRAADI DAPSNVEIVN
PFHYICTIDR DDAMLEMEMT VERGRGYLPA DQRDALPIGE IPIDAIFTPV PKVNYVVENI
RVGQATDFDS LLIEIWTDGT IKPGDALSHA AQVLVQYSQT IADFNRLSTE TESTAAPNGL
AIPADIYDTP IEELDLSTRT YNCLKRADIT KVGQVLEMDE KALLSVRNLG QKSMEEIRDK
LIERGYIPRI GQTTNSSPAG IES