Gene Shel_19710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShel_19710 
Symbol 
ID8395860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSlackia heliotrinireducens DSM 20476 
KingdomBacteria 
Replicon accessionNC_013165 
Strand
Start bp2205629 
End bp2206579 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content60% 
IMG OID644986722 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_003144334 
Protein GI257064662 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00120516 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.467008 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAGT TCATGAGGCC AACCGTCACA ACTGAAGAAG TGAGCGATAC CGACGCTCGT 
TTCGTTGTGG AGCCGCTCGA GCGTGGTTTC GGCTATACGC TGGGCAACTG CATGCGTCGC
GTTCTGCTGT CCTCCCTGGA CGGCGCCCGC GCGACCGCCA TCCAGATTGA AGGCGTGCAG
CACGAGTTCA CGACCGCTGA AGGCGTCATC GAGGATGTCA CCGACATCGT CCTGAACGTC
AAGGGTCTTG TTTTCGCCGC TCTCACCGAG GACTACACTG AAGCAACCGC AACTATCTCC
GTCGAGGGTC CCTGCACGGT GACCGGCGCC GACATCCAGG TGCCCACCGA GTTCACCCTC
ATCAACCCGG AGCATGTCAT CGCGACCGTT GCTGACGGCG GAACTCTCAA CATGAGCATC
CGTATTGGTG TTGGCCGCGG CTACGTCTCC GCCGAGCGCA ACAAGCGCAC GGAAGACCCG
ATCGGCATCA TTCCTGTCGA CAGCCTGTTC TCGCCGGTTC GTCGTTGCAC GCTCGCCGTC
AACGACACCC GCGTGGGTCA GCGTACCGAC TTCGATCAGC TGCTGCTGGA AGTCGAGACC
GATGGCTCCA TCGCTCCGAA CGAAGCAGTC TGCCGTGCAG CTAACATCAT TAACCAGTAC
ATGGGTGCTT TCCTGACCCT GGCTGACATC ACCGACGAGG ACGAGGGCGA CATCCCCTCC
ATCTTCGCCA CCGAAGGCCA GGAGTCCAAC GCTGAGCTTG ACAAGCAGAT CGAGGATCTG
GACCTTTCCG TCCGCTCCTA CAACTGCCTC AAGCGCGCCG GCATCCATTC TGTGCGCCAG
CTGGTCGAGT TCTCCGAGAA CGACCTGCTC AACATTCGTA ACTTTGGCGC GAAGTCCATC
GAGGAGGTCA AGGACAAGCT GATCTCCATG GACCTCAACT TGAAGCAATA G
 
Protein sequence
MAEFMRPTVT TEEVSDTDAR FVVEPLERGF GYTLGNCMRR VLLSSLDGAR ATAIQIEGVQ 
HEFTTAEGVI EDVTDIVLNV KGLVFAALTE DYTEATATIS VEGPCTVTGA DIQVPTEFTL
INPEHVIATV ADGGTLNMSI RIGVGRGYVS AERNKRTEDP IGIIPVDSLF SPVRRCTLAV
NDTRVGQRTD FDQLLLEVET DGSIAPNEAV CRAANIINQY MGAFLTLADI TDEDEGDIPS
IFATEGQESN AELDKQIEDL DLSVRSYNCL KRAGIHSVRQ LVEFSENDLL NIRNFGAKSI
EEVKDKLISM DLNLKQ