Gene Spro_4519 details


Gene Information

Locus tag: Spro_4519
Symbol:
ID: 5606666
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 5010036
End bp: 5011025
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 52%
IMG OID: 640940086
Product: DNA-directed RNA polymerase subunit alpha
Protein accession: YP_001480741
Protein GI: 157372752
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID: [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type
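
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the record above.
START_BP = 5010036
END_BP = 5011025


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 990
print(protein_length(length_bp))  # 329
```

Both results agree with the Gene Length (990 bp) and Protein Length (329 aa) fields of this record.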


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00027981
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000064627
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCAGGGTT CTGTGACAGA GTTTCTAAAA CCGCGCCTGG TAGATATCGA GCAAGTCAGT 
TCGACGCACG CCAAGGTGAC CCTTGAGCCG TTAGAGCGTG GCTTTGGCCA TACTCTCGGC
AATGCACTGC GCCGTATTCT GCTTTCGTCT ATGCCGGGTT GCGCGGTGAC CGAGGTTGAG
ATTGATGGTG TACTGCATGA GTACAGCACC AAAGAAGGCG TACAGGAAGA TATCCTGGAG
ATCCTGCTCA ACCTGAAAGG GCTGGCGGTG AGAGTTCAAG GGAAAGATGA AGTTATCCTT
ACCCTGAATA AATCTGGCAT TGGCCCTGTG ACCGCTGCCG ACATTACCCA TGATGGTGAT
GTCGAAATCG TCAAGCCTCA GCACGTGATC TGCCACCTGA CCGATGAAAA CGCTGCTATC
AGCATGCGTA TCAAAGTTCA ACGTGGTCGT GGTTATGTGC CGGCTTCTGC CCGAATTCAT
TCGGAAGAAG ATGAGCGCCC GATCGGTCGT CTGTTGGTTG ACGCCTGCTA TAGCCCTGTA
GAGCGTATTG CCTACAATGT TGAAGCAGCG CGTGTAGAAC AGCGTACTGA CCTGGACAAG
CTGGTCATCG AAATGGAAAC CAATGGCACG ATCGATCCTG AAGAGGCGAT CCGCCGTGCG
GCTACCATCC TGGCTGAACA ACTTGAAGCT TTTGTTGACT TACGTGATGT TCGTCAACCA
GAAGTTAAAG AAGAGAAACC AGAATTCGAT CCGATTCTGC TGCGCCCTGT TGACGATCTG
GAATTGACTG TCCGCTCTGC TAACTGCCTT AAGGCAGAAG CTATCCACTA CATCGGTGAT
CTGGTACAGC GTACCGAGGT TGAGTTGCTG AAAACGCCGA ACCTGGGTAA AAAATCTCTT
ACTGAGATTA AAGACGTGCT GGCCTCCCGT GGACTGTCAC TGGGCATGCG CCTGGAAAAC
TGGCCACCGG CAAGCATTGC TGACGAGTAA
 
Protein sequence
MQGSVTEFLK PRLVDIEQVS STHAKVTLEP LERGFGHTLG NALRRILLSS MPGCAVTEVE 
IDGVLHEYST KEGVQEDILE ILLNLKGLAV RVQGKDEVIL TLNKSGIGPV TAADITHDGD
VEIVKPQHVI CHLTDENAAI SMRIKVQRGR GYVPASARIH SEEDERPIGR LLVDACYSPV
ERIAYNVEAA RVEQRTDLDK LVIEMETNGT IDPEEAIRRA ATILAEQLEA FVDLRDVRQP
EVKEEKPEFD PILLRPVDDL ELTVRSANCL KAEAIHYIGD LVQRTEVELL KTPNLGKKSL
TEIKDVLASR GLSLGMRLEN WPPASIADE
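
The relationship between the gene sequence and the protein sequence listed above can be illustrated with a short translation sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons). The helper names `clean`, `translate` and `gc_percent` are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the listing above.
FIRST_LINE = "ATGCAGGGTT CTGTGACAGA GTTTCTAAAA CCGCGCCTGG TAGATATCGA GCAAGTCAGT"
LAST_LINE = "TGGCCACCGG CAAGCATTGC TGACGAGTAA"

print(translate(FIRST_LINE))  # MQGSVTEFLKPRLVDIEQVS
print(translate(LAST_LINE))   # WPPASIADE*
```

The two outputs match the first 20 and last 9 residues of the protein sequence, with the trailing `*` corresponding to the TAA stop codon. Passing the full 990 bp gene sequence to `gc_percent` gives a value that can be compared with the GC content field of the record.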