Gene Spro_2195 details

Gene Information

Locus tag: Spro_2195
Symbol:
ID: 5605580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 2397239
End bp: 2398264
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 56%
IMG OID: 640937734
Product: DNA-binding transcriptional repressor PurR
Protein accession: YP_001478424
Protein GI: 157370435
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
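
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive (an assumption that fits the listed values, not something the record states) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2397239
end_bp = 2398264

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A coding sequence is read in codons of 3 bases; the last codon is assumed
# to be the stop codon, which does not add an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1026
print(protein_length_aa)  # 341
```

Both results agree with the listed Gene Length (1026 bp) and Protein Length (341 aa).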


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.578042
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000679153
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCAACGA TTAAAGATGT GGCCAAACGC GCTGGCGTTT CCACCACCAC CGTTTCGCAC 
GTCATCAATA AGACTCGTTT CGTCGCCGAA GAGACAAAAG CGGCCGTTGG CGCTGCGATT
AAAGAGCTGC ATTACTCACC CAGCGCCGTA GCGCGCAGCC TGAAGGTCAA TCACACCAAA
TCGATTGGCC TGCTGGCCAC CTCCAGCGAA GCCCCCTACT TTGCCGAAGT GATCGAGGCG
GTAGAAAACA GCTGCTACAG CAAAGGCTAT ACGCTGATTT TGTGTAACTC GCACAACAAT
CTGGACAAAC AGCGGGCCTA TCTGGCGATG CTGGCGCAAA AGCGTGTCGA TGGCCTGCTG
GTGATGTGTT CGGAATATCC AGACCAATTG CTCGGCATGC TGGAAGACTA CCGCAACATC
CCAATGGTCG TGATGGACTG GGGCGCGGCC CGCGGTGATT TTACCGATAG CATCATTGAT
AACGCCTTCG CAGGCGGCTA TCTGGCCGGG CGGTATCTGA TTGAACGCGG TCACCGCGAT
ATCGGAGCCA TTCCAGGCCA ACTGTCGCGC AATACCGGTG GCGGCCGCCA TCAGGGCTTT
TTAAAAGCCA TGGAAGAAGC CAATATTGAA GTACGTGACG AGTGGATTGT TCAGGGTGAC
TTTGAGCCGG AATCCGGCTA CAAGGCCATG CACCAGATCC TGTCGCAAAA ACATCGCCCG
ACCGCGGTAT TCTGCGGCGG CGACATCATG GCGATGGGCG CGATCTGCGC CGCCGACGAA
CTTGGGCTGC GGGTGCCACA AGACATTTCG GTGATTGGCT ACGATAACGT GCGTAACGCC
CGCTATTTCA CCCCGGCACT GACCACCATT CATCAACCCA AAGAGCGTTT GGGTGAAATG
GCGTTCACCA TGTTGCTGGA CCGTATTATC AGCAAGCGTG AAGAGTCGCA GGTGATTGAA
GTGCATCCGA AACTGATTGA GCGTCGTTCG GTCGCTGACG GCCCATTCAT CGATTACCGC
CGCTAA
 
Protein sequence
MATIKDVAKR AGVSTTTVSH VINKTRFVAE ETKAAVGAAI KELHYSPSAV ARSLKVNHTK 
SIGLLATSSE APYFAEVIEA VENSCYSKGY TLILCNSHNN LDKQRAYLAM LAQKRVDGLL
VMCSEYPDQL LGMLEDYRNI PMVVMDWGAA RGDFTDSIID NAFAGGYLAG RYLIERGHRD
IGAIPGQLSR NTGGGRHQGF LKAMEEANIE VRDEWIVQGD FEPESGYKAM HQILSQKHRP
TAVFCGGDIM AMGAICAADE LGLRVPQDIS VIGYDNVRNA RYFTPALTTI HQPKERLGEM
AFTMLLDRII SKREESQVIE VHPKLIERRS VADGPFIDYR R
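
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, not the method used by the database. It builds the codon table from the NCBI base ordering (T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene starts with ATG. The `translate` function and the constant names are hypothetical helpers introduced for illustration. Only the first line of the gene sequence and its final six bases are used as input.

```python
# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence (60 bases, 20 codons).
first_line = "ATGGCAACGA TTAAAGATGT GGCCAAACGC GCTGGCGTTT CCACCACCAC CGTTTCGCAC"
print(translate(first_line))  # MATIKDVAKRAGVSTTTVSH

# Last six bases of the gene sequence: one codon followed by the stop codon.
print(translate("CGCTAA"))  # R
```

The first output matches the first 20 residues of the protein sequence (MATIKDVAKR AGVSTTTVSH), and the second matches its final residue.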