Gene Spro_4083 details


Gene Information

Locus tag: Spro_4083
Symbol:
ID: 5606980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 4538281
End bp: 4539510
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 38%
IMG OID: 640939644
Product: restriction modification system DNA specificity subunit
Protein accession: YP_001480306
Protein GI: 157372317
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
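
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive start/end coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4538281, 4539510)
print(length)                     # 1230
print(protein_length_aa(length))  # 409
```

Under those assumptions, the computed values agree with the Gene Length and Protein Length fields of this record.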


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGTGG ATAACAAGGT GCCAGAGATT CGGTTTAAGG GGTTTAGTGA GGCGTGGGTT 
GATAGTGATC TGAAAAGTTT TAGTGATTCG TTTAGTTATG GTTTAAACGC TTCTGCAGTT
AAATTCGATG GTGTTAATAA GTACCTTAGA ATTACAGATA TCGATGAGAA AAGTAGGAAT
TTTGACTATT GTCAGTTAAC CTCACCAGAT GCCGACTTAT CAAAATCTGA TAATTATTTG
TTAAAGAAAG GTGATATTTT ATTTGCTAGA ACTGGTGCTA GTGTCGGCAA GACTTATATT
TACAACGAGC AAGACGGCAA AGTCTATTTT GCTGGATTCC TAATTCGAGC AAGCATCAAT
CATGAAGCAA GTGCGCAGTT TATTTTTCAA AACACGCAAA CTCATGAGTA TGCAAGATTT
GTTGCTACTA CCTCTCAAAG GTCAGGGCAA CCAGGAATAA ATGCAAAAGA ATATGGTGAA
TACCGACTAT TTTCACCTAC TGAACCAGAA CAAACCCAAA TCGGCAACTA CTTCCAAAAA
CTCGACACGC TAATCAATCA ACACCAACAA AAGCATGACA AGCTCAGCAG CATTAAAAAA
GCTATGCTGG AAAAGATGTT TCCCAAACAA GGTGAAACCA TTCCAGAAAT CCGCTTTAAA
GGGTTTAGTG GGGAGTGGGA GGAAAAAAGC GTTGGCCAGT TTGGTGAGAT TATAACGGGC
TCTACACCGT CGACACAGAA CTTAATTAAT TATTCTAATG ATGGAATACC TTGGGTTACA
CCAACTGATA TTTCGAGAAA CGTTACATTT AATACAGCTA AAAGGTTATC ACAAACAGGC
TGTAAAGTAG CTCGCATAGT CCCAAAAGAT ACAATTTTGG TTACATGTAT CGCCAGCATA
GGAAAAAATA CAATCCTAGG AACCCAAGGT GGTTTTAACC AGCAAATAAA TGGCATTATT
CCAAACCAAA AAGATAACCA TCCATATTTT ATATTTTCTG CAAGCATATT GTGGTCAGAA
AAATTAAAGC GTTCAGCTGC TTCTGGAACC ATGCAAATTG TAAATAAAAC TGAGTTTTCC
GAATTAAAAA CCCGTGCACC AAAAAAAGAA GAACAAACCG CCATCGGCAA CTACTTCCAA
AAACTCGACA GTCTGATCGA CCAACACCAA CAACAGATCA CCAAACTCAA TAACATCAAG
CAGGCCTGCT TAAGTAAAAT GTTTGTCTAA
 
Protein sequence
MSVDNKVPEI RFKGFSEAWV DSDLKSFSDS FSYGLNASAV KFDGVNKYLR ITDIDEKSRN 
FDYCQLTSPD ADLSKSDNYL LKKGDILFAR TGASVGKTYI YNEQDGKVYF AGFLIRASIN
HEASAQFIFQ NTQTHEYARF VATTSQRSGQ PGINAKEYGE YRLFSPTEPE QTQIGNYFQK
LDTLINQHQQ KHDKLSSIKK AMLEKMFPKQ GETIPEIRFK GFSGEWEEKS VGQFGEIITG
STPSTQNLIN YSNDGIPWVT PTDISRNVTF NTAKRLSQTG CKVARIVPKD TILVTCIASI
GKNTILGTQG GFNQQINGII PNQKDNHPYF IFSASILWSE KLKRSAASGT MQIVNKTEFS
ELKTRAPKKE EQTAIGNYFQ KLDSLIDQHQ QQITKLNNIK QACLSKMFV
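
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The sketch below is a hedged illustration, not the database's own pipeline. It builds the codon table from the usual TCAG ordering (translation table 11 assigns the same residues to codons as the standard code and differs only in its permitted start codons), strips the spacing used in the 10-base blocks above, and also computes GC content. It does not special-case alternative start codons, which is sufficient here because this gene begins with ATG. The names `clean`, `translate`, and `gc_percent` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, dropping one terminal stop codon if present."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAGTGTGG ATAACAAGGT GCCAGAGATT"))  # MSVDNKVPEI
# Last 30 bases of the gene sequence above, including the TAA stop codon.
print(translate("CAGGCCTGCT TAAGTAAAAT GTTTGTCTAA"))  # QACLSKMFV
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence gives a quick consistency check for the record, and `gc_percent` can be compared against the reported GC content in the same way.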