Gene Spro_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_1039 
Symbol 
ID5606585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp1139974 
End bp1143225 
Gene Length3252 bp 
Protein Length1083 aa 
Translation table11 
GC content59% 
IMG OID640936558 
Productexonuclease SbcC 
Protein accessionYP_001477271 
Protein GI157369282 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00618] exonuclease SbcC 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.772605 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.479303 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCC TCAGCCTGAG ATTAAAAAAC CTTAACTCCC TGCAGGGTGA ATGGAAGATC 
GATTTTACCG CCGAGCCCTT TGCCAGTAAT GGCCTATTCG CCATCACCGG GCCTACCGGC
GCGGGCAAAA CCACCCTGCT GGACGCCATT TGTCTGGCGT TGTACCACCA GACGCCGCGC
CTTAACGTCA CGCCGAGCCA GAACGAGTTG ATGACGCGCC ACACCGCCGA ATCGCTGGCG
GAGGTCGAAT TTGAAGTTAA AGGGGTCGGC TATCGCGCCT TCTGGAGCCA GCGCCGCGCC
AAAAATGCCC CGGACGGCAA CCTGCAGGCA CCGAAGGTGG AGCTGGCGCT GCGTGAAGAC
GGAAAAATTC TGGCGGACAA GGTACGCGAC AAGCTGGATA TGACCGCCGC GATCACCGGG
TTGGACTTCG GCCGCTTCAC CAAATCGATG ATGTTATCCC AGGGGCAGTT CGCCGCCTTC
CTCAATGCCG ATGCCAACGA TCGTGCCGAA CTGCTGGAGG AACTGACCGG CACCGAGATC
TACGGCCAGC TTTCCGAGCG GGTTTTCGAA CAGCATAAGC AGGCCAGGGC CGATTTGGAT
GCGCTGCATC AACGCGCCAG CGGCATTGAA CTGCTCAATG ACGAACAGCG TCAGCAGCTT
GAAAACCAAT TAATTGAGCT GGGAACGCAG GAAAATACCC TTGGCGAACA GGCCCGCCAG
CAGCAGCAGG CGCTGAACTG GTTACAGCAG TGGCAACAAA CACAACAAAA ACAACAGGAA
TACCAACGTC AACTGAACGC GGCCCAGCAG GAACAGGCGG CGGCCGAACC GCAACTGCAG
CAGTTGGCGC GCAGCGAGCC GGCAGAAAAA CTGCGGCCGT TGCTGAATGA ACGTAACCGC
TGCCGCAACG ATCAACAGGC GCTGCAACAG CAGATAAGCC AATTAACGCA ACAGCAAGAG
CAACAGCTAG CCCAGATGGC GCCATTGCAA CAGGCGGTGG AGCAGGCCCG TGCGCAACTG
CAGGCGCATG TTGAACATCA GCAGACGCAA CAGCGGCTGA TCGACGAGCA AGTCGTCCCG
CTCGATCACC AGATTGCCCA GTTACAGAAA ACGCAGGCCG AGCTGCAACA GGCGTGCGAT
AGCGCTGCAC AGCAGTATGC CCAGCAAAAA ACAGCGCTGG AACAACTCGA GGTTCAGCGA
AGCCAGTTGA CCGCACAGGC CGGGCAGCAT CAGGAAAAAC TTAACGGCCT CACCGCCGCG
CTGACGGCGC AACAGCAACA ACAAAATACG CTGGAAGCCC AGTCGCCCCT GATGGTACTG
CGCCAGCGCC AGACCGAACT GGCCGAGCTG CGCCCGGTAC GCCAGCAGTT ATTTACCCTG
TCTTCCCTGT TTAGCCAGGT CAGCGAACGG CTGGAGCAGC AGCGCAGGGA GTTTGCCGCC
CACCAGACGC AACTCGGCGA ACATGAAAAA CAGCTCGAAG AGACGCGGCA GCAGTATAAA
CAGCAAAAAA ACCACCAGGC AGACGTGGAA AAAGCCCTCG AGCTGGAACA GCGCATTGTC
AGCCTGGAAG CCGAGCGCGC CCGCCTGCAA ACCGGTGAAC CCTGTCCTCT GTGCGGCTCC
TGCAGCCACC CGGCCATTGA GCAGTATCAG GCGGTAAAAC CTTCTGAAAC CGCATTGCGC
CTGGCCCAGA TGCGCGCCAC AACCGAGGCT TTGTATACTC AGGGCACCGA ACTGCGCACC
CGTTGCGAAA GCCTGAAAGA ACAGCAACAG CGCCAACAGC AGCTCATCAC GCAAGACGAG
CAACAGTTGG CTACTCACCT GCAGCAGTGG CAAACCCTCA GTACACCGCT GGCATTTGAC
TTTACCCTGC AAGACGCCGA CCGCCTGAGC CAATGGCTGA ACGCCTGCGA CAACGAAGAA
CGTCAGGGTC AGGCCACCCT GCTGCAACAC GAACAGGCAT CGCACGCCGT GCAGCAGGCC
AAAGACCAAT TGGCCGCACT GCAAACGCAA CAGCAGCAGG CGCACCAAGA GAAAGCGCGG
CTGGACGAGC GTTACATCCT GCTGGAGAAA AACCACGCCG AAGCCCAGTT GCAACAGCAG
CGGCTGCAGC AGCAGTGGCA GGACGGCGAA CGAGTCCTTG GCGAGCAACG TTCGCAGCGT
CTGGCCCTGT TTGGTGAACA GCAAATCACC CAGGTGCGCG AACAGTTACG CGCCCGGCAA
ACGGCCTGCG AGCAGGCCAG CCAGCAGGCG GCCGAACAGT GGCAGAAGGC ACAAGAGCAA
CGCGATCGAC TGAGCGGCCA GTTGGCCGGG TTACAACAGC AACTGCTGCA GCAGACCGAG
CGGCTGCAAC AGGCGGAACA AGTGTGGCTA CAGGCGTTGG CCGCCAGTGA ATTCAGCGAT
GAAACCGCCT TCAGCGCCGC GCTGTTGGAT GACGACCAGC GCCGACAGCT GCAACAGCGC
AAAGAGCAGC TGCAACAGCG CCAGGTTGAA GCCAGCGCCC TGCTGGCACA GGCCGTTGAC
AGCCTGAAAC AGCAGTTGCA ACAGCGTCCC GAAGGGCTGG ATGAGCATCA GACCGATCCC
CAGGCCCTGA GCCAGAGTCT GGCAGCGCTG GCGCAGCAGT TAAAAACCTT GCAGCTACGC
CAGGGCGAAG TGCGCAATCA ATTAGAAAGC GACACTGCCC GCCGCATCAA TCAGCAGTCG
CTGTTTGAGC AAATCAGCCA GAGCCAGAGC CAGTACGACG ACTGGAGCTA TCTCAATCAG
CTTATCGGCT CCAAAGAAGG GGATAAATTC CGTAAGTTTG CCCAGGGGCT GACGCTCGAT
CACCTGGTGT ATCTGGCCAA TAACCAGTTG GGGCGGCTGC ACGGGCGCTA TTTGTTGCAA
CGAAAAACCA GCGATGCGCT GGAGCTACAG GTAGTGGATA CCTGGCAGGC GGACGCGCTG
CGCGATACCC GCACCCTGTC CGGCGGTGAA AGCTTCCTGG TGAGCCTGGC GCTGGCACTG
GCGCTGTCTG ACCTGGTCAG CCACAAGACC AGTATCGACT CGCTGTTCCT CGACGAAGGC
TTTGGCACGC TGGATGCAGA AACCCTGGAT ACCGCGCTGG ATGCGCTCGA CAGCCTGAAC
GCCTCCGGGA AAACCATTGG CGTGATCAGC CACGTGGAAG CCATGAAAGA ACGGATCCCG
GTACAAATTA AGGTGAAAAA GGTCAACGGG CTCGGCATCA GCCGGCTGGA GCCGCAGTTC
CGGCTGGAAT AA
 
Protein sequence
MKILSLRLKN LNSLQGEWKI DFTAEPFASN GLFAITGPTG AGKTTLLDAI CLALYHQTPR 
LNVTPSQNEL MTRHTAESLA EVEFEVKGVG YRAFWSQRRA KNAPDGNLQA PKVELALRED
GKILADKVRD KLDMTAAITG LDFGRFTKSM MLSQGQFAAF LNADANDRAE LLEELTGTEI
YGQLSERVFE QHKQARADLD ALHQRASGIE LLNDEQRQQL ENQLIELGTQ ENTLGEQARQ
QQQALNWLQQ WQQTQQKQQE YQRQLNAAQQ EQAAAEPQLQ QLARSEPAEK LRPLLNERNR
CRNDQQALQQ QISQLTQQQE QQLAQMAPLQ QAVEQARAQL QAHVEHQQTQ QRLIDEQVVP
LDHQIAQLQK TQAELQQACD SAAQQYAQQK TALEQLEVQR SQLTAQAGQH QEKLNGLTAA
LTAQQQQQNT LEAQSPLMVL RQRQTELAEL RPVRQQLFTL SSLFSQVSER LEQQRREFAA
HQTQLGEHEK QLEETRQQYK QQKNHQADVE KALELEQRIV SLEAERARLQ TGEPCPLCGS
CSHPAIEQYQ AVKPSETALR LAQMRATTEA LYTQGTELRT RCESLKEQQQ RQQQLITQDE
QQLATHLQQW QTLSTPLAFD FTLQDADRLS QWLNACDNEE RQGQATLLQH EQASHAVQQA
KDQLAALQTQ QQQAHQEKAR LDERYILLEK NHAEAQLQQQ RLQQQWQDGE RVLGEQRSQR
LALFGEQQIT QVREQLRARQ TACEQASQQA AEQWQKAQEQ RDRLSGQLAG LQQQLLQQTE
RLQQAEQVWL QALAASEFSD ETAFSAALLD DDQRRQLQQR KEQLQQRQVE ASALLAQAVD
SLKQQLQQRP EGLDEHQTDP QALSQSLAAL AQQLKTLQLR QGEVRNQLES DTARRINQQS
LFEQISQSQS QYDDWSYLNQ LIGSKEGDKF RKFAQGLTLD HLVYLANNQL GRLHGRYLLQ
RKTSDALELQ VVDTWQADAL RDTRTLSGGE SFLVSLALAL ALSDLVSHKT SIDSLFLDEG
FGTLDAETLD TALDALDSLN ASGKTIGVIS HVEAMKERIP VQIKVKKVNG LGISRLEPQF
RLE