Gene Spro_4231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4231 
Symbol 
ID5604526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp4691179 
End bp4694469 
Gene Length3291 bp 
Protein Length1096 aa 
Translation table11 
GC content59% 
IMG OID640939791 
ProductTPR repeat-containing protein 
Protein accessionYP_001480453 
Protein GI157372464 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGGGA CAGTCAAGGT TTGGCAAGAA CAGGTTGAAT TACCCACCTA CGGTACCGGT 
GAGCAGGACT CGCACCCGAT GTTCCTGGAA AACCGTGTCT ATCAGGGATC TTCCGGCGCG
GTTTACCCTT ATGGCGTGAT TGATACCCTA AGCGGTGAGA AAACCTTGCG CCGTTATCAG
GCTGTCTATC TGGAAAACGA CTACCTGCGC GTGATGCTGT TACCGGAGCT GGGCGGGCGT
ATTCATCGGG CCTACGACAA AGTGCAGCAG CGTGATTTTG TCTACCACAA CGAGGTGGTC
AAGCCGGCGT TGGTGGGGCT GCTGGGGCCG TGGATCTCCG GGGGCATCGA GTTTAACTGG
CCACAGCACC ACCGGCCGAC CACTTATATG CCGGTGGATT GCCAGATTCA ACAGCATGAA
AACGGAGCGC AGACCGTCTG GCTGGGGGAA GTGGAACCGA TGCGCGGGCT GCAGGTGATG
GCCGGTTTCA CACTGTATCC CGACCGCGCG CTGATTGAAA TCAGCGCCAG GATCTTTAAC
ACCAACCCGA CACCGCGCCA TTTCCTCTGG TGGGCCAACC CGGCAGTGAA GGGCGGGGAC
GATCACCAGA GCGTGTTCCC ACCGGATGTG ACCGCCGTTT TTGACCACGG CAAGCGCGAT
GTCTCCTCGT TCCCCATCGC TACTGGCACC TACTACAAGG TGGATTACTC GGCCGGTGTG
GATATTTCAC GCTACAAGAA TATCCCGGTA CCGACGTCTT ATATGGCGTA TAAGTCGGAC
TATGACTTCG TCGGGGCCTA TAGCCACGAT GAGCAGGGCG GTTTGCTGCA TATCGCCGAT
CATCATGTAT CACCCGGCAA GAAACAATGG AGCTGGGGCC ACGGCGAATT TGGTCAGGCC
TGGGATCGCA ATCTGACCGA CAACAATGGC CCGTACATCG AGCTGATGAC CGGGGTCTAC
ACCGATAATC AGCCGGATTT CACCTGGCTC GATGCCTATG AAGAAAAATG CTTCGTGCAG
AATTTCCTGC CTTACAACAC CCTGGGCATG GTGCAGAACG CCAACACCCG GGCGGCGCTA
AAGTTGGAAA GCGACGGACG ACAGCTGGTG TGGGGGCTGT ATGCGGTCGC GCCGCTGGCG
CAGCATCGGC TGGTGGTCCG CAGCGATAGC GACCAACAGT TATTGCTCGA TCGTCGTATC
GATCTCAGCC CCGGAGCGGC CTTGATGGAG ACCATGACCG GTGATTTCTC TGGTCGTCTG
ACGATTGAAT TGTTGGACTC GCAGGGGGGC TGCGTACTCA GCTATCGTCA GCATCAGGCC
GATCCGGCGG CAGAACTGCC GCAGCCGGCC AAAGCACCGC CGCTGGCCGG GCAAATTGCC
AGTGCCGATG AGGCCTGGTT TATCGGCCAG CATCTGGAGC AATACCATCA CGCCAGCCGT
TCGGCGTTCG ATTACTACCA GAGGGGTTTG GCGCTGGATC CGCTCGATTA TCGCTGCAAT
CTGGCCCTGG CGACGCTGGA ATATAACCGC GCCAATTTTG CACGCGCTAT TGCCTATGCT
GACGATGCGC TGGCACGTGC TCACCACCTT AACCGTAACC CACAGTGTGG CTTGGCCAGT
CTGATCCGCG GCTGCGCGCA TGAACAATTG GGTGACGACA GCGCGGCCTA TGAAGATTTC
TACCGCGCCA TCTGGAGCGG CAATGGCAAA CCGGGCGGCT TCTACGGGCT GGCTCGGGTC
GCGGTGCGGC GCGGGAATTA TTCGCAGGCG CTGGAATTTT GCGAAAGCAG CCTGAGCGTT
AATGCCAGCC ATTATCCGTT GATCGCTCTC AAGGCATGGT TGTTGCAACG CCTGGGACAG
GGGGAACAGA GCCTGGTCTA TATTGCGCAG CAACTGACTG CCCGCCCGTT GCATTACAGC
CTGTATTACC TGCGTTATGC GCAAACCCGT GAGGCTGCCG ATTTACAACG CTTGCGTGCG
GTGACCGGTT GCCGTGGCAT TAATGCGTTG ACCATGGCCA ACCAATTCTG TGAATGGGGG
GCCAAACCGC AGGCGATTGA ACTGCTGACG CTGCTGGACA GCCAGGAAAG CCTGCCGCTT
TATCTGTTGG CCAGCTTGCG CAAGGGCGAG GCGGGCGATA GCGAATACCA GCAGTTGCTG
GCGCAGGCGC GTGACAGCTT TAGCCGACAG GTACGCTTCC CCAATACCCT GAACGAAGTA
CAGATGCTGA GCCAATTGCC GGAGTGCGAT TTTGCGCAGT ACCTGCTGGG GTGTTTTCAT
TACAGCAAAC GCAACTACTC GCAGGCGGTG GCACTGTGGC AGCGCTGCGT CGAGCGGCAG
CCGGGGTTTG CCGATGCCTG GCGCAACCTG GGGATCTACA GCTTCAACAA GCTCCAGCAG
CATGACGTTG CGCTGGAATA TCTGCAACGA GCATTCAGGC TACAGCCCGA CGATGCCCGG
TTATTGTTCG AGCTGGATCA ACTGAATAAG CATCTGCGAG TGGCTCCTGA ACAACGGCTG
GCCTTGCTGG AACAACATTT GGCGGTGGTG GCCAGGCGTG ATGACCTCAC AGCCGAACTG
CTGAGCCTGT ACAACCAGTG TGGCCGGTTG AGTGAGGCGC AACAGACTCT GCAACAGCGG
CAGTTCCACC CGTGGGAAGG TGGTGAAGGC AAGGTCACCG GCCAGTACCT GATCAACCTG
CAGCGCTTGG CGTTTCAGGC GCTACAGCAG GGGGATCCGC AGCAGGCGCA TGAGTTGCTG
CAGTCGGCGT TGCACTACCC GCACAATCTG GGGGAAGGCC GGCTGGCCGG GCAAAGTGAC
AACGATCTCT ACTATTGGCT CGGCATCAGT GCCGCCCGGC AAGGTGACCT TGACGCTGCC
GCCGGCTACT GGCAGCAGGC CTGTGCCGGG CAGGGTGACC TGACGCAAAG CCGCTACTAC
AACGACCAAC CGGTGGACTA TCTGTTCTAT CGCGGTATGG CACTCAAACA ACTGGGGCAG
TCGGCACAGG CCGAACAACA GTTCATGCAA ATGCAGCAAT GGGTACGGCA ACAGTCAGAG
CTGGCGCCGG GTGCCGATTT CTTTGCCGTT TCGCTGCCGG ATTTGATGGC GTTGGACAAC
GACCTGAACC AGGCGCATCA ACAGCACTGC CTGCTGGTGA CTGCTTTGGC GCTGCTGGGG
TTAGGGCAGT TAACGGCGGC GCAGCAGACA TTGGGCGAGT TGCTGGTGGT AAATCCTGCG
CATGACAAAG CGCGTTTATT CAGCGTACTG GCCGAGGTTT TAGCCAACTG A
 
Protein sequence
MHGTVKVWQE QVELPTYGTG EQDSHPMFLE NRVYQGSSGA VYPYGVIDTL SGEKTLRRYQ 
AVYLENDYLR VMLLPELGGR IHRAYDKVQQ RDFVYHNEVV KPALVGLLGP WISGGIEFNW
PQHHRPTTYM PVDCQIQQHE NGAQTVWLGE VEPMRGLQVM AGFTLYPDRA LIEISARIFN
TNPTPRHFLW WANPAVKGGD DHQSVFPPDV TAVFDHGKRD VSSFPIATGT YYKVDYSAGV
DISRYKNIPV PTSYMAYKSD YDFVGAYSHD EQGGLLHIAD HHVSPGKKQW SWGHGEFGQA
WDRNLTDNNG PYIELMTGVY TDNQPDFTWL DAYEEKCFVQ NFLPYNTLGM VQNANTRAAL
KLESDGRQLV WGLYAVAPLA QHRLVVRSDS DQQLLLDRRI DLSPGAALME TMTGDFSGRL
TIELLDSQGG CVLSYRQHQA DPAAELPQPA KAPPLAGQIA SADEAWFIGQ HLEQYHHASR
SAFDYYQRGL ALDPLDYRCN LALATLEYNR ANFARAIAYA DDALARAHHL NRNPQCGLAS
LIRGCAHEQL GDDSAAYEDF YRAIWSGNGK PGGFYGLARV AVRRGNYSQA LEFCESSLSV
NASHYPLIAL KAWLLQRLGQ GEQSLVYIAQ QLTARPLHYS LYYLRYAQTR EAADLQRLRA
VTGCRGINAL TMANQFCEWG AKPQAIELLT LLDSQESLPL YLLASLRKGE AGDSEYQQLL
AQARDSFSRQ VRFPNTLNEV QMLSQLPECD FAQYLLGCFH YSKRNYSQAV ALWQRCVERQ
PGFADAWRNL GIYSFNKLQQ HDVALEYLQR AFRLQPDDAR LLFELDQLNK HLRVAPEQRL
ALLEQHLAVV ARRDDLTAEL LSLYNQCGRL SEAQQTLQQR QFHPWEGGEG KVTGQYLINL
QRLAFQALQQ GDPQQAHELL QSALHYPHNL GEGRLAGQSD NDLYYWLGIS AARQGDLDAA
AGYWQQACAG QGDLTQSRYY NDQPVDYLFY RGMALKQLGQ SAQAEQQFMQ MQQWVRQQSE
LAPGADFFAV SLPDLMALDN DLNQAHQQHC LLVTALALLG LGQLTAAQQT LGELLVVNPA
HDKARLFSVL AEVLAN