Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_4191 |
Symbol | |
ID | 5604966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | - |
Start bp | 4643249 |
End bp | 4645783 |
Gene Length | 2535 bp |
Protein Length | 844 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640939751 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001480413 |
Protein GI | 157372424 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATTA TTAAGCCGCT GCGTCTGAGC GTATTAAATC GACCTTTCCG CCAGCAGGGT AAAAATTATC TCGGCGTTTC TGTGATGGCA CTGCTGGATA TGGGCACAAC GCCCCAGCTG CGTCCGGAAG TGGAATTATG GCAACTGGCG GCGGAAGAAC TGCAAACCAG TGGCGGCGTC ATTGATGTGG CCATTCCGAA AGTCCGGGCT GAATTTCTGG CGACGGGTAA TGCTTATACC CGTCATCAGA AGGAAAAAAA CAGCTGCGCA GTGCGCATTG ACGTGGGCAG CCTCAGCAAG ACCCTGGTGG CTTTTGGCGA TCGTTTCTGG TCTGGTTCGC AGCCGACCCC GCCACGCAAT TTTGAGTCGA TGCGCCTTGA CTGGAGCCGT GCTTTCGGCG GCGCAGGATA CGAAGAGAAC CCTCATGGTA TTGGCGCCGT GGAAGAGCAG CACAACGGCG CGGCTTTTCG TCGCCTGCCA AATATCGAAC CGCTACATCA ACGTATGACC TCACCGCGGC AGCAGCCCGA GCCGGTAAGT TTTGGGCCGC TGGACATGAC CTGGCCGCGT CGTTTTAAGC GCATGGGCAA AGCCTATGAC GCCAACTGGC TGAAAAATGA CTTTCCAGGC CTGGCCCGCG ACGCCGACTG GCGTGTATTC AATGCCGCCA GCCCGGACCA ATGGTGGCCG GAACAAGATG AGCTGCCGCC TGAGGCCGAG TGGCGCATTT GGAATATGCA TCCGGAAAAA CCGCTACAGT CGGGCAAACT GCCGCCATGG CGGGCGCGCT GTTTTATCAA TCGTCAGCGT GGGGATGAAA CGCTGTTTGA AGAGATTAGG TTGCGTGCCA CCACGGTCTG GTTCTTCCCG CATCGGCAAC AGATGATGCT GATCTGGCAG GGCAGTGGCC GCATTAACGA GGATGATGCG GCGGATGTGC TGCAACTGAT GCCGGCGCTG GAGAAAAACG GCGCCGCTCG TTCGGCCAAT CACTATCGCA AGGTGTTGGC CCAGCGGCTG GATAAAGGGA AGGGGGCGCT GTTTGCCTTC CGTGAAAAAG ATTTGTTGCC GGAAGAGCTT ATCGGTCCCT GGATTGACAG CGAAATGCAG CAAACAGAAA GCCCGATGCA AAACAATATG CAAAACCGGG TAAGTAAAAT ACGCGAACAG CACCGCGCCA GGCTTGAGGC TGAAGGGCGG GATGTTAACG AGCTACTGGC GGATATGTCG CAGCCAGAGA TGCCGAGGCT GGATGAACTG CCGGAGTTTG TCGAACGACT GGAGCGCCAG GCTGAGGAGA TGAAAGCGCA GGCTGAGGCG CGCCAGGCTG AAGTGGAAGC CCGCCAGGGT GCCAAACAGG ACTCACGGCC GCGTGGCCCG GAATCAATGC ACCGTATGCA GGAGATGCTG CATCAACATG CGGACAGTAT GACGGCGAAG AAATTGGCGC AGAGCCGCGA ATCGCTGCAC CAGATGTATC TGATGGCAGC CCAGCATCAA CCGCCGGCGC AACGCATGAC GGGGGATATT GCCCGGATTA TTCGCCAGCG CGCTTCGAAC ACCATGGCGT GCGGAGGGGA CTTTAGCGAG TTGGATTTCA CCGGCGCGAA CCTCTCCGGC ATGGATTTTC GCGGCGCTAA TTTCCGTAAG GCGTTGCTGG AAAGCGCCGA TCTCAGCGGT TGTCAGTTGG ACGGCGCTGA TTTTAGCGAG GCGATGCTGG CGCGAACCGA TTTGCGCAAC AGTTCGCTGC GTGAATGCAA TCTGACCAAA GCCAGCCTGG CGTTGGCCCA GTGCCGACAG ACCGATTTTA GCGGCGCCAA CCTGACGGAA ACCCAACTGG AAGACGCGCT GTTTGAAGAC TGTGACTTCA GCCGTGCCAC GCTAAAAACG CTATTGCTGC GCCAGGTCGG TATTAGCCAT TGCCGTTTCC ACCGTGCGGA GCTGGAGGAG TGTATCGTTA TGAATTTAAC GCTGCCGCAG CTCGATTTCA GCGAGGCCAG GCTGCGTAAG ACCGTCTTCC AACAGTGCGA GTTGCAGGCT GCGGTCTTCA ACGGTGCGTG GCTTGAGAGC TGTAACTGGG TGGAGAGCAA ACTGCCGTAC GCCCAGTTTA AAGCGGCCAG CCTGCTGACC TGCGCAGCCG TAATGGAGAG TGACTTAAGC GGCGCGGACT TTAGCGAAGC AACGCTGAAA GAAAGTAATC TGCGCCAGGC TTTGCTTACG CAGGCGAACT TTACGTTGGC GAAGGTGGAG AACAGCGATC TCAGCGAGGC GGACTGTCAG AGGGCGAATT TCACCCGCGC CAACCTGGTC GGGAGCCTGT TGATCCGCAC TGATTTCCGT CAGGTCAATT TCACCGGAGC CAACCTGATG GGCGCACTGA TGCAGAAAAC CCAACTGGGC GGCGCGGACT TTACCGCTGC CAATCTGTTC CGCGCCGATC TGTCACAATC CTTTATTAAT ATGGAGACGC GGCTGGACAA CGCCTATACC AGCCGGGTGA AGACTCTGCC GAAGCGGGAT GAGGAACTGT CATGA
|
Protein sequence | MKIIKPLRLS VLNRPFRQQG KNYLGVSVMA LLDMGTTPQL RPEVELWQLA AEELQTSGGV IDVAIPKVRA EFLATGNAYT RHQKEKNSCA VRIDVGSLSK TLVAFGDRFW SGSQPTPPRN FESMRLDWSR AFGGAGYEEN PHGIGAVEEQ HNGAAFRRLP NIEPLHQRMT SPRQQPEPVS FGPLDMTWPR RFKRMGKAYD ANWLKNDFPG LARDADWRVF NAASPDQWWP EQDELPPEAE WRIWNMHPEK PLQSGKLPPW RARCFINRQR GDETLFEEIR LRATTVWFFP HRQQMMLIWQ GSGRINEDDA ADVLQLMPAL EKNGAARSAN HYRKVLAQRL DKGKGALFAF REKDLLPEEL IGPWIDSEMQ QTESPMQNNM QNRVSKIREQ HRARLEAEGR DVNELLADMS QPEMPRLDEL PEFVERLERQ AEEMKAQAEA RQAEVEARQG AKQDSRPRGP ESMHRMQEML HQHADSMTAK KLAQSRESLH QMYLMAAQHQ PPAQRMTGDI ARIIRQRASN TMACGGDFSE LDFTGANLSG MDFRGANFRK ALLESADLSG CQLDGADFSE AMLARTDLRN SSLRECNLTK ASLALAQCRQ TDFSGANLTE TQLEDALFED CDFSRATLKT LLLRQVGISH CRFHRAELEE CIVMNLTLPQ LDFSEARLRK TVFQQCELQA AVFNGAWLES CNWVESKLPY AQFKAASLLT CAAVMESDLS GADFSEATLK ESNLRQALLT QANFTLAKVE NSDLSEADCQ RANFTRANLV GSLLIRTDFR QVNFTGANLM GALMQKTQLG GADFTAANLF RADLSQSFIN METRLDNAYT SRVKTLPKRD EELS
|
| |