Gene Spro_4737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4737 
Symbol 
ID5603861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp5234327 
End bp5237428 
Gene Length3102 bp 
Protein Length1033 aa 
Translation table11 
GC content60% 
IMG OID640940303 
Productouter membrane autotransporter 
Protein accessionYP_001480958 
Protein GI157372969 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain
[TIGR02601] autotransporter-associated beta strand repeat 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAAG CTACAGGAAA GAATAAACCG GCATTGGCGC CGACATGGAA ACTGAACGCC 
TTGCTGTGCG CGCTGTTGGC GGCCGGTGGC GTACAGGCTG CGCCCTATAC GGAAGGCGGT
AAGTCAGGCG ATCCGGCAAG CTGGCGCAGT AATGAATTCA ACGCCAACTG GGGGTTGGGG
GCCATTCACG CTGACGAGGC CTATGCCGCG GGTTATACCG GTAAAGGGCA AAAGGTGGGC
ATCTTTGATA CCCCGGTGAA TAAACACCCT GAATTCGCCG GTGACGGCAA ACTGGTGAAC
GTGGTGACCC AGGGGAACCG CGCCTATACC GATCCGCATC GGAAGGGGAT TAAGGCCGGC
GATCGTTTTT ACTTCGACGG CACCTTCCAT TTCTATAATG GCGATGGGGT GACGCTGGTT
AACCACGGCG TACACGTGGC CGGCATCAGC GGGGCCAATC GTGACGGCAT CGGTATGCAT
GGCGTGGCCT TTGATTCACA GGTCATCAGC GTTGATAACG ACAATGATGG CCCGGCCTAT
GGCGAATTCC TCGGGCTGGA TGGTAGCGTA ACCAACGCCG GTTGGCAGGC GATGATCAAC
AATGGCGTGC GGGTTATCAA TAACAGTTGG GGCGTCAGTA TTCCCGATTT CTTATCGGAT
AATGGCAAAA AGCCAGACGC GCTGCATTTC GAACTGAAAG ACGCACAGGA ACAGTTTGAT
CAGGTGAAAC CGCTGCTGGG CAGCCTGGCG GGAGCGGGGT ATCAGGGCGC GATCGATGCG
GCACGCAAAA ACATTTTGGT CTTGTTCGCC GCCGGTAACG ACGGCAACTA CACCCAACCT
GACGTGATCA GCGGTCTGGC CTATTTTGTG CCGGATATTG CGCCTAACTG GCTGTCGGTT
GCCAGCGTCG CACAGGATGA TGCCTCCACC AACAGCGTGC CTTATACCAT CAGCAGCTTC
TCTTCACGCT GCGGCTATAC CGCCAGTTTC TGCGTCTCGT CACCGGGCAG TAAAATCTAC
AGCACCGTGG CCAACGGTTC CGATCCGAAT CAACTGGTCT CGGACTATGG CAATAAAAAT
GGGACCTCAA TGGCGACGCC GCACGTGACC GGCGCGGTTG CCGTGTTACT GCAGCGTTTC
CCGTACATGA CGTCGGCGCA AATTGCCGAC GTGTTGAAGA CCACCGCCAC CGATATGGGC
GCACCGGGGA TCGATGCGCT GTACGGCTGG GGGATGATCA ATCTGGGCAA GGCGATTAAC
GGCCCGGGCA TGTTCTATAC CGTCGAAGAT ATTCCTGAAG AGTTCCGCAT TCCGGATCCG
ACAGGCGTGG CTTACGGCTC GTCACAGTTT GTCGCCAACA TTCCCGGCTT CGGGGCGCAA
ATCGATCAAG GCACGCTGCA GGCTCGTATT TGTGATGACT ACCACTGTTC GGTGGAGGTC
TATTCCAATG ACATCTACGG CCATGGCGGC CTGACCAAGG AAGGCCAGGG GGCGCTGGTA
CTGACCGGCA CCAATACCTA TTCCGGCCCG ACCTGGGTAA ATACCGGCCT GCTGGCGGTT
AACGGTTCGG TCACATCAGA CGTCACAGTG CAAAACAGCG GCATGCTGAG TGGTTCCGGT
ACTGTCGGTT CACTGACCGC TCGCAGCGGC GGTACCGTGG CACCGGGCAA CTCGATCGGC
ACGCTGAACG TGACGCGTAA CGTCAGCTTC GAGCCGGGTT CCCGTTATGC GGTGGAAGTG
GCGCCCAACG GCCAGAGCGA TCGGATCCAA AGCAGCGGAT CGGCGACGAT CGGTGGCGGT
GAAGTCGCTG TTTCGCTGGA GAACAGCACC AATCTGCTGT CGCAAAGCGA AGTCCGCAGC
CTGCTGGGCC AGCAGTACAA CATTCTGACC GCTCAGCAGG GTATCAGCGG GCAGTTTGAT
TCGGTGGCTC CAAACTACCT GTTCCTGGGC ACCGGGCTGA CTTACCAACC CAATCAGGTC
ACGCTGAACG TTGGCCGTAA CGACACCTCC TTTGCCAGCG TGGCAGCAAC TCAGAACGAG
CGTGCGGCAG CGGCAGCGGC GGATGCCCTG GCGGCGGGCA ACCCGGTGTA CGAGAGCATC
CTCAATGCTG GCTCTACAGG GGAAGCGCGT CAGGCGTTCC GTCAGTTGTC CGGGCAGATC
CACGCGGATA TCGCCTCGGC ACAGGTTAAC GACAGCCGTT ATCTGCGTGA CGCGTTGAAC
GGCCGTCTGC GACAGGCGGA AGGGCTGGCA ACCTCGCCGG ACATCAAGGC GGACGACGAC
GGCGGTGCCT GGGCGCAACT GCTGGGAGCC TGGGACCATG CGTCGGGCGA TGCCAATGCC
ACCGGCTATC AGGCATCGAC CTACGGCGTG CTGGTGGGGC TGGACTCGGC GCTGGCGGAC
GACTGGCGGC TGGGGGTGGC GACCGGCTAC ACCCGCACCT CGCTGGACGG CGGCTACGGC
TCGAATGCCG ACAGCGACAA CTACCACCTG GCAGTGTACG GCGGCAAACA GTTCGGTGAA
CTGGCGCTGC GTGCCGGCGG TGGCTACACC TGGCACCGCT TCGATACCTC GCGTTCGGTC
AATTACGGCA TGCAGTCAGA TCGGGAAACC GCAAAATACA GTGCGCGCAC CGAGCAGGTG
TTTGCCGAAG CGGGTTACAG CGTGAAGGCT GATTGGGTGA ATCTGGAGCC GTTCGCCAAC
CTGGCGTACA TCAACTTCCA GAATAACGGT ATCTCGGAGG ACGGCGGGGC GGCAGCGCTG
CACGGTGACA AGCAACATAC CGACGCGACC GTCTCGACGC TGGGGCTGCG TGCGGATACC
GAGTGGCAGG CGAGCAAAAC CACGTCGGTG GCGCTGCGCA GCGAACTGGG TTGGCAGCAT
CAATACGGCG ATTTGGATCG CGGCACCGGG CTGCGCTTCA ACGGGGGTAA TTCCCCGTTC
GTGGTGAACA GCGTGTCGGC CTCGCGTGAC GGCGCGGTGC TGAAAGCCAT TGCGGAAGTG
GCGGTGAACA AGAACGCCAC GCTGTCGCTG GGCTACGGCG GGTTGCTGTC GCAAAACTAC
CAGGACAACA GCGTCAACGC CGGTTTCACC TGGAAGTTCT GA
 
Protein sequence
MKQATGKNKP ALAPTWKLNA LLCALLAAGG VQAAPYTEGG KSGDPASWRS NEFNANWGLG 
AIHADEAYAA GYTGKGQKVG IFDTPVNKHP EFAGDGKLVN VVTQGNRAYT DPHRKGIKAG
DRFYFDGTFH FYNGDGVTLV NHGVHVAGIS GANRDGIGMH GVAFDSQVIS VDNDNDGPAY
GEFLGLDGSV TNAGWQAMIN NGVRVINNSW GVSIPDFLSD NGKKPDALHF ELKDAQEQFD
QVKPLLGSLA GAGYQGAIDA ARKNILVLFA AGNDGNYTQP DVISGLAYFV PDIAPNWLSV
ASVAQDDAST NSVPYTISSF SSRCGYTASF CVSSPGSKIY STVANGSDPN QLVSDYGNKN
GTSMATPHVT GAVAVLLQRF PYMTSAQIAD VLKTTATDMG APGIDALYGW GMINLGKAIN
GPGMFYTVED IPEEFRIPDP TGVAYGSSQF VANIPGFGAQ IDQGTLQARI CDDYHCSVEV
YSNDIYGHGG LTKEGQGALV LTGTNTYSGP TWVNTGLLAV NGSVTSDVTV QNSGMLSGSG
TVGSLTARSG GTVAPGNSIG TLNVTRNVSF EPGSRYAVEV APNGQSDRIQ SSGSATIGGG
EVAVSLENST NLLSQSEVRS LLGQQYNILT AQQGISGQFD SVAPNYLFLG TGLTYQPNQV
TLNVGRNDTS FASVAATQNE RAAAAAADAL AAGNPVYESI LNAGSTGEAR QAFRQLSGQI
HADIASAQVN DSRYLRDALN GRLRQAEGLA TSPDIKADDD GGAWAQLLGA WDHASGDANA
TGYQASTYGV LVGLDSALAD DWRLGVATGY TRTSLDGGYG SNADSDNYHL AVYGGKQFGE
LALRAGGGYT WHRFDTSRSV NYGMQSDRET AKYSARTEQV FAEAGYSVKA DWVNLEPFAN
LAYINFQNNG ISEDGGAAAL HGDKQHTDAT VSTLGLRADT EWQASKTTSV ALRSELGWQH
QYGDLDRGTG LRFNGGNSPF VVNSVSASRD GAVLKAIAEV AVNKNATLSL GYGGLLSQNY
QDNSVNAGFT WKF