Gene Spro_4738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4738 
Symbol 
ID5603874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp5237497 
End bp5240511 
Gene Length3015 bp 
Protein Length1004 aa 
Translation table11 
GC content62% 
IMG OID640940304 
Productouter membrane autotransporter 
Protein accessionYP_001480959 
Protein GI157372970 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain
[TIGR02601] autotransporter-associated beta strand repeat 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTTTTTT GTCTGGCGGC GATCGGTGGA GCGCAGGCTA CACCTTATGT AGAGAGCGGC 
AAGCCGGGCG ATCCTGCGAG TTGGCGCAGT AATGAATTTA ACGCCGAATG GGGCCTGGGA
GCGATCCACG CCGATCAGGC CTACGCCGCC GGTTATAGCG GGAAAGGCGT AAAACTCGGC
ATTTTCGATC AACCGGTGTA TGCCAAACAT CCGGAGTTTG CCAGCCAGGG CAAAGTGATC
AATCTGGTCA CTACCGGCAT TCGCGAATAC ACCGATCCCT ACATTCCGGT GAAAAAAGGC
GACGTATTCC GCTATGACGG TACGCCGAGC GTGGATTCCG ACGGCACCCT GGGTTCACAC
GGCACCCATG TGGGCGGCAT TGCTGCCGGC AACCGCGATG GTGGTGAGAT GCACGGCGTG
GCGTTCAACG CACAAATCAT CAGCGCGGAA AACGGCGACC CCGGGCCGGA AGATGGCATC
ATTCTCGGCA ACGATGGCGC GGTATACCAG GCCGGCTGGG ATGCGTTGAT CGCCAGCGGC
GCGCGCATTA TCAACAACAG CTGGGGTATT GGCATCACTG AGAAGTTTGA TCAGGGCGGC
AAGGATCCGG CCTATCCCCA CTTCACGCTC GCCGACGCGC AAAAGCAATT CGATCAAATC
AAACAGATCC TCGGCACAAA AGCCGGTGGC GCTTATCAGG GCGCGATCGA TGCGGCACGC
AGCGGCATCG TCACCATTTT TGCCGCCGGT AACGATTACA ACCTCAATAA CCCGGACGCC
ATGGCCGGGC TGGCCTATTT CGTGCCGGAG ATAGCTCCCA ACTGGCTGTC GGTCGCCAGT
CTGCAGGATC CGAACAACAC CGGCGACTAC AGCATCAGCA CCTTCTCCTC CCGCTGTGGC
TATACCGCCA GCTTCTGCGT TGCCGCGCCG GGCAGCAAGG TTTACAGCTC AATCATCGAA
GGCAACAGCG TAGATAACCT GACCACCGGT TATGCCAAAT ACAGCGGGAC CTCAATGGCG
GCGCCGCACG TGGCCGGCAG CATTGCGGTG CTGATGGAGC GCTTCCCGTA CATGACCGGC
GCACAGGTTG CCTCGGTGCT GAAGACCACC ACCACCGATA TGGGCGCGCC GGGTATCGAT
GCGCTGTACG GCTGGGGGAT GATCAATCTG GGCAAGGCGA TTAACGGTCC GGGCATGTTT
GTTACCGAGC AGGACATTCC TGAAGAGTTT CGGGTTAGCG GCGCTTACGG CCCGGAGCAG
TTCGTGGTGA ATCTGCCGGG CGTCGGTGCG GTGCTGGACA AGGGCAAACC GACCGAGCGC
GTGTGTGACG ATACGCATTG CGGGTTGGAT GTCTGGAGCA ACAACATTTC CGGTCACGGC
GGCCTGACCA AACAGGGTAT CGGCACCCTG GTGCTGACCG GCAACAACAG CTACGCGGGG
CCGACGCTGA TCAATCAGGG CCTGCTGGCG ATCAACGGCT CGGTGACTTC CAACGTCACG
GTGCAAAACG CCGGCGTGCT GGGCGGTTCC GGCACGGTCG GGTCACTGAC CGCCCGCAGC
GGCGGTACCG TGGCACCGGG CAACTCGATC GGCACGCTGA ACGTGACGCG TAACGTAAGC
TTCGAGCCGG GTTCCCGCTA TGCGGTGGAA GTGGCGCCCA ATGGCCAGAG CGATCGGATC
CAAAGCAGCG GATCGGCGAC GATCGGCGGC GGTGAAGTCG TGGTGTTGCA GGAACAAAAC
GCCAACCTGC TGTCGCAAGG TAGCGTGAGC AGCCTGCTGG GCCGCCAGTA CAATATTCTG
ACCGCCCAGC AGGGTATCAG TGGGCAGTTT GCTTCCACTT CAACTTTCTC TCCCTTCCTG
GGCGCCGGGC TGACCTATCA ACCCAATCAG GTCACGCTGA ACGTTGGCCG TAACGACACC
TCCTTTGCCA GCGTGGCTGC CACTCAGAAT GAGCGTGCGG TAGCGGCAGC GGCGGATGCC
CTGGCGGCAG GCAACCCGGT GTACGAGAGC ATCCTCAATG CTGGCTCTAC AGGGGAAGCG
CGTCAGGCAT TCCGTCAGCT GTCCGGGCAG ATCCACGCGG ATATCGCCTC GGCGCAGGTG
AACGACAGCC GTTACCTGCG TGATGCGCTG AACGGCCGTC TGCGACAGGC GGAAGGGCTG
GCAACCTCGC CGGACATCAA GGCAGACGAC GACGGCGGTG CCTGGGCGCA ACTGCTGGGA
GCCTGGGACC ATGCGTCGGG CGATGCCAAT GCCACCGGCT ATCAGGCATC GACCTACGGC
GTGCTGGTGG GGCTGGACTC GGCGCTGGCG GACGACTGGC GGCTGGGGGT GGCGACCGGC
TACACCCGCA CCTCGCTGGA CGGCGGCTAC GGCTCGAATG CCGACAGCGA CAACTACCAC
CTGGCAGTGT ATGGCGGCAA ACAGTTCGGA GAACTGGCGC TGCGTGCCGG TGGTGGCTAC
ACCTGGCACC GCTTCGATAC CTCGCGTTCG GTCAATTACG GCATGCAGTC AGATCGGGAA
ACCGCAAAAT ACAGTGCGCG CACCGAGCAG GTGTTTGCCG AAGCGGGTTA CAGCGTGAAG
GCCGATTGGG TGAATCTGGA GCCGTTCGCC AACCTGGCGT ACATCAACTT CCAGAACAAC
GGTATCTCGG AGGACGGCGG GGCGGCGGCG CTGCAGGGCG ACAAGCAACA CACCGACGCG
ACCGTCTCGA CGCTGGGGCT GCGTGCGGAT ACCGAGTGGC AGGCGAGCAA AACCACGTCG
GTGGCGCTGC GCAGCGAACT GGGCTGGCAG CATCAGTACG GCGATTTGGA TCGTGGCACC
GGGCTGCGCT TCAACGGGGG TAATTCCCCG TTTGTGGTGA ACAGCGTGTC GGCCTCGCGT
GACGGTGCGG TGCTGAAAGC CAGTGCGGAA GTGGCGGTGA ACAAGAACGC CACGCTGTCG
CTGGGCTACG GCGGGTTGCT GTCGCAAAAC TACCAGGACA ACAGCGTCAA CGCTGGCTTC
ACCTGGAAAT TCTGA
 
Protein sequence
MLFCLAAIGG AQATPYVESG KPGDPASWRS NEFNAEWGLG AIHADQAYAA GYSGKGVKLG 
IFDQPVYAKH PEFASQGKVI NLVTTGIREY TDPYIPVKKG DVFRYDGTPS VDSDGTLGSH
GTHVGGIAAG NRDGGEMHGV AFNAQIISAE NGDPGPEDGI ILGNDGAVYQ AGWDALIASG
ARIINNSWGI GITEKFDQGG KDPAYPHFTL ADAQKQFDQI KQILGTKAGG AYQGAIDAAR
SGIVTIFAAG NDYNLNNPDA MAGLAYFVPE IAPNWLSVAS LQDPNNTGDY SISTFSSRCG
YTASFCVAAP GSKVYSSIIE GNSVDNLTTG YAKYSGTSMA APHVAGSIAV LMERFPYMTG
AQVASVLKTT TTDMGAPGID ALYGWGMINL GKAINGPGMF VTEQDIPEEF RVSGAYGPEQ
FVVNLPGVGA VLDKGKPTER VCDDTHCGLD VWSNNISGHG GLTKQGIGTL VLTGNNSYAG
PTLINQGLLA INGSVTSNVT VQNAGVLGGS GTVGSLTARS GGTVAPGNSI GTLNVTRNVS
FEPGSRYAVE VAPNGQSDRI QSSGSATIGG GEVVVLQEQN ANLLSQGSVS SLLGRQYNIL
TAQQGISGQF ASTSTFSPFL GAGLTYQPNQ VTLNVGRNDT SFASVAATQN ERAVAAAADA
LAAGNPVYES ILNAGSTGEA RQAFRQLSGQ IHADIASAQV NDSRYLRDAL NGRLRQAEGL
ATSPDIKADD DGGAWAQLLG AWDHASGDAN ATGYQASTYG VLVGLDSALA DDWRLGVATG
YTRTSLDGGY GSNADSDNYH LAVYGGKQFG ELALRAGGGY TWHRFDTSRS VNYGMQSDRE
TAKYSARTEQ VFAEAGYSVK ADWVNLEPFA NLAYINFQNN GISEDGGAAA LQGDKQHTDA
TVSTLGLRAD TEWQASKTTS VALRSELGWQ HQYGDLDRGT GLRFNGGNSP FVVNSVSASR
DGAVLKASAE VAVNKNATLS LGYGGLLSQN YQDNSVNAGF TWKF