Gene Spro_4729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4729 
Symbol 
ID5602836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp5225349 
End bp5226635 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content51% 
IMG OID640940295 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_001480950 
Protein GI157372961 
COG category 
COG ID 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.257698 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGG TATTACCAAG GAAAATATCC TGCCTTTCGG CCATGATAAT GTTTTGCCTC 
AGTGGCGGAG CCGCCTTTGC TGACGCCTCC ATTGTGCCGG TTGGCGGCAA CGTTAACCTC
GCTGCCAATA ACGTGCCGGT AGTGAATATC AATCAGCCGG GCCAGGATGG CGTATCACAT
AACCAATACA GCCAGTTTGA TGTAGGTAGC CAGGGCGCGG TATTGAACAA TGCGCAAGGC
AGCGCGCAAA GCCAACTGGC CGGTGCCATC ACCGCCAACC CCAACCTGAG CGCAGGTGCC
GCGAAGGTGA TTCTTAATGA AATCAATTCC AGCGACAAAA CGCTGCTGAA TGGGATGATC
GAAGTCGCCG GCCAGAAGGC GGACGTTATC GTCGCCAACC GTTCCGGCAT TACCTGTAAC
GGTTGCGGAT TTATCAATAC CGGTACCGGG GTACTGACCA CCGGTGAACT GCAGTTCAAA
GACAAACAGT TCACCGGCTA TAGCGTTACC GGCGGCACGG TAGCATTCGA AGGCAAGGGG
CTGCAGAAAG GTGATGTGGA CTACACCGCC GTGATTGCCC GCGCTGAGAA TATCAATGCC
GCGGTGCAAG CCAAGAGCCT GCTGGTGTTG GCGGGCAAGA AAACCGTCAC CGCAGATCTG
ACTCGTTACA TAAACCTGGA AAACAAAGAA AAATCACCAG AAGTGTTGGT AGATGTCTCC
CAGCTCGGTG GTATGTATGC AGGAAAAATC ATCTTAATCG CCAATGAAAA TGGTGTGGGG
GTTAATGACA GTACACCCGT CAGCCTTAGA AATAACGGCG AATTAGTGAC TGATGGCGGT
GATATGTTGT TGTCTGCTAC GTCTCTTTCA AACGCGGGTA AAATCGCTGC TGGCGGCAAT
GCAAACCTGG CAACCACCGT GCTGTCGAAC CAAGGGGATA TCACGGCCAA TAACGCACTT
AATATCAGCG CCACCTCTTT CAGCAACGCT AAAGACATCA GCGCAAAAAA TAATATTGAT
ATTAAGTCAA CCACTACGTC TAACAATGGG ACAATAGTCT CTCAGCAAGG AGGGGTGACC
GTATCGGATA CGACCTTCTC CAACAACGGC ACTATTCGGG CAAAAAATAA CATTTCTCAT
AAAGGAACGT TATTTAGCAA TAACGGTACC CTGACCTCAA CCCAAGGTTC AGTGTTCAAC
AACGGCAGAG ACATTAATTA CAAGCCTCCT GTCGTCACCC CACCGACAAA CTCATGGTGG
TCATCCTGGT GGTCCCGTAG CTGGTAA
 
Protein sequence
MKKVLPRKIS CLSAMIMFCL SGGAAFADAS IVPVGGNVNL AANNVPVVNI NQPGQDGVSH 
NQYSQFDVGS QGAVLNNAQG SAQSQLAGAI TANPNLSAGA AKVILNEINS SDKTLLNGMI
EVAGQKADVI VANRSGITCN GCGFINTGTG VLTTGELQFK DKQFTGYSVT GGTVAFEGKG
LQKGDVDYTA VIARAENINA AVQAKSLLVL AGKKTVTADL TRYINLENKE KSPEVLVDVS
QLGGMYAGKI ILIANENGVG VNDSTPVSLR NNGELVTDGG DMLLSATSLS NAGKIAAGGN
ANLATTVLSN QGDITANNAL NISATSFSNA KDISAKNNID IKSTTTSNNG TIVSQQGGVT
VSDTTFSNNG TIRAKNNISH KGTLFSNNGT LTSTQGSVFN NGRDINYKPP VVTPPTNSWW
SSWWSRSW