Gene Spro_3641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_3641 
Symbol 
ID5606101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp4026711 
End bp4028621 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content58% 
IMG OID640939192 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_001479865 
Protein GI157371876 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCA CCAGACGCGA TTTTCTTAAT GGGGTGGCGA TCACTATCGC CGCCGGGTTA 
ACGCCGATGC AGATCCTGCG GGCATCGCCG CAAACCGCCA ATCAAACCCT CTATTATCCG
CCGACGCTGA CCGGATTGCG GGGCAACCAT CCCGGTTCGT TTGAGCATGC TCACCAACTG
GGGCGTGACG GCAAGGCCTT CGATTTTGCC AGCATCCCGG CGACGGAAGA GTTCGATCTG
GTGGTAGTCG GCGCCGGGAT CAGCGGACTG GCCGCCGCCT GTTTCTGGCA GCAAATGAAA
GGTCAGCAGC AGCGTATCTT GCTGATCGAC AACCATGATG ATTTCGGTGG CCACGCCAAG
CGCAATGAAT TCAGCAGCGA AAATGGCACC ATTCTCGGCT ACGGCGGCAG CGAGTCGCTG
CAGTCGCCGC GCTCCAACTT CAGCCCGGTG GCGATGAGGC TGCTGCAAAA GCTGGGCGTC
AGCATCGACA ACCTGGAAAA GGCTTTCGAT AAAACCTTCT ACCCGGATCT TAACCTGAGC
CGTGGCGTCT ATTTCGATCG CAAAAACTTC GGCGTCGACA AAGTGGTGAA CGGGGATCCT
GGCCGTATGG TGGCGGATGA TATTCCCCAT GACCGCCTTA ATGGCCGTTC CTACGAAGCC
TTTATCGGTG ATTTCCCGCT GCCGGAAAGC GATCGCCAGG CGCTGATTGC ACTGCATACG
GTGGATAAGG ATTACCTGCC GGAAATGAGT CAGGAGCAGA AAAGCGAATG GCTCGACAAG
CACAGTTATA CCGAATTCCT GCGTGACAAG GTTGGCCTGA GCGAAATGGC GATCCGCTAT
TTCCAACAAA CCACCAGTGA CTTCCAGGCG GTGGGTATCG ACGCCACTTC GTGCAGCGAT
GCGCGTATTT GCGATCTGCC TGGCCTGAAC GGCATGAACC TGCCGCCGCT GGATGAAGAG
TCACAGGCGG ATCTCGACGA TCCTTACGTG TTCCACTTCC CGGACGGCAA CGCCACGCTG
ACACGCTTAA TGGTGCGCCA TCTGATCCCG GCGGTAGCGC CTGGCGGTAA GGACATGAAT
GACATAGTGC TGGCGAAGTT CGACTACAGC CAGCTTGACC GGGCGGAGTC ACCGGTAAAA
CTGCGCTTGA ACAGCACCGG GCTGCACGCG GCTAACGTCG GCGACAAGGT CGAAGTGACC
TACATGACCG GCGAGAAAAT GACCAAGGTG CGCGCCGGGC AGGTAGTGAT GGCCGGCTAC
AATATGATGA TCCCTTATCT GGTGCCGGAA ATGTCGCCGG AGCAGCAACT GGCGCTGAAG
CAGAACGTCA AGTCGCCGCT GGTGTACAGC AAAGTGGTGA TCCGTAACTG GCAGTCGTTT
ATTAAACTGG GCGTGCATGA AGTTTACTCG CCAACGGCGC CTTATTGCCG TGTGAAGCTG
GATTATCCGG TGAGCATGGG CGGCTACCAG CATCCACGCG ATCCGAACCA GCCGATTGGC
CTGCACATGG TGTATGTGCC GACGCTGGCG GGCAGCGGGT TAAGCCCACG CGAGCAGTCG
CGCAAGGGCC GTGCCTTGCT GTTGGGCACG CCGTTTGAAG TGCATGAGCA GATGATCCGT
GAGCAGTTGC AGGGCATGCT CGGTTCCGCC GGTTTTGATC ATCAGCGTGA TATTGAAGCG
ATCACCGTTA ACCGCTGGTC GCACGGCTAT TCCTACTTCC TCAACGGGCT GTTTGACGAT
GAGGACGAGG CGAAGAAAAT CATTGAGACG GCGCGTAAGC CGATCGGCAA AATTGTGATT
GCCAACTCGG ATTCAGACTG GAGTCCGTAC GCCAACTCGG CGATCGATCA GGCGTGGCGC
GCGGTTAATG AACTGGCCTT CGGCCAGGTT GCCGCCAAGG AGGGAGCATG A
 
Protein sequence
MSITRRDFLN GVAITIAAGL TPMQILRASP QTANQTLYYP PTLTGLRGNH PGSFEHAHQL 
GRDGKAFDFA SIPATEEFDL VVVGAGISGL AAACFWQQMK GQQQRILLID NHDDFGGHAK
RNEFSSENGT ILGYGGSESL QSPRSNFSPV AMRLLQKLGV SIDNLEKAFD KTFYPDLNLS
RGVYFDRKNF GVDKVVNGDP GRMVADDIPH DRLNGRSYEA FIGDFPLPES DRQALIALHT
VDKDYLPEMS QEQKSEWLDK HSYTEFLRDK VGLSEMAIRY FQQTTSDFQA VGIDATSCSD
ARICDLPGLN GMNLPPLDEE SQADLDDPYV FHFPDGNATL TRLMVRHLIP AVAPGGKDMN
DIVLAKFDYS QLDRAESPVK LRLNSTGLHA ANVGDKVEVT YMTGEKMTKV RAGQVVMAGY
NMMIPYLVPE MSPEQQLALK QNVKSPLVYS KVVIRNWQSF IKLGVHEVYS PTAPYCRVKL
DYPVSMGGYQ HPRDPNQPIG LHMVYVPTLA GSGLSPREQS RKGRALLLGT PFEVHEQMIR
EQLQGMLGSA GFDHQRDIEA ITVNRWSHGY SYFLNGLFDD EDEAKKIIET ARKPIGKIVI
ANSDSDWSPY ANSAIDQAWR AVNELAFGQV AAKEGA