Gene Spro_3004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_3004 
Symbol 
ID5603882 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp3303759 
End bp3305264 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content58% 
IMG OID640938545 
Producthypothetical protein 
Protein accessionYP_001479233 
Protein GI157371244 
COG category[S] Function unknown 
COG ID[COG3517] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03355] type VI secretion protein, EvpB/VC_A0108 family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACT CGCCTCAGCA ACAAAAAGCG CTGCAGACCA GCGAAGCTTT CTCCAGCGAT 
GAATTCAGCG CGCTGCTGAA CAAAGAGTTT CGTCCCAAGA CCGATCAGGC CAAGGAAGCG
GTAGAAAATG CGGTAAAAAC CCTGGCGCAG CAGGCGCTGG AAAACACCGT GACGGTCTCC
TCTGACGCCT ATCGCACCAT TCAGGCGCTG ATCGCCGAAA TCGACGAAAA ACTGTCGCAA
CAGGTTAACC AGATTATCCA CCACGCCGAA TTCCAGAAGC TGGAAGGCGC CTGGCATGGC
CTGCACTATC TGGTCAACAA CTCCGAAACC GATGAAATGC TGAAAATCCG CTTTATGAGC
ATTTCCAAGC AGGAGCTGGG CCGCACGCTG AAGCGCCATA AAGGTGTGGG CTGGGATCAG
AGCCCCATCT TCAAGAAAGT GTACGAGGAA GAGTACGGCC AGTTCGGCGG CGAACCCTTC
GGTTGCCTGG TGGGCGATTA CTACTTCGAC CACAGCCCAC AGGACGTAGA GTTGCTGGGT
GAGATGGCGA AAATCAGTGC CGCCTCCCAC TGCCCCTTCA TCGCCGGTAC CGCACCGAGC
GTGATGCAGA TGGAATCCTG GCAGGAGCTG TCCAACCCAC GCGATTTGAC CAAGATCTTC
CAAAATACCG AATACGCCGC CTGGCGCAGC CTGCGTGAAT CGGAAGATGC CCGCTATCTG
GGCCTGGTAA TGCCGCGATT CCTGGCGCGC CTGCCGTATG GCATCCGAAC CAATCCGGTT
GACGAGTTCG ACTTTGAGGA AGAAACCGAC GGCGCGAACC ACGGTAACTA CACCTGGACC
AACGCCGCCT ACGCCATGGC CGCCAACATC AACCGTTCGT TCAAAGAGTT CGGTTGGTGT
ACCGCGATCC GTGGGGTGGA GTCTGGCGGT GCGGTAGAGA ACCTGCCGTG CCACACCTTC
CCAAGCGACG ACGGCGGCGT GGACATGAAG TGCCCGACCG AGATCGCCAT CAGCGATCGT
CGCGAGGCCG AGCTGGCCAA GAACGGCTTC ATGCCGTTGG TACACCGGAA AAACTCCGAC
TTTGCCGCCT TTATCGGTGC CCAGTCGCTG CAAAAGCCGG CCGAATACTA CGACGCCGAT
GCATCTGCCA ATGCTCAACT CTCCGCCCGT CTGCCTTATC TGTTCGCCTG CTGCCGCTTC
GCGCATTACC TGAAGTGCAT CGTCCGTGAC AAGATCGGTT CTTTCCGCGA GCGCGACGAT
ATGGAACGCT GGTTAAACGA CTGGATCATG AACTACGTGG ATGGCGATCC GGCCAACTCC
TCGCAGGAAA CCAAGTCGCG CAAACCGCTG GCCGCTGCGG AAGTTCAGGT GGAAGAGATC
GAAGACAACC CGGGCTACTA CAGCGCCAAG TTCTTCCTGC GCCCACATTA CCAATTGGAA
GGCTTGACCG TCTCCCTGCG TCTGGTATCG AAACTGCCCT CACTGAAGCA GAACGACGCA
TCCTGA
 
Protein sequence
MSNSPQQQKA LQTSEAFSSD EFSALLNKEF RPKTDQAKEA VENAVKTLAQ QALENTVTVS 
SDAYRTIQAL IAEIDEKLSQ QVNQIIHHAE FQKLEGAWHG LHYLVNNSET DEMLKIRFMS
ISKQELGRTL KRHKGVGWDQ SPIFKKVYEE EYGQFGGEPF GCLVGDYYFD HSPQDVELLG
EMAKISAASH CPFIAGTAPS VMQMESWQEL SNPRDLTKIF QNTEYAAWRS LRESEDARYL
GLVMPRFLAR LPYGIRTNPV DEFDFEEETD GANHGNYTWT NAAYAMAANI NRSFKEFGWC
TAIRGVESGG AVENLPCHTF PSDDGGVDMK CPTEIAISDR REAELAKNGF MPLVHRKNSD
FAAFIGAQSL QKPAEYYDAD ASANAQLSAR LPYLFACCRF AHYLKCIVRD KIGSFRERDD
MERWLNDWIM NYVDGDPANS SQETKSRKPL AAAEVQVEEI EDNPGYYSAK FFLRPHYQLE
GLTVSLRLVS KLPSLKQNDA S