Gene Spro_1738 details

Gene Information

Locus tag: Spro_1738
Symbol:
ID: 5604713
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 1915283
End bp: 1916248
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 57%
IMG OID: 640937270
Product: alkanesulfonate transporter substrate-binding subunit
Protein accession: YP_001477970
Protein GI: 157369981
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
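
The coordinate and length fields above can be cross-checked with a short calculation. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive (a convention this page does not state) and that the last codon of the CDS is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1915283
end_bp = 1916248

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the result agrees with the listed gene length of 966 bp and protein length of 321 aa.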


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.159271
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0882717
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGAC TTTCTTCACT GGGCCGTTGG CTGGGCACCA GCGCATTGGC AGGCATCCTC 
TCATTGGCCT GGACCAATGC CGCCAGCGCA CAGGATCCGG CGCAGTTTCG CATTGGTTAC
CAAAAGGGAT CGGTCAGTCT GGTACTGGCA AAAACCCATC GGCTGTTGGA ACAGCGTTTT
CCCAACACCA AAATCAGTTG GATCGAATTC CCCGCCGGCC CGCAAATGCT TGAAGCGCTG
AACGTCGGCA GTATCGATCT GGGCAGTACC GGCGATATTC CGCCGATCTT CGCCCAGGCC
GCCGGAGCGG ACCTGCTGTA TGTTGGCGTA GAGCCACCAA AACCCAAGGC AGAAGTGATC
CTGGTGCCAG AAAACAGCCC GATCAAAACC GTCGCGGAGC TAAAGGGCCA CAAGGTGGCC
TTCCAGAAAG GCTCCAGCTC CCACAATCTG CTGCTGCGCT CATTGCAAAA AGCCGGGCTG
AAATTCACCG ATATTCAGCC CGTCTACCTG ACTCCGGCCG ATGCCCGCGC CGCCTTCCAG
CAGGGCAATG TCGATGCCTG GACAATTTGG GATCCCTACT ATTCCGCGGC CTTGTTGCAG
GGTGGCGTGC GGGTACTGGG TGACGGTACC GATTTGAATC AAACCGGCTC CTTCTATCTG
GCGGTGCGAA CTTATACCGA GGCCAATGGA CCCTTTATTC AACAGGTACT CGATACGCTG
ACCCAGGCTG ATGCGCTGAC CCTAAGCGAC CGTGCGCAAA GCGTCACGCT GCTGGCCAAT
GCCATGGGCC TGCCGGATAA AGTGATTTCG ACCTATTTGG ATCACCGCCC GCCCACCGCC
ATCAAACCTC TGGATGCGCA CACCATAGCC GCTCAGCAGC AAACGGCCGA TCTGTTTTAT
GCCAACCGCC TGGTGCCGGT GAAAGTCGAT ATTTCGCAAC GCATCTGGCA CCCAAGCGCA
CAATAA
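
A GC content figure such as the one listed for this gene is the fraction of G and C bases in the nucleotide sequence. The following is a minimal Python sketch of that calculation. The function name `gc_fraction` is hypothetical, and the sketch assumes the sequence is pasted with the space-separated 10-base blocks used on this page, which it strips before counting.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used to lay out the sequence.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return gc_count / len(seq)

# Example on the first 10-base block of the gene sequence only.
print(gc_fraction("ATGAAACGAC"))  # 0.4
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field.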
 
Protein sequence
MKRLSSLGRW LGTSALAGIL SLAWTNAASA QDPAQFRIGY QKGSVSLVLA KTHRLLEQRF 
PNTKISWIEF PAGPQMLEAL NVGSIDLGST GDIPPIFAQA AGADLLYVGV EPPKPKAEVI
LVPENSPIKT VAELKGHKVA FQKGSSSHNL LLRSLQKAGL KFTDIQPVYL TPADARAAFQ
QGNVDAWTIW DPYYSAALLQ GGVRVLGDGT DLNQTGSFYL AVRTYTEANG PFIQQVLDTL
TQADALTLSD RAQSVTLLAN AMGLPDKVIS TYLDHRPPTA IKPLDAHTIA AQQQTADLFY
ANRLVPVKVD ISQRIWHPSA Q
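
The protein sequence can be compared against a translation of the gene sequence. The following is a minimal Python sketch, not the database's own method. It builds the codon table from the usual NCBI base ordering (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard table and differs in which start codons are permitted, which does not matter here because the gene begins with ATG. The function name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to lay out the sequence.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence as shown on this page.
first_line = "ATGAAACGAC TTTCTTCACT GGGCCGTTGG CTGGGCACCA GCGCATTGGC AGGCATCCTC"
print(translate(first_line))  # MKRLSSLGRWLGTSALAGIL
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence allows the same comparison for the whole protein.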