Gene Spro_4566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4566 
SymboltauA 
ID5606991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp5040562 
End bp5041557 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content59% 
IMG OID640940132 
Producttaurine transporter substrate binding subunit 
Protein accessionYP_001480787 
Protein GI157372798 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4521] ABC-type taurine transport system, periplasmic component 
TIGRFAM ID[TIGR01729] taurine ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGCA AACACTTTTC ATTACGCGGC GCGGCACTGC TGGTGTTATC ACTTGCTGCG 
GCTAACGCCT ATGCGGTGGA CGTGACCGTG GCGTACCAGA CTTCGGCGGA ACCCGCCAAG
GTGGCGCAGG CGGAAAACAG CTTCGCCAAA CAGTCTGGTG CCACCGTTGA CTGGCGTAAA
TTCGACAGCG GCTCCAGCGT GCTGCGTGCT TTGGCCTCCG GTGACGTGCA GATCGGTAAT
ATCGGTTCCA GCCCACTGGC GGTGGCCGCC AGCCAAAAAC TCCCTATCGA AGTCTTCCTG
ATCGCCTCCC AGCTCGGCAG CTCCGAAGCC CTGGTGGTGA AGAAAGAGAT CAAAACCCCG
CAGGATTTGA TCGGCAAGCG AATCGCCGTG CCCTTTATCT CAACCACTCA CTACAGCCTG
CTAGCCTCGC TCAAGCACTG GGGCATCAAG CCTGAGCAGG TCAAAATTCT CAATCTGCAA
CCGCCGGCCA TTGCCGCTGC CTGGCAGCGC GGTGACATCG ACGGAGCCTA CGTCTGGGCG
CCGGTAGTCA ATGAATTAGC CAAGCAGGGC AAGGTACTGA CCGATTCCGC CCAGGTTGGA
CAATGGGGCG CGCCGACGCT TGACGTCTGG GTGGTGCGCA AGGACTTTGC CGAAAAACAC
CCGGAAGTGG TGACCGCCTT TGCCGCCAGC GCGTTGAACG CCCAAAAAGC CTATCTGGCG
CAGCCGGATC AGTGGCTGAA GGATAAAGGC AATCTCAACA CGCTGTCCCG TTTGAGCGGC
GTGCCGGAAG AACAGATACC GGTGCTGGTG AAGGGCAATA CCTATTTGCC GGTGGCGGAG
CAAATAACCC AACTTGGCCA GCCGGTGGAC AAGGCTATCC GCGATACCGC CGAGTTCCTT
AAACAGCAGG GCAAAATTCC GCAGGTCGAC GGTGATTACA GTGCCTACGT CACCGATCGC
TTTGTGAAAC AGGTGCAGGC TGCGCCGCAG TCGTAA
 
Protein sequence
MASKHFSLRG AALLVLSLAA ANAYAVDVTV AYQTSAEPAK VAQAENSFAK QSGATVDWRK 
FDSGSSVLRA LASGDVQIGN IGSSPLAVAA SQKLPIEVFL IASQLGSSEA LVVKKEIKTP
QDLIGKRIAV PFISTTHYSL LASLKHWGIK PEQVKILNLQ PPAIAAAWQR GDIDGAYVWA
PVVNELAKQG KVLTDSAQVG QWGAPTLDVW VVRKDFAEKH PEVVTAFAAS ALNAQKAYLA
QPDQWLKDKG NLNTLSRLSG VPEEQIPVLV KGNTYLPVAE QITQLGQPVD KAIRDTAEFL
KQQGKIPQVD GDYSAYVTDR FVKQVQAAPQ S