Gene Ent638_0838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_0838 
SymboltauA 
ID5111186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp935781 
End bp936743 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content58% 
IMG OID640491014 
Producttaurine transporter substrate binding subunit 
Protein accessionYP_001175573 
Protein GI146310499 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4521] ABC-type taurine transport system, periplasmic component 
TIGRFAM ID[TIGR01729] taurine ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.467754 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATTT CATCGCGTAT CACACTACTC GGCGCTCTGG CGCTGTGGGC ATTTCAGGCG 
CAGGCGGTTG ATGTCACCGT CGCGTATCAA ACTTCCGCGG AACCGGCGAA AGTCGCGCAG
GCGGATGGGA CGTTTGCGAA AGAAAGCGGC GCGAAAGTGG ACTGGCGTAA GTTCGACAGC
GGCGCAAGTA TTGTGCGGGC ATTGGCGTCC GGCGATGTGC AGATCGGGAA TCTCGGCTCC
AGCCCGCTGG CGGTTGCTGC GAGCCAGCAA GTGCCCATTG AAGTGTTTCT TCTTGCCTCG
CAGCTCGGGA ATTCCGAAGC GCTGGTGGTG AAGAAAGGGA TCACCAAACC CGAAGATTTG
ATCGGCAAAC GTATCGCCGT GCCGTTTATC TCGACCACCC ACTACAGCCT GCTGGCGGCG
CTCAAACACT GGGGTATCAA GCCGGGTCAG GTGGAAATCC TTAACCTGCA ACCGCCTGCG
ATAATTGCAG CCTGGCAGCG TGGAGATATT GATGGCGCGT ATGTCTGGGC ACCGGCGGTG
AACGCGCTGG AAAAAGACGG CACGGTGCTG ACCGATTCCG AAAAAGTGGC AGAGTGGGGC
GCGCCAACGC TCGACGTGTG GGTGGTGCGT AAAGACTTTG CCGAGAAACA TCCTGACGTG
GTGAAAGCCT TTGCGAAAAG CGCCATCGAT GCGCAACAGC CCTACATTGC CAATCCCGAT
GAATGGCTGA AACAGCCCGC CAATCTGGAA AAACTCTCGC GTCTCAGCGG CGTGCCAGAA
GCGGATGTGC CGGGTCTGGT CAAGGGCAAT ACCTATCTGA CGCCCGCGCA GCAGGTCCAG
CAGCTTTCTG GTCCGGTGAA TAAAGCGATT ATCGACACCG CCGGGTTCCT GAAAGAGCAG
GGCAAAGTGC CTGCGGTGGC GGCGGATTAT AGCCAGTTCG TGACCGATCG CTTTGTGAAA
TAA
 
Protein sequence
MAISSRITLL GALALWAFQA QAVDVTVAYQ TSAEPAKVAQ ADGTFAKESG AKVDWRKFDS 
GASIVRALAS GDVQIGNLGS SPLAVAASQQ VPIEVFLLAS QLGNSEALVV KKGITKPEDL
IGKRIAVPFI STTHYSLLAA LKHWGIKPGQ VEILNLQPPA IIAAWQRGDI DGAYVWAPAV
NALEKDGTVL TDSEKVAEWG APTLDVWVVR KDFAEKHPDV VKAFAKSAID AQQPYIANPD
EWLKQPANLE KLSRLSGVPE ADVPGLVKGN TYLTPAQQVQ QLSGPVNKAI IDTAGFLKEQ
GKVPAVAADY SQFVTDRFVK