Gene EcHS_A2992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2992 
Symbol 
ID5591215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3004168 
End bp3005397 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content45% 
IMG OID640922113 
Productserine transporter family protein 
Protein accessionYP_001459616 
Protein GI157162298 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0814] Amino acid permeases 
TIGRFAM ID[TIGR00814] serine transporter 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.000482525 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAATA TTTGGTCTAA AGAAGAAACT CTGTGGAGTT TCGCGCTCTA CGGCACAGCC 
GTTGGTGCAG GCACGCTCTT CCTTCCTATT CAGTTAGGTT CGGCAGGGGC TGTGGTCCTG
TTTATTACTG CTCTGGTCGC CTGGCCTTTA ACATATTGGC CACATAAAGC CTTATGCCAG
TTCATCCTCT CATCGAAAAC ATCAGCAGGT GAAGGGATAA CGGGCGCAGT AACACACTAC
TATGGCAAGA AGATTGGTAA TCTGATTACC ACGCTGTACT TCATCGCCTT TTTTGTCGTC
GTGTTGATAT ATGCAGTGGC AATTACCAAC TCACTTACGG AACAGCTGGC AAAGCATATG
GTTATTGATC TTCGCATCCG TATGTTGGTG AGTCTGGGTG TTGTATTAAT CCTGAATCTC
ATTTTTCTGA TGGGACGCCA TGCAACCATT CGGGTAATGG GTTTTTTGGT ATTCCCATTG
ATTGCCTATT TCTTATTTCT TTCCATTTAC CTTGTCGGTA GTTGGCAACC TGATCTATTA
ACGACCCAGG TAGAGTTCAA TCAGAATACC CTTCACCAGA TATGGATATC GATTCCCGTG
ATGGTTTTCG CCTTTAGCCA TACGCCCATT ATTTCTACGT TTGCCATAGA CAGACGTGAA
AAATATGGCG AACACGCTAT GGATAAATGC AAAAAAATTA TGAAAGTCGC TTATCTCATC
ATCTGCATAA GTGTACTGTT CTTTGTCTTT AGCTGCCTGC TTTCTATTCC ACCTTCGTAT
ATTGAAGCGG CTAAAGAAGA AGGGGTCACC ATTTTATCGG CGCTTTCTAT GCTGCCGAAC
GCCCCAGCAT GGCTGTCAAT TTCCGGGATT ATTGTCGCAG TAGTTGCGAT GTCGAAATCA
TTCCTGGGTA CGTACTTTGG CGTTATTGAA GGTGCCACGG AGGTCGTCAA AACAACACTA
CAGCAGGTTG GTGTAAAGAA AAGTCGTGCA TTTAACCGCG CACTATCAAT TATGTTGGTA
TCGCTGATCA CCTTCATTGT TTGTTGCATT AACCCGAACG CGATTTCGAT GATTTACGCG
ATCAGCGGCC CGCTCATTGC CATGATACTT TTCATCATGC CTACGTTGTC AACGTATCTG
ATCCCGGCGC TTAAACCCTG GCGTTCCATC GGAAATCTGA TTACGCTGAT CGTGGGTATC
CTGTGCGTAT CGGTAATGTT CTTTAGCTGA
 
Protein sequence
MSNIWSKEET LWSFALYGTA VGAGTLFLPI QLGSAGAVVL FITALVAWPL TYWPHKALCQ 
FILSSKTSAG EGITGAVTHY YGKKIGNLIT TLYFIAFFVV VLIYAVAITN SLTEQLAKHM
VIDLRIRMLV SLGVVLILNL IFLMGRHATI RVMGFLVFPL IAYFLFLSIY LVGSWQPDLL
TTQVEFNQNT LHQIWISIPV MVFAFSHTPI ISTFAIDRRE KYGEHAMDKC KKIMKVAYLI
ICISVLFFVF SCLLSIPPSY IEAAKEEGVT ILSALSMLPN APAWLSISGI IVAVVAMSKS
FLGTYFGVIE GATEVVKTTL QQVGVKKSRA FNRALSIMLV SLITFIVCCI NPNAISMIYA
ISGPLIAMIL FIMPTLSTYL IPALKPWRSI GNLITLIVGI LCVSVMFFS