Gene ECH74115_2059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2059 
SymbolansP 
ID6970186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1954837 
End bp1956336 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content52% 
IMG OID643385971 
ProductL-asparagine permease 
Protein accessionYP_002270460 
Protein GI209396771 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0501349 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.00455719 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTAAAC ACGACACCGA CACTTCAGAT CAACACGCCG CGAAACGCCG CTGGCTTAAT 
GCCCACGAAG AGGGGTATCA CAAAGCGATG GGCAATCGCC AGGTACAGAT GATCGCCATT
GGCGGCGCGA TTGGCACCGG CTTGTTTTTA GGTGCAGGAG CCCGACTGCA AATGGCGGGT
CCAGCACTGG CACTGGTTTA TTTAATTTGT GGCTTGTTTT CGTTTTTTAT TCTGCGTGCA
TTGGGTGAGC TGGTGCTACA CCGCCCTTCC AGTGGCAGTT TTGTCTCTTA TGCCCGTGAG
TTTTTGGGTG AGAAAGCCGC TTATGTTGCT GGCTGGATGT ACTTCATCAA CTGGGCGATG
ACGGGGATTG TTGATATTAC CGCCGTCGCT CTGTATATGC ATTACTGGGG TGCGTTTGGC
GGCGTGCCGC AGTGGGTCTT TGCGCTCGCT GCACTTACCA TCGTTGGCAC CATGAATATG
ATCGGTGTGA AATGGTTTGC GGAGATGGAG TTCTGGTTTG CGCTTATTAA AGTGCTCGCC
ATTGTGACCT TTTTGGTCGT GGGTACGGTG TTCCTCGGTA GTGGTCAGCC GCTGGATGGC
AACACCACTG GCTTTCATTT AATCACCGAT AATGGCGGCT TCTTCCCCCA CGGTTTGCTG
CCTGCGCTGG TGTTGATTCA GGGCGTAGTG TTTGCTTTTG CCTCCATTGA AATGGTGGGT
ACAGCTGCCG GAGAATGTAA AGATCCGCAG ACCATGGTGC CTAAAGCGAT TAACAGTGTG
ATTTGGCGTA TTGGCCTGTT TTACGTCGGC TCCGTGGTGT TGCTGGTTAT GTTATTGCCG
TGGAGCGCGT ATCAGGCGGG GCAAAGTCCG TTCGTGACGT TTTTCTCTAA ACTGGGTGTG
CCATATATCG GCAGCATTAT GAACATTGTG GTGCTGACCG CTGCCCTCTC CAGCCTGAAT
TCAGGTCTGT ACTGCACCGG ACGTATTCTG CGCTCAATGG CGATGGGCGG TTCCGCACCG
AGTTTTATGG CGAAAATGAG TCGTCAGCAT GTGCCGTATG CCGGGATTCT GGCGACACTG
GTTGTGTATG TCGTCGGCGT ATTCCTCAAC TATCTGGTGC CGTCGCGCGT ATTTGAGATT
GTGTTGAACT TCGCGTCGCT GGGAATCATC GCTTCATGGG CGTTTATCAT CGTGTGCCAG
ATGCGCCTGC GTAAAGCGAT TAAAGAAGGC AAAGCAGCGG ATGTCAGTTT TAAACTGCCT
GGCGCGCCCT TCACTTCCTG GCTGACATTA CTGTTTTTAC TGAGTGTCCT TGTGCTGATG
GCGTTCGATT ACCCGAACGG GACTTATACT ATCGCGGCGC TGCCGATTAT CGGTATTCTG
CTGGTTATAG GCTGGTTTGG TGTGCGCAAA CGCGTTGCTG AAATTCACAG CACTGCGCCA
GTCGTCGAGG AAGATGAAGA AAAACAGGAA ATTGTGTTTA AGCCTGAAAC GGCGAGCTAA
 
Protein sequence
MSKHDTDTSD QHAAKRRWLN AHEEGYHKAM GNRQVQMIAI GGAIGTGLFL GAGARLQMAG 
PALALVYLIC GLFSFFILRA LGELVLHRPS SGSFVSYARE FLGEKAAYVA GWMYFINWAM
TGIVDITAVA LYMHYWGAFG GVPQWVFALA ALTIVGTMNM IGVKWFAEME FWFALIKVLA
IVTFLVVGTV FLGSGQPLDG NTTGFHLITD NGGFFPHGLL PALVLIQGVV FAFASIEMVG
TAAGECKDPQ TMVPKAINSV IWRIGLFYVG SVVLLVMLLP WSAYQAGQSP FVTFFSKLGV
PYIGSIMNIV VLTAALSSLN SGLYCTGRIL RSMAMGGSAP SFMAKMSRQH VPYAGILATL
VVYVVGVFLN YLVPSRVFEI VLNFASLGII ASWAFIIVCQ MRLRKAIKEG KAADVSFKLP
GAPFTSWLTL LFLLSVLVLM AFDYPNGTYT IAALPIIGIL LVIGWFGVRK RVAEIHSTAP
VVEEDEEKQE IVFKPETAS