Gene EcSMS35_1718 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1718 
SymbolansP 
ID6143914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1725255 
End bp1726754 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content53% 
IMG OID641616594 
ProductL-asparagine permease 
Protein accessionYP_001743772 
Protein GI170681095 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.00274006 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTAAAC ACGACACCGA CACTTCAGAT CAGCACGCCG CGAAACGCCG CTGGCTTAAT 
GCCCACGAAG AGGGGTATCA CAAAGCGATG GGCAATCGCC AGGTTCAGAT GATCGCCATT
GGCGGCGCGA TTGGCACCGG CTTGTTTTTA GGTGCAGGAG CCCGACTGCA AATGGCGGGG
CCAGCACTGG CACTGGTCTA TTTAATCTGC GGCCTGTTTT CATTTTTTAT TCTGCGTGCA
TTGGGTGAGC TGGTGCTACA CCGCCCTTCC AGTGGCAGTT TTGTCTCTTA TGCCCGTGAG
TTTTTAGGAG AGAAAGCCGC TTACGTTGCA GGCTGGATGT ACTTCATCAA CTGGGCGATG
ACAGGGATTG TCGATATCAC CGCCGTCGCT CTGTATATGC ATTACTGGGG TGCGTTTGGC
GGCGTGCCGC AGTGGGTCTT TGCGCTCGCT GCGCTGACCA TCGTTGGCAC CATGAATATG
ATTGGTGTGA AATGGTTTGC GGAGATGGAG TTCTGGTTTG CGCTTATTAA AGTGCTCGCC
ATCGTGACCT TTTTGGTCGT GGGTACAGTG TTCCTCGGTA GCGGTCAGCC GCTGGATGGC
AACACCACTG GCTTTCATTT AATTACCGAT AATGGCGGCT TCTTCCCCCA CGGTTTGCTG
CCTGCGCTGG TGTTGATTCA GGGCGTAGTG TTTGCTTTTG CCTCTATTGA AATGGTGGGT
ACAGCGGCCG GAGAATGTAA AGATCCGCAG ACCATGGTGC CTAAAGCGAT TAACAGTGTG
ATTTGGCGTA TTGGCCTGTT TTACGTCGGC TCCGTGGTGT TGCTGGTTAT GCTATTGCCG
TGGAGCGCGT ATCAGGCGGG GCAAAGTCCG TTCGTGACGT TTTTCTCTAA ACTGGGTGTG
CCATATATCG GCAGCATTAT GAACATTGTG GTGCTGACCG CTGCCCTCTC CAGCCTGAAT
TCAGGTCTGT ACTGCACCGG ACGCATTCTG CGCTCAATGG CGATGGGCGG TTCCGCACCG
AGTTTTATGG CGAAAATGAG TCGTCAGCAT GTGCCGTATG CCGGGATTCT GGCGACACTG
GTTGTGTATG TCGTCGGCGT ATTCCTCAAC TATCTGGTGC CGTCGCGCGT ATTTGAGATT
GTGTTGAACT TCGCGTCGCT GGGAATCATC GCTTCATGGG CGTTTATCAT CGTGTGCCAG
ATGCGCCTGC GTAAAGCGAT TAAAGAAGGC AAAGCAGCGG ATGTCAGTTT TAAACTGCCT
GGCGCGCCCT TCACTTCCTG GCTGACATTA CTGTTTTTAC TGAGTGTTCT GGTGCTGATG
GCGTTCGATT ACCCGAACGG GACTTATACT ATCGCGGCGC TACCGATTAT CGGTATTCTG
CTGGTTATAG GCTGGTTTGG TGTGCGCAAA CGCGTTGCTG AAATTCACAG CACTGCGCCA
GTCGTCGAGG AAGATGAAGA ACAACAGGAA ATTGTGTTTA AGCCTGAAAC GGCGAGTTAA
 
Protein sequence
MSKHDTDTSD QHAAKRRWLN AHEEGYHKAM GNRQVQMIAI GGAIGTGLFL GAGARLQMAG 
PALALVYLIC GLFSFFILRA LGELVLHRPS SGSFVSYARE FLGEKAAYVA GWMYFINWAM
TGIVDITAVA LYMHYWGAFG GVPQWVFALA ALTIVGTMNM IGVKWFAEME FWFALIKVLA
IVTFLVVGTV FLGSGQPLDG NTTGFHLITD NGGFFPHGLL PALVLIQGVV FAFASIEMVG
TAAGECKDPQ TMVPKAINSV IWRIGLFYVG SVVLLVMLLP WSAYQAGQSP FVTFFSKLGV
PYIGSIMNIV VLTAALSSLN SGLYCTGRIL RSMAMGGSAP SFMAKMSRQH VPYAGILATL
VVYVVGVFLN YLVPSRVFEI VLNFASLGII ASWAFIIVCQ MRLRKAIKEG KAADVSFKLP
GAPFTSWLTL LFLLSVLVLM AFDYPNGTYT IAALPIIGIL LVIGWFGVRK RVAEIHSTAP
VVEEDEEQQE IVFKPETAS