Gene EcSMS35_0006 details


Gene Information

Locus tag: EcSMS35_0006
Symbol: agcS
ID: 6146281
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 6415
End bp: 7845
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 53%
IMG OID: 641614907
Product: amino acid carrier protein
Protein accession: YP_001742123
Protein GI: 170680075
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1115] Na+/alanine symporter
TIGRFAM ID: [TIGR00835] amino acid carrier protein
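
The start, end, and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function name cds_lengths is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values copied from the record: Start bp 6415, End bp 7845.
print(cds_lengths(6415, 7845))  # (1431, 476)
```

Both results agree with the Gene Length (1431 bp) and Protein Length (476 aa) fields listed in the record.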


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGATT TTTTCTCCTT TATTAACAGC GTCCTTTGGG GATCGGTAAT GATTTACCTG 
CTCTTCGGCG CAGGTTGTTG GTTCACTTTT CGCACCGGAT TTGTGCAGTT TCGCTACATC
CGCCAGTTTG GCAAAAGTCT TAAAAATAGC ATTCATCCAC AGCCAGGCGG TTTAACCTCA
TTTCAGTCAT TGTGTACCAG TCTTGCGGCG CGCGTGGGTA GCGGCAACCT GGCTGGCGTT
GCGCTGGCTA TTACCGCCGG TGGACCAGGT GCCGTCTTCT GGATGTGGGT TGCCGCGTTT
ATCGGCATGG CGACCTCGTT TGCCGAATGT TCCCTTGCAC AACTTTATAA AGAACGTGAC
GTTAATGGGC AGTTTCGTGG CGGACCGGCA TGGTATATGG CGCGCGGGCT GGGGATGCGC
TGGATGGGCG TTCTGTTCGC CCTCTTTTTG CTCATCGCCT ACGGCATAAT TTTCAGCGGA
GTTCAGGCGA ACGCCGTTGC CCGAGCCCTG AGTTTTTCTT TTGATTTTCC TCCGCTGGTG
ACAGGCATTA TTCTCGCTGT CTTTGCTCTG CTGGCAATCA CTCGCGGTCT TCATGGCGTC
GCCCGGCTCA TGCAGGGGTT TGTCCCGTTG ATGGCGATAA TCTGGGTACT GACCAGCCTG
GTAATTTGCG TAATGAATAT CGGGCAACTT CCCCACGTCA TTTGGTCTAT TTTTGAGAGT
GCTTTTGGCT GGCAGGAAGC GGCAGGCGGC GCGGCGGGAT ATACCTTAAG CCAGGCGATT
ACTAACGGTT TTCAGCGCAG TATGTTTTCC AATGAGGCGG GAATGGGGTC GACGCCAAAC
GCGGCAGCGG CAGCGGCGTC CTGGCCTCCG CATCCCGCAG CGCAAGGAAT TGTCCAGATG
ATTGGCATTT TTATCGACAC CCTGGTCATC TGTACGGCAA GCGCCATGCT GATATTACTG
GCGGGTAACG GCACAACCTA CATGCCGCTG GAAGGTATTC AGCTTATCCA GAAGGCGATG
CGGGTGTTAA TGGGTTCCTG GGGTGCTGAG TTTGTTACCC TCGTGGTTAT TCTGTTTGCC
TTCAGCTCCA TCGTTGCCAA CTACATTTAT GCCGAAAACA ATCTCTTCTT TTTACGCCTG
AACAACCCTA AAGCGATCTG GTGTCTGCGG ATCTGCACCT TCGCAACGGT CATCGGCGGC
ACCTTGCTAA GTCTTCCGCT GATGTGGCAA CTGGCAGATA TCATAATGGC CTGCATGGCT
ATTACCAATT TGACCGCCAT ATTACTGCTC TCGCCTGTGG TTCATACCAT TGCCAGTGAT
TATCTACGCC AGCGTAAACT CGGCGTGCGC CCGGTGTTTG ATCCGTTGCG TTATCCGGAG
ATCGGTCGCC AGCTTTCTCC AGACGCGTGG GATGACGTTT CGCAGGAGTA A
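
The GC content field in the record can be recomputed from a sequence laid out in space-separated blocks of ten, as above. This is a minimal sketch, assuming the sequence contains only A, C, G, T plus whitespace; gc_content is a hypothetical helper name, and only the first block of the gene is used here as a short demonstration.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (spaces and line breaks between blocks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First ten bases of the gene sequence above.
print(f"{gc_content('ATGCCAGATT'):.0%}")  # 40%
```

Passing the full gene sequence to the same function gives a value that can be compared against the GC content listed in the record.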
 
Protein sequence
MPDFFSFINS VLWGSVMIYL LFGAGCWFTF RTGFVQFRYI RQFGKSLKNS IHPQPGGLTS 
FQSLCTSLAA RVGSGNLAGV ALAITAGGPG AVFWMWVAAF IGMATSFAEC SLAQLYKERD
VNGQFRGGPA WYMARGLGMR WMGVLFALFL LIAYGIIFSG VQANAVARAL SFSFDFPPLV
TGIILAVFAL LAITRGLHGV ARLMQGFVPL MAIIWVLTSL VICVMNIGQL PHVIWSIFES
AFGWQEAAGG AAGYTLSQAI TNGFQRSMFS NEAGMGSTPN AAAAAASWPP HPAAQGIVQM
IGIFIDTLVI CTASAMLILL AGNGTTYMPL EGIQLIQKAM RVLMGSWGAE FVTLVVILFA
FSSIVANYIY AENNLFFLRL NNPKAIWCLR ICTFATVIGG TLLSLPLMWQ LADIIMACMA
ITNLTAILLL SPVVHTIASD YLRQRKLGVR PVFDPLRYPE IGRQLSPDAW DDVSQE
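
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below shows one way to check that, assuming the gene sequence is given in coding orientation starting at the first base of the start codon. The names CODON_TABLE and translate are hypothetical. The amino acid assignments of table 11 are the same as those of the standard code; the tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence and its last 21 bases.
print(translate("ATGCCAGATT TTTTCTCCTT TATTAACAGC"))  # MPDFFSFINS
print(translate("GATGACGTTT CGCAGGAGTA A"))           # DDVSQE
```

The two outputs match the first ten and last six residues of the protein sequence above.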