Gene Information |
Locus tag | EcSMS35_0006 |
Symbol | agcS |
ID | 6146281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 6415 |
End bp | 7845 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641614907 |
Product | amino acid carrier protein |
Protein accession | YP_001742123 |
Protein GI | 170680075 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1115] Na+/alanine symporter |
TIGRFAM ID | [TIGR00835] amino acid carrier protein |
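The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon. Both assumptions fit the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(6415, 7845)
print(length, protein_length(length))  # prints: 1431 476
```

Because the gene is not spliced, no intron correction is needed and the strand does not affect the length calculation.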
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCAGATT TTTTCTCCTT TATTAACAGC GTCCTTTGGG GATCGGTAAT GATTTACCTG CTCTTCGGCG CAGGTTGTTG GTTCACTTTT CGCACCGGAT TTGTGCAGTT TCGCTACATC CGCCAGTTTG GCAAAAGTCT TAAAAATAGC ATTCATCCAC AGCCAGGCGG TTTAACCTCA TTTCAGTCAT TGTGTACCAG TCTTGCGGCG CGCGTGGGTA GCGGCAACCT GGCTGGCGTT GCGCTGGCTA TTACCGCCGG TGGACCAGGT GCCGTCTTCT GGATGTGGGT TGCCGCGTTT ATCGGCATGG CGACCTCGTT TGCCGAATGT TCCCTTGCAC AACTTTATAA AGAACGTGAC GTTAATGGGC AGTTTCGTGG CGGACCGGCA TGGTATATGG CGCGCGGGCT GGGGATGCGC TGGATGGGCG TTCTGTTCGC CCTCTTTTTG CTCATCGCCT ACGGCATAAT TTTCAGCGGA GTTCAGGCGA ACGCCGTTGC CCGAGCCCTG AGTTTTTCTT TTGATTTTCC TCCGCTGGTG ACAGGCATTA TTCTCGCTGT CTTTGCTCTG CTGGCAATCA CTCGCGGTCT TCATGGCGTC GCCCGGCTCA TGCAGGGGTT TGTCCCGTTG ATGGCGATAA TCTGGGTACT GACCAGCCTG GTAATTTGCG TAATGAATAT CGGGCAACTT CCCCACGTCA TTTGGTCTAT TTTTGAGAGT GCTTTTGGCT GGCAGGAAGC GGCAGGCGGC GCGGCGGGAT ATACCTTAAG CCAGGCGATT ACTAACGGTT TTCAGCGCAG TATGTTTTCC AATGAGGCGG GAATGGGGTC GACGCCAAAC GCGGCAGCGG CAGCGGCGTC CTGGCCTCCG CATCCCGCAG CGCAAGGAAT TGTCCAGATG ATTGGCATTT TTATCGACAC CCTGGTCATC TGTACGGCAA GCGCCATGCT GATATTACTG GCGGGTAACG GCACAACCTA CATGCCGCTG GAAGGTATTC AGCTTATCCA GAAGGCGATG CGGGTGTTAA TGGGTTCCTG GGGTGCTGAG TTTGTTACCC TCGTGGTTAT TCTGTTTGCC TTCAGCTCCA TCGTTGCCAA CTACATTTAT GCCGAAAACA ATCTCTTCTT TTTACGCCTG AACAACCCTA AAGCGATCTG GTGTCTGCGG ATCTGCACCT TCGCAACGGT CATCGGCGGC ACCTTGCTAA GTCTTCCGCT GATGTGGCAA CTGGCAGATA TCATAATGGC CTGCATGGCT ATTACCAATT TGACCGCCAT ATTACTGCTC TCGCCTGTGG TTCATACCAT TGCCAGTGAT TATCTACGCC AGCGTAAACT CGGCGTGCGC CCGGTGTTTG ATCCGTTGCG TTATCCGGAG ATCGGTCGCC AGCTTTCTCC AGACGCGTGG GATGACGTTT CGCAGGAGTA A
Protein sequence | MPDFFSFINS VLWGSVMIYL LFGAGCWFTF RTGFVQFRYI RQFGKSLKNS IHPQPGGLTS FQSLCTSLAA RVGSGNLAGV ALAITAGGPG AVFWMWVAAF IGMATSFAEC SLAQLYKERD VNGQFRGGPA WYMARGLGMR WMGVLFALFL LIAYGIIFSG VQANAVARAL SFSFDFPPLV TGIILAVFAL LAITRGLHGV ARLMQGFVPL MAIIWVLTSL VICVMNIGQL PHVIWSIFES AFGWQEAAGG AAGYTLSQAI TNGFQRSMFS NEAGMGSTPN AAAAAASWPP HPAAQGIVQM IGIFIDTLVI CTASAMLILL AGNGTTYMPL EGIQLIQKAM RVLMGSWGAE FVTLVVILFA FSSIVANYIY AENNLFFLRL NNPKAIWCLR ICTFATVIGG TLLSLPLMWQ LADIIMACMA ITNLTAILLL SPVVHTIASD YLRQRKLGVR PVFDPLRYPE IGRQLSPDAW DDVSQE
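The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, with a GC-percentage calculation and a stop-codon check. The helper names are hypothetical. The stop-codon set is the one used by translation table 11 (TAA, TAG, TGA). How the record rounds its "GC content" field is not stated, so no expected percentage is given here.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated sequence groups into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# Usage on the first two groups of the gene sequence shown above.
fragment = clean_sequence("ATGCCAGATT TTTTCTCCTT")
print(len(fragment), fragment.startswith("ATG"))
```

Applied to the full gene sequence, `len(clean_sequence(...))` should match the Gene Length field, and `ends_with_stop` should hold because the sequence ends in TAA.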