Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1786 |
Symbol | abgT |
ID | 6146842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1804675 |
End bp | 1806207 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641616662 |
Product | putative aminobenzoyl-glutamate transporter |
Protein accession | YP_001743840 |
Protein GI | 170683317 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2978] Putative p-aminobenzoyl-glutamate transporter |
TIGRFAM ID | [TIGR00819] p-Aminobenzoyl-glutamate transporter family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCATGA GTATGTCATC CATACCGTCG TCCTCCCAAT CCGGGAAGCT CTATGGCTGG GTCGAAAGAA TTGGTAACAA GGTTCCCCAT CCTTTTTTGC TCTTTATCTA TTTGATTATC GTACTCATGG TGACGACGGC AATTTTGTCA GCCTTTGGCG TCAGTGCGAA AAACCCGGCC GATGGTACGC CGGTGGTGGT GAAAAACCTG CTCAGTGTGG AAGGATTACA CTGGTTTTTA CCCAATGTTA TTAAAAACTT TAGCGGTTTT GCTCCACTTG GTGCGATCCT GGCGCTGGTT TTAGGTGCCG GTCTGGCGGA GCGCGTAGGC TTACTGCCAG CGCTAATGGT TAAAATGGCA TCGCATGTTA ATGCCCGCTA CGCCAGTTAT ATGGTGCTGT TTATTGCTTT TTTCAGCCAC ATTTCTTCCG ATGCGGCGTT AGTGATCATG CCACCGATGG GTGCGCTAAT TTTTCTGGCG GTGGGCAGGC ATCCAGTTGC AGGTTTACTG GCTGCCATTG CAGGCGTAGG TTGCGGCTTT ACGGCTAATT TACTGATTGT CACAACCGAC GTGTTGCTGT CGGGGATCAG CACGGAGGCT GCGGCTGCGT TCAATCCGCA AATGCACGTC AGTGTAATTG ATAACTGGTA TTTTATGGCC AGCTCCGTAG TCGTACTGAC GATTGTTGGC GGCCTGATAA CCGACAAAAT CATTGAGCCA CGTTTAGGTC AATGGCAGGG AAACAGCGAT GAGAAACTGC AGACATTGAC CGAAAGTCAG CGTTTTGGTT TACGCATAGC AGGTGTCGTA TCGCTACTTT TTATTGCTGC GATTGCGCTG ATGGTGATCC CGGAAAACGG GATATTGCGC GATCCGATTA ATCACACCGT GATGCCATCA CCCTTTATTA AAGGTATCGT GCCACTGATC ATTCTTTTTT TCTTTGTGGT CTCGCTGGCT TATGGCATCG CTACCCGCAC AATTCGACGT CAGGCGGATT TACCGCATTT AATGATTGAA CCGATGAAAG AGATGGCGGG ATTTATCGTG ATGGTTTTTC CCCTCGCCCA GTTTGTCGCC ATGTTTAACT GGAGCAACAT GGGGAAATTC ATCGCCGTGG GGCTGACCGA TATACTGGAA AGTTCAGGGC TTAGCGGCAT CCCGGCGTTT GTCGGTCTGG CGTTGCTTTC CTCTTTCTTA TGCATGTTTA TCGCCAGCGG TTCCGCAATC TGGTCGATTC TGGCCCCCAT TTTCGTACCG ATGTTTATGC TACTTGGCTT TCACCCGGCA TTTGCGCAAA TCCTCTTTCG TATTGCCGAC TCATCCGTAT TGCCTTTAGC GCCAGTATCT CCTTTTGTTC CACTGTTTCT TGGATTCCTG CAACGCTACA AACCAGACGC GAAACTGGGT ACTTACTATT CGTTAGTTTT GCCCTATCCA CTTATCTTTT TGGTGGTATG GCTGCTGATG TTGCTGGCGT GGTATCTTGT TGGTCTGCCG ATAGGTCCGG GGATTTACCC ACGTTTGTCT TAA
|
Protein sequence | MPMSMSSIPS SSQSGKLYGW VERIGNKVPH PFLLFIYLII VLMVTTAILS AFGVSAKNPA DGTPVVVKNL LSVEGLHWFL PNVIKNFSGF APLGAILALV LGAGLAERVG LLPALMVKMA SHVNARYASY MVLFIAFFSH ISSDAALVIM PPMGALIFLA VGRHPVAGLL AAIAGVGCGF TANLLIVTTD VLLSGISTEA AAAFNPQMHV SVIDNWYFMA SSVVVLTIVG GLITDKIIEP RLGQWQGNSD EKLQTLTESQ RFGLRIAGVV SLLFIAAIAL MVIPENGILR DPINHTVMPS PFIKGIVPLI ILFFFVVSLA YGIATRTIRR QADLPHLMIE PMKEMAGFIV MVFPLAQFVA MFNWSNMGKF IAVGLTDILE SSGLSGIPAF VGLALLSSFL CMFIASGSAI WSILAPIFVP MFMLLGFHPA FAQILFRIAD SSVLPLAPVS PFVPLFLGFL QRYKPDAKLG TYYSLVLPYP LIFLVVWLLM LLAWYLVGLP IGPGIYPRLS
|
| |