Gene EcSMS35_1786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1786 
SymbolabgT 
ID6146842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1804675 
End bp1806207 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content49% 
IMG OID641616662 
Productputative aminobenzoyl-glutamate transporter 
Protein accessionYP_001743840 
Protein GI170683317 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2978] Putative p-aminobenzoyl-glutamate transporter 
TIGRFAM ID[TIGR00819] p-Aminobenzoyl-glutamate transporter family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATGA GTATGTCATC CATACCGTCG TCCTCCCAAT CCGGGAAGCT CTATGGCTGG 
GTCGAAAGAA TTGGTAACAA GGTTCCCCAT CCTTTTTTGC TCTTTATCTA TTTGATTATC
GTACTCATGG TGACGACGGC AATTTTGTCA GCCTTTGGCG TCAGTGCGAA AAACCCGGCC
GATGGTACGC CGGTGGTGGT GAAAAACCTG CTCAGTGTGG AAGGATTACA CTGGTTTTTA
CCCAATGTTA TTAAAAACTT TAGCGGTTTT GCTCCACTTG GTGCGATCCT GGCGCTGGTT
TTAGGTGCCG GTCTGGCGGA GCGCGTAGGC TTACTGCCAG CGCTAATGGT TAAAATGGCA
TCGCATGTTA ATGCCCGCTA CGCCAGTTAT ATGGTGCTGT TTATTGCTTT TTTCAGCCAC
ATTTCTTCCG ATGCGGCGTT AGTGATCATG CCACCGATGG GTGCGCTAAT TTTTCTGGCG
GTGGGCAGGC ATCCAGTTGC AGGTTTACTG GCTGCCATTG CAGGCGTAGG TTGCGGCTTT
ACGGCTAATT TACTGATTGT CACAACCGAC GTGTTGCTGT CGGGGATCAG CACGGAGGCT
GCGGCTGCGT TCAATCCGCA AATGCACGTC AGTGTAATTG ATAACTGGTA TTTTATGGCC
AGCTCCGTAG TCGTACTGAC GATTGTTGGC GGCCTGATAA CCGACAAAAT CATTGAGCCA
CGTTTAGGTC AATGGCAGGG AAACAGCGAT GAGAAACTGC AGACATTGAC CGAAAGTCAG
CGTTTTGGTT TACGCATAGC AGGTGTCGTA TCGCTACTTT TTATTGCTGC GATTGCGCTG
ATGGTGATCC CGGAAAACGG GATATTGCGC GATCCGATTA ATCACACCGT GATGCCATCA
CCCTTTATTA AAGGTATCGT GCCACTGATC ATTCTTTTTT TCTTTGTGGT CTCGCTGGCT
TATGGCATCG CTACCCGCAC AATTCGACGT CAGGCGGATT TACCGCATTT AATGATTGAA
CCGATGAAAG AGATGGCGGG ATTTATCGTG ATGGTTTTTC CCCTCGCCCA GTTTGTCGCC
ATGTTTAACT GGAGCAACAT GGGGAAATTC ATCGCCGTGG GGCTGACCGA TATACTGGAA
AGTTCAGGGC TTAGCGGCAT CCCGGCGTTT GTCGGTCTGG CGTTGCTTTC CTCTTTCTTA
TGCATGTTTA TCGCCAGCGG TTCCGCAATC TGGTCGATTC TGGCCCCCAT TTTCGTACCG
ATGTTTATGC TACTTGGCTT TCACCCGGCA TTTGCGCAAA TCCTCTTTCG TATTGCCGAC
TCATCCGTAT TGCCTTTAGC GCCAGTATCT CCTTTTGTTC CACTGTTTCT TGGATTCCTG
CAACGCTACA AACCAGACGC GAAACTGGGT ACTTACTATT CGTTAGTTTT GCCCTATCCA
CTTATCTTTT TGGTGGTATG GCTGCTGATG TTGCTGGCGT GGTATCTTGT TGGTCTGCCG
ATAGGTCCGG GGATTTACCC ACGTTTGTCT TAA
 
Protein sequence
MPMSMSSIPS SSQSGKLYGW VERIGNKVPH PFLLFIYLII VLMVTTAILS AFGVSAKNPA 
DGTPVVVKNL LSVEGLHWFL PNVIKNFSGF APLGAILALV LGAGLAERVG LLPALMVKMA
SHVNARYASY MVLFIAFFSH ISSDAALVIM PPMGALIFLA VGRHPVAGLL AAIAGVGCGF
TANLLIVTTD VLLSGISTEA AAAFNPQMHV SVIDNWYFMA SSVVVLTIVG GLITDKIIEP
RLGQWQGNSD EKLQTLTESQ RFGLRIAGVV SLLFIAAIAL MVIPENGILR DPINHTVMPS
PFIKGIVPLI ILFFFVVSLA YGIATRTIRR QADLPHLMIE PMKEMAGFIV MVFPLAQFVA
MFNWSNMGKF IAVGLTDILE SSGLSGIPAF VGLALLSSFL CMFIASGSAI WSILAPIFVP
MFMLLGFHPA FAQILFRIAD SSVLPLAPVS PFVPLFLGFL QRYKPDAKLG TYYSLVLPYP
LIFLVVWLLM LLAWYLVGLP IGPGIYPRLS