Gene ECH74115_1983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1983 
SymbolabgT 
ID6970902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1875421 
End bp1876953 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content50% 
IMG OID643385907 
Productputative aminobenzoyl-glutamate transporter 
Protein accessionYP_002270396 
Protein GI209399301 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2978] Putative p-aminobenzoyl-glutamate transporter 
TIGRFAM ID[TIGR00819] p-Aminobenzoyl-glutamate transporter family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.328307 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATGA GTATGTCATC CATACCGTCG TCCTCCCAAT CCGGGAAGCT CTATGGCTGG 
GTCGAAAGAA TTGGTAACAA GGTTCCCCAT CCTTTTTTGC TCTTTATCTA TTTGATTATC
GTACTCATGG TGACGACGGC AATTTTGTCG GCCTTTGGCG TCAGTGCGAA AAACCCGACC
GATGGTACGC CGGTGGTGGT GAAAAACCTG CTCAGTGTGG AAGGATTACA CTGGTTTTTA
CCCAATGTTA TTAAAAACTT TAGCGGTTTT GCTCCACTTG GTGCGATCCT GGCGCTGGTT
TTAGGTGCCG GTCTGGCGGA GCGCGTCGGC TTACTGCCAG CACTAATGGT TAAAATGGCA
TCGCATGTTA ATGCCCGCTA CGCCAGTTAT ATGGTGCTGT TTATTGCTTT TTTCAGCCAC
ATTTCTTCCG ATGCGGCGTT AGTGATCATG CCACCGATGG GTGCGCTGAT TTTTCTGGCG
GTGGGCAGGC ATCCAGTTGC AGGTTTACTG GCTGCCATTG CAGGCGTAGG TTGCGGCTTT
ACGGCTAATT TACTGATTGT CACAACCGAC GTGTTGCTGT CGGGGATCAG CACGGAAGCG
GCAGCTGCGT TCAATCCGCA AATGCACGTC AGTGTAATTG ATAACTGGTA TTTTATGGCC
AGCTCCGTAG TCGTACTGAC GATTGTTGGC GGCCTGATAA CCGACAAAAT CATCGAGCCA
CGGTTAGGTC AATGGCAGGG AAACAGCGAT GAGAAACTGC AGACATTGAC CGAAAGTCAG
CGTTTTGGTT TACGCATAGC AGGTGTCGTA TCGCTACTTT TTATTGCTGC GATTGCGCTG
ATGGTGATCC CGGAAAACGG GATATTGCGC GATCCGATTA ATCACACCGT GATGCCATCA
CCCTTTATTA AAGGTATCGT GCCACTGATC ATTCTTTTTT TCTTTGTTGT CTCGCTGGCT
TATGGCATCG CTACCCGCAC AATTCGACGT CAGGCGGATT TACCGCATTT AATGATTGAA
CCGATGAAAG AGATGGCGGG ATTTATCGTG ATGGTTTTTC CCCTCGCCCA GTTTGTCGCC
ATGTTTAACT GGAGCAACAT GGGGAAATTC ATCGCCGTGG GGCTGACCGA TATCCTGGAA
AGTTCAGGGC TTAGCGGCAT CCCGGCGTTT GTCGGTCTGG CGTTGCTTTC CTCTTTCTTA
TGCATGTTTA TCGCCAGCGG TTCCGCAATC TGGTCGATTC TGGCCCCCAT TTTCGTACCA
ATGTTTATGC TACTTGGCTT TCACCCGGCA TTTGCGCAAA TCCTCTTTCG TATTGCCGAC
TCATCCGTAT TGCCTTTAGC GCCAGTATCT CCTTTTGTTC CACTGTTTCT TGGATTCCTG
CAACGCTACA AACCAGACGC GAAACTGGGT ACTTACTATT CGTTAGTCTT GCCCTATCCG
CTTATCTTTT TGGTGGTATG GCTGCTGATG TTGCTGGCGT GGTATCTTGT GGGCCTGCCG
ATAGGTCCGG GTATTTACCC ACGTTTGTCT TAA
 
Protein sequence
MPMSMSSIPS SSQSGKLYGW VERIGNKVPH PFLLFIYLII VLMVTTAILS AFGVSAKNPT 
DGTPVVVKNL LSVEGLHWFL PNVIKNFSGF APLGAILALV LGAGLAERVG LLPALMVKMA
SHVNARYASY MVLFIAFFSH ISSDAALVIM PPMGALIFLA VGRHPVAGLL AAIAGVGCGF
TANLLIVTTD VLLSGISTEA AAAFNPQMHV SVIDNWYFMA SSVVVLTIVG GLITDKIIEP
RLGQWQGNSD EKLQTLTESQ RFGLRIAGVV SLLFIAAIAL MVIPENGILR DPINHTVMPS
PFIKGIVPLI ILFFFVVSLA YGIATRTIRR QADLPHLMIE PMKEMAGFIV MVFPLAQFVA
MFNWSNMGKF IAVGLTDILE SSGLSGIPAF VGLALLSSFL CMFIASGSAI WSILAPIFVP
MFMLLGFHPA FAQILFRIAD SSVLPLAPVS PFVPLFLGFL QRYKPDAKLG TYYSLVLPYP
LIFLVVWLLM LLAWYLVGLP IGPGIYPRLS