Gene ECH74115_4669 details


Gene Information

Locus tag: ECH74115_4669
Symbol: argD
ID: 6968431
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4311946
End bp: 4313166
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 54%
IMG OID: 643388373
Product: bifunctional N-succinyldiaminopimelate-aminotransferase/acetylornithine transaminase protein
Protein accession: YP_002272801
Protein GI: 209396810
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4992] Ornithine/acetylornithine aminotransferase
TIGRFAM ID: [TIGR00707] acetylornithine and succinylornithine aminotransferases
            [TIGR03246] succinylornithine transaminase family
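
The reported lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the reported gene length is consistent with it) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 4311946
END_BP = 4313166


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based interval."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1221 406
```

Both values agree with the Gene Length and Protein Length fields of the record.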


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.160127
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.232002
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATTG AACAAACAGC AATTACACGC GCGACTTTCG ATGAAGTGAT CCTGCCGATT 
TATGCTCCGG CAGAGTTCAT TCCGGTAAAA GGTCAGGGCA GCCGAATCTG GGATCAGCAA
GGCAAGGAGT ATGTCGATTT CGCGGGTGGC ATTGCAGTTA CGGCGTTGGG CCATTGCCAT
CCTGCGCTGG TGAACGCGTT AAAAACCCAG GGCGAAACTC TGTGGCATAT CAGTAACGTT
TTCACCAATG AACCGGCGCT GCGTCTTGGG CGTAAACTGA TTGAGGCGAC GTTTGCCGAA
CGCGTGGTGT TTATGAACTC CGGCACGGAA GCTAACGAAA CCGCCTTTAA ACTGGCACGC
CATTACGCCT GCGTGCGTCA TAGCCCGTTC AAAACCAAAA TTATTGCCTT CCATAACGCT
TTTCATGGTC GCTCGCTGTT TACCGTTTCG GTGGGTGGGC AGCCAAAATA TTCCGACGGC
TTTGGGCCAA AACCATCAGA CATCATCCAC GTTCCCTTTA ACGATCTCCA CGCAGTGAAA
GCGGTGATGG ATGATCACAC CTGTGCGGTG GTGGTTGAGC CGATCCAGGG CGAGGGCGGT
GTGACGGCAG CGACGCCAGA GTTTTTGCAG GGCTTGCGTG AGTTGTGCGA TCAACATCAG
GCATTATTGG TGTTTGATGA AGTACAGTGC GGGATGGGGC GGACTGGCGA TTTGTTTGCT
TACATGCACT ACGGCGTGAC GCCGGATATT CTGACTTCTG CGAAAGCGTT AGGCGGCGGC
TTCCCGATTA GCGCCATGCT GACCACGGCG GAAATTGCTT CTGCGTTTCA TCCTGGTTCT
CACGGTTCCA CCTACGGCGG TAATCCTCTG GCCTGTGCAG TAGCGGGCGC GGCGTTTGAT
ATCATTAATA CCCCTGAAGT GCTGGAAGGC ATTCAGGCGA AACGCCAGCG TTTTGTTGAC
CATTTGCAGA AGATCGATCA GCAGTACGAT GTGTTTAGCG ATATTCGCGG TATGGGGCTG
TTAATTGGCG CGGAGCTGAA ACCACAGTAC AAAGGTCAGG CGCGTGATTT CCTGTATGCG
GGCGCAGAGG CTGGCGTAAT GGTGCTGAAT GCCGGACCGG ATGTGATGCG TTTTGCGCCG
TCGCTGGTGG TGGAAGATGC GGATATCGAT GAAGGGATGC AACGTTTCGC CCACGCGGTG
GCGAAGGTGG TTGGGGCGTA A
 
Protein sequence
MAIEQTAITR ATFDEVILPI YAPAEFIPVK GQGSRIWDQQ GKEYVDFAGG IAVTALGHCH 
PALVNALKTQ GETLWHISNV FTNEPALRLG RKLIEATFAE RVVFMNSGTE ANETAFKLAR
HYACVRHSPF KTKIIAFHNA FHGRSLFTVS VGGQPKYSDG FGPKPSDIIH VPFNDLHAVK
AVMDDHTCAV VVEPIQGEGG VTAATPEFLQ GLRELCDQHQ ALLVFDEVQC GMGRTGDLFA
YMHYGVTPDI LTSAKALGGG FPISAMLTTA EIASAFHPGS HGSTYGGNPL ACAVAGAAFD
IINTPEVLEG IQAKRQRFVD HLQKIDQQYD VFSDIRGMGL LIGAELKPQY KGQARDFLYA
GAEAGVMVLN AGPDVMRFAP SLVVEDADID EGMQRFAHAV AKVVGA
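
The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code for internal codons (table 11 differs mainly in the start codons it permits; this gene starts with ATG, so that difference does not matter here). The helper names `clean` and `translate` are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
# Assumption: internal codons of translation table 11 follow the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGGCAATTG AACAAACAGC AATTACACGC GCGACTTTCG ATGAAGTGAT CCTGCCGATT"
last_line = "GCGAAGGTGG TTGGGGCGTA A"

print(translate(first_line))  # MAIEQTAITRATFDEVILPI
print(translate(last_line))   # AKVVGA
```

The two outputs match the first 20 and last 6 residues of the protein sequence; the final TAA codon is the stop and is not part of the 406 aa product.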