Gene ECH74115_3398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3398 
SymbolarnT 
ID6970723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3142783 
End bp3144435 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content51% 
IMG OID643387206 
Product4-amino-4-deoxy-L-arabinose transferase 
Protein accessionYP_002271669 
Protein GI209400163 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCGG TACGTTACCT TATCGGCCTC TTCGCGTTTA TTGCCTGCTA TTACCTGTTA 
CCGATCAGCA CGCGTCTGCT ATGGCAACCC GATGAAACGC GTTATGCGGA AATCAGTCGA
GAAATGCTGG CATCCGGCGA CTGGATTGTG CCCCATCTGT TAGGGCTACG TTATTTTGAA
AAACCCATTG CCGGATACTG GATTAACAGC ATTGGGCAAT GGCTATTTGG CGCGAATAAC
TTTGGTGTGC GGGCAGGCGT TATCTTTGCG ACCCTGTTAA CTGCCGCGCT GGTGACCTGG
TTTACTCTGC GCTTATGGCG CGATAAACGT CTGGCTCTAC TCGCCACAGT AATTTATCTC
TCATTGTTTA TTGTCTATGC CATCGGCACT TATGCCGTGC TCGATCCGTT TATTGCCTTC
TGGCTGGTGG CGGGAATGTG CAGCTTCTGG CTGGCAATGC AGGCACAGAC GTGGAAAGGC
AAAAGCGCAG GATTTTTACT GCTGGGAATC ACCTGCGGCA TGGGGGTGAT GACCAAAGGT
TTTCTCGCCC TTGCCGTGCC GGTATTAAGC GTGCTGCCAT GGGTAGCGAC GCAAAAACGC
TGGAAAGATC TCTTTATTTA CGGCTGGCTG GCGGTCATCA GTTGCGTACT GACGGTTCTC
CCCTGGGGAC TGACGATAGC GCAGCGGGAG CCTGACTTCT GGCACTATTT TTTCTGGGTT
GAGCATATTC AACGCTTTGC ACTGGATGAT GCCCAACATA GAGCTCCGTT CTGGTACTAC
GTGCCGGTCA TCATTGCCGG TAGTCTGCCG TGGCTGGGAT TACTCCCCGG TGCACTATAC
ACAGGCTGGA AAAACCGCAA GCATTCCGCA ACCGTCTATT TGTTGAGCTG GACGATAATG
CCGCTGCTGT TTTTCTCCGT TGCTAAGGGT AAATTGCCCA CCTATATTCT TTCCTGCTTT
GCACCTTTGG CAATGCTGAT GGCGCATTAC GCTTTGCTGG CAGCAAAAAA TAATCCTCTG
GCGCTGCGGA TTAATGGCTG GATTAACATC GCTTTTGGCG TCACTGGCAT TATTGCCACA
TTTGTGGTCT CCCCCTGGGG ACCAATGAAC ACGCCGGTGT GGCAAACCTT CGAGAGCTAT
AAAGTCTTTT GTGCCTGGTC GATTTTTTCG CTATGGGCAT TTTTCGGCTG GTACACCTTA
ACAAACGTCG AAAAGACCTG GTCTTTTGCC GCGCTTTGCC CGCTGGGGCT GGCGTTGCTG
GTAGGATTTT CAATTCCTGA CAGAGTCATG GAAGGAAAAC ATCCGCAATT TTTTGTCGAG
ATGACACAAG AATCACTTCA GCCAAGCCGC TATATTCTTA CTGATAGCGT CGGTGTTGCC
GCAGGTCTGG CATGGAGTCT GCAACGCGAT GACATCATCA TGTATCGCCA GACAGGTGAG
TTGAAATACG GCCTTAATTA TCCGGATGCG AAAGGGAGAT TTGTCAGCGG TGATGAGTTC
GCAAACTGGC TTAATCAACA TCGTCAGGAG GGGATTATTA CACTCGTGCT TTCGGTTGAC
CGCGATGAAG ATATCAACAG TCTCGCCATT CCGCCCGCAG ATGCCATCGA TCGTCAGGAG
CGTCTGGTGC TGATTCAGTA TCGTCCCAAA TGA
 
Protein sequence
MKSVRYLIGL FAFIACYYLL PISTRLLWQP DETRYAEISR EMLASGDWIV PHLLGLRYFE 
KPIAGYWINS IGQWLFGANN FGVRAGVIFA TLLTAALVTW FTLRLWRDKR LALLATVIYL
SLFIVYAIGT YAVLDPFIAF WLVAGMCSFW LAMQAQTWKG KSAGFLLLGI TCGMGVMTKG
FLALAVPVLS VLPWVATQKR WKDLFIYGWL AVISCVLTVL PWGLTIAQRE PDFWHYFFWV
EHIQRFALDD AQHRAPFWYY VPVIIAGSLP WLGLLPGALY TGWKNRKHSA TVYLLSWTIM
PLLFFSVAKG KLPTYILSCF APLAMLMAHY ALLAAKNNPL ALRINGWINI AFGVTGIIAT
FVVSPWGPMN TPVWQTFESY KVFCAWSIFS LWAFFGWYTL TNVEKTWSFA ALCPLGLALL
VGFSIPDRVM EGKHPQFFVE MTQESLQPSR YILTDSVGVA AGLAWSLQRD DIIMYRQTGE
LKYGLNYPDA KGRFVSGDEF ANWLNQHRQE GIITLVLSVD RDEDINSLAI PPADAIDRQE
RLVLIQYRPK