Gene EcE24377A_2552 details


Gene Information

Locus tag: EcE24377A_2552
Symbol: arnT
ID: 5589233
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 2545765
End bp: 2547417
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 51%
IMG OID: 640926210
Product: 4-amino-4-deoxy-L-arabinose transferase
Protein accession: YP_001463604
Protein GI: 157154965
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
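
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal Python sketch of that check, not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes one stop codon; both assumptions are consistent with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2545765, 2547417)
print(length_bp)                  # 1653, matching "Gene Length"
print(protein_length(length_bp))  # 550, matching "Protein Length"
```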


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATCGG TACGTTACCT TATCGGCCTC TTCGCGTTTA TTGCCTGCTA TTACCTGTTA 
CCGATCAGCA CGCGTCTGCT CTGGCAACCC GATGAAACGC GTTATGCGGA AATCAGTCGA
GAAATGCTGG CATCCGGCGA CTGGATTGTG CCCCATCTGT TAGGGCTACG TTATTTCGAA
AAACCCATTG CCGGATACTG GATTAACAGC ATTGGGCAAT GGCTATTTGG CGCGAATAAC
TTTGGTGTGC GGGCAGGCGT TATCTTTGCG ACCCTGTTAA CTACCGCGCT GGTGACCTGG
TTTACTCTGC GCTTATGGCG CGATAAACGT CTGGCTCTAC TCGCCACAGT AATTTATCTC
TCATTGTTTA TTGTCTATGC CATCGGCACT TATGCCGTGC TCGATCCGTT TATTGCATTC
TGGCTGGTGG CGGGAATGTG CAGCTTCTGG CTGGCAATGC AGGCACAGAC GTGGAAAGGC
AAAAGCGCAG GATTTTTACT GCTGGGAATC ACCTGCGGCA TGGGGGTGAT GACCAAAGGT
TTTCTCGCCC TTGCCGTGCC GGTATTAAGC GTGCTGCCAT GGGTAGCGAC GCAAAAACGC
TGGAAAGATC TCTTTATTTA CGGCTGGCTG GCGGTTATCA GTTGCGTACT GACGGTTCTC
CCTTGGGGAC TGGCGATAGC GCAGCGGGAG CCTGACTTCT GGCACTATTT TTTCTGGGTT
GAGCATATTC AACGCTTTGC ACTGGATGAT GCCCAACATA GAGCTCCGTT CTGGTACTAC
GTGCCGGTCA TCATTGCCGG TAGCCTGCCG TGGCTGGGAT TACTCCCCGG TGCACTGTAC
ACAGGCTGGA AAAACCGCAA GCATTCCGCA ACCGTCTATT TGTTGAGCTG GACGATAATG
CCGCTGCTGT TTTTCTCCGT CGCTAAAGGT AAATTGCCCA CCTATATTCT TTCCTGCTTT
GCATCTCTGG CAATGCTGAT GGCGCATTAC GCTTTGCTGG CAGCAAAAAA TAATCCTCTG
GCGCTGCGGA TTAATGGCTG GATTAACATC GCTTTTGGCG TCACTGGCAT TATTGCCACA
TTTGTGGTCT CCCCGTGGGG ACCAATGAAC ACGCCGGTGT GGCAAACCTT CGAGAGCTAT
AAAGTCTTTT GTGCCTGGTC GATTTTTTCG CTATGGGCAT TTTTCGGCTG GTACACCTTA
ACAAACGTCG AAAAGACCTG GCCTTTTGCC GCGCTTTGCC CGCTGGGGCT GGCGTTGCTG
GTAGGATTTT CAATTCCTGA CAGAGTTATG GAAGGAAAAC ATCCGCAATT TTTTGTCGAG
ATGACACAAG AATCACTTCA GCCAAGCCGC TATATTCTTA CTGATAGCGT CGGTGTTGCC
GCAGGTCTGG CATGGAGCCT GCAACGCGAT GACATCATCA TGTATCGCCA GACAGGTGAG
TTGAAATACG GCCTTAATTA TCCGGATGCG AAAGGGAGAT TTGTCAGCGG TGATGAGTTC
GCAAACTGGC TTAATCAACA TCGTCAGGAG GGGATTATTA CTCTCGTGCT TTCGGTTGAC
CGCGATGAAG ATATCAACAG TCTCGCCATT CCGCCCGCAG ATGCCATCGA TCGTCAGGAG
CGTCTGGTGC TGATTCAGTA TCGTCCCAAA TGA
 
Protein sequence
MKSVRYLIGL FAFIACYYLL PISTRLLWQP DETRYAEISR EMLASGDWIV PHLLGLRYFE 
KPIAGYWINS IGQWLFGANN FGVRAGVIFA TLLTTALVTW FTLRLWRDKR LALLATVIYL
SLFIVYAIGT YAVLDPFIAF WLVAGMCSFW LAMQAQTWKG KSAGFLLLGI TCGMGVMTKG
FLALAVPVLS VLPWVATQKR WKDLFIYGWL AVISCVLTVL PWGLAIAQRE PDFWHYFFWV
EHIQRFALDD AQHRAPFWYY VPVIIAGSLP WLGLLPGALY TGWKNRKHSA TVYLLSWTIM
PLLFFSVAKG KLPTYILSCF ASLAMLMAHY ALLAAKNNPL ALRINGWINI AFGVTGIIAT
FVVSPWGPMN TPVWQTFESY KVFCAWSIFS LWAFFGWYTL TNVEKTWPFA ALCPLGLALL
VGFSIPDRVM EGKHPQFFVE MTQESLQPSR YILTDSVGVA AGLAWSLQRD DIIMYRQTGE
LKYGLNYPDA KGRFVSGDEF ANWLNQHRQE GIITLVLSVD RDEDINSLAI PPADAIDRQE
RLVLIQYRPK
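
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal Python sketch of that translation, not the database's own method. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest);
# "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last blocks of the gene sequence listed above.
print(translate("ATGAAATCGG TACGTTACCT TATCGGCCTC"))      # MKSVRYLIGL
print(translate("CGTCTGGTGC TGATTCAGTA TCGTCCCAAA TGA"))  # RLVLIQYRPK
```

Whitespace is stripped before translation, so the 10-base blocks can be pasted in exactly as listed.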