Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2411 |
Symbol | arnT |
ID | 6146714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2460287 |
End bp | 2461939 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617284 |
Product | 4-amino-4-deoxy-L-arabinose transferase |
Protein accession | YP_001744456 |
Protein GI | 170681813 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCGG TACGTTACCT TATCGGCCTC TTCGCGTTTA TTGCCTGCTA TTACCTGTTA CCGATCAGCA CGCGTCTGCT ATGGCAACCC GATGAAACGC GTTATGCGGA AATCAGTCGA GAAATGCTGG CATCCGGCGA CTGGATTGTG CCCCATCTGT TAGGGCTACG TTATTTCGAA AAACCCATTG CTGGATACTG GATTAACAGC ATTGGGCAAT GGCTATTTGG CGCGAATAAC TTTGGTGTGC GGGCAGGCGT TATCTTTGCG ACCCTGTTAA CTGCCGCGCT TGTGACCTGG TTTACTCTGC GCTTATGGCG CGATAAACGT CTGGCATTAC TCGCCGCAGT GATTTACCTC TCATTGTTCA TTGTCTATGC CATCGGCACT TATGCCGTGC TCGATCCGTT TATTGCCTTC TGGCTGGTGG CGGGAATGTG CAGCTTCTGG CTGGCAATGC AGGCACAGAC GTGGAAAGGC AAAAGCGCGG GATTTTTACT GCTGGGAATC ACCTGCGGCA TGGGGGTGAT GACCAAAGGT TTTCTCGCCC TTGCCGTGCC GGTATTAAGC GTGCTTCCAT GGGTTGCAAC GCAAAAACGC TGGAAAGATC TCTTTATTTA CGGCTGGCTG GCGGTTATCA GTTGCGTACT GACGGTTCTC CCCTGGGGAC TGGCGATAGC GCAGCGGGAG CCTGACTTCT GGCACTATTT TTTCTGGGTT GAGCATATTC AACGCTTTGC AATGGATGAT GCTCAGCATA GAGCTCCGTT CTGGTACTAC TTGCCGGTCA TCATTGCCGG TAGCCTGCCG TGGCTGGGAT TACTCCCCGG AGCACTGTAC GCAGGCTGGA AAAACCGCAA GCATTCCGCA ACCGTCTATT TGTTGAGCTG GACGATAATG CCGCTGCTGT TTTTCTCCGT CGCTAAAGGC AAATTGCCCA CCTATATTCT TTCCTGCTTT GCACCTCTGG CAATGCTGAT GGCGCATTAC GCTTTGCTGG CAGCAAAAAA TAATCCTCTG GCGCTGCGGA TTAATGGCTG GATTAACATC GCTTTTGGTG TCACTGGCAT TATTGCTACG TTTGTGGTCT CCCCATGGGG ACCAATGAAC ACGCCAGTGT GGCAAACCTT CGAGAGCTAT AAAGTCTTTT GTGCCTGGTC GATTTTTTCG CTATGGGCAT TTTTCGGCTG GTACACCTTA ACAAACGTCG AAAAGACCTG GTCTTTTGCC GCGCTTTGCC CGCTGGGGCT GGCGTTGCTG GTAGGATTTT CAATTCCTGA CAGAGTCATG GAAGGAAAAC ATCCGCAATT TTTTGTCGAG ATGACACAAG AATCACTGCA GCCAAGCCGC TATATTCTTA CTGACAGCGT GGGCGTTGCC GCAGGTCTGG CATGGAGCCT GCAACGCGAT GACATCATCA TGTATCGCCA GACGGGTGAG TTGAAATACG GCCTTAATTA TCCGGATGCG AAAGGGAGGT TTGTCAGCGG TGATGAGTTC GCAAACTGGC TTAATCAACA TCGTCAGGAG GGGATTATTA CACTCGTGCT TTCGGTTGAC CGCGATGAAG ATATCAACAG TCTCGCCATT CCGCCCGCAG ATGCCATCGA TCGTCAGGAG CGTCTGGTGC TGATTCAGTA TCGTCCCAAA TGA
|
Protein sequence | MKSVRYLIGL FAFIACYYLL PISTRLLWQP DETRYAEISR EMLASGDWIV PHLLGLRYFE KPIAGYWINS IGQWLFGANN FGVRAGVIFA TLLTAALVTW FTLRLWRDKR LALLAAVIYL SLFIVYAIGT YAVLDPFIAF WLVAGMCSFW LAMQAQTWKG KSAGFLLLGI TCGMGVMTKG FLALAVPVLS VLPWVATQKR WKDLFIYGWL AVISCVLTVL PWGLAIAQRE PDFWHYFFWV EHIQRFAMDD AQHRAPFWYY LPVIIAGSLP WLGLLPGALY AGWKNRKHSA TVYLLSWTIM PLLFFSVAKG KLPTYILSCF APLAMLMAHY ALLAAKNNPL ALRINGWINI AFGVTGIIAT FVVSPWGPMN TPVWQTFESY KVFCAWSIFS LWAFFGWYTL TNVEKTWSFA ALCPLGLALL VGFSIPDRVM EGKHPQFFVE MTQESLQPSR YILTDSVGVA AGLAWSLQRD DIIMYRQTGE LKYGLNYPDA KGRFVSGDEF ANWLNQHRQE GIITLVLSVD RDEDINSLAI PPADAIDRQE RLVLIQYRPK
|
| |