Gene EcHS_A2402 details

Gene Information

Locus tag: EcHS_A2402
Symbol: arnT
ID: 5594470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2411695
End bp: 2413347
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 51%
IMG OID: 640921527
Product: 4-amino-4-deoxy-L-arabinose transferase
Protein accession: YP_001459061
Protein GI: 157161743
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
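
The length fields in this record are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions (an assumption, not stated on the page): End - Start + 1 gives 1653 bp, and 1653 / 3 = 551 codons, of which the last is the stop codon, leaving 550 aa. A minimal Python sketch of that check follows; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2411695, 2413347)
print(length, protein_length(length))  # 1653 550
```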


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value: 0.770556
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATCGG TACGTTACCT TATCGGCCTC TTCGCGTTTA TTGCCTGCTA TTACCTGTTA 
CCGATCAGCA CGCGTCTGCT CTGGCAACCC GATGAAACGC GTTATGCGGA AATCAGTCGA
GAAATGCTGG CATCCGGCGA CTGGATTGTG CCCCATCTGT TAGGGCTACG TTATTTCGAA
AAACCCATTG CCGGATACTG GATTAACAGC ATTGGGCAAT GGCTATTTGG CGCGAATAAC
TTTGGTGTGC GGGCAGGCGT TATCTTTGCG ACCCTGTTAA CTGCCGCGCT GGTGACCTGG
TTTACTCTGC GCTTATGGCG CGATAAACGT CTGGCTCTAC TCGCCACAGT AATTTATCTC
TCATTGTTTA TTGTCTATGC CATCGGCACT TATGCCGTGC TCGATCCGTT TATTGCATTC
TGGCTGGTGG CGGGAATGTG CAGCTTCTGG CTGGCAATGC AGGCACAGAC GTGGAGAGGC
AAAAGCGCAG GATTTTTACT GCTGGGAATC ACCTGCGGCA TGGGGGTGAT GACCAAAGGT
TTTCTCGCCC TTGCCGTGCC GGTATTAAGC GTGCTGCCAT GGGTAGCGAC GCAAAAACGC
TGGAAAGATC TCTTTATTTA CGGCTGGCTG GCGGTTATCA GTTGCGTACT GACGGTTCTC
CCTTGGGGAC TGGCGATAGC GCAGCGGGAG CCTAACTTCT GGCACTATTT TTTCTGGGTT
GAGCATATTC AACGCTTTGC ACTGGATGAT GCCCAACATA GAGCTCCGTT CTGGTACTAC
GTGCCGGTCA TCATTGCCGG TAGCCTGCCG TGGCTGGGAT TACTCCCCGG TGCACTGTAC
ACAGGCTGGA AAAACCGCAA GCATTCCGCA ACCGTCTATT TGTTGAGCTG GACGATAATG
CCGCTGCTGT TTTTCTCCGT CGCTAAAGGT AAATTGCCCA CCTATATTCT TTCCTGCTTT
GCATCTCTGG CAATGCTGAT GGCGCATTAC GCTTTGCTGG CAGCAAAAAA TAATCCTCTG
GCGCTGCGGA TTAATGGCTG GATTAACATC GCTTTTGGCG TCACTGGCAT TATTGCCACA
TTTGTGGTCT CCCCGTGGGG ACCAATGAAC ACGCCGGTGT GGCAAACCTT CGAGAGCTAT
AAAGTCTTTT GTGCCTGGTC GATTTTTTCG CTATGGGCAT TTTTCGGCTG GTACACCTTA
ACAAACGTCG AAAAGACCTG GCCTTTTGCC GCGCTTTGCC CGCTGGGGCT GGCGTTGCTG
GTAGGATTTT CAATTCCTGA CAGAGTTATG GAAGGAAAAC ATCCGCAATT TTTTGTCGAG
ATGACACAAG AATCACTTCA GCCAAGCCGC TATATTCTTA CTGATAGCGT CGGTGTTGCC
GCAGGTCTGG CATGGAGCCT GCAACGCGAT GACATCATCA TGTATCGCCA GACAGGTGAG
TTGAAATACG GCCTTAATTA TCCGGATGCG AAAGGGAGAT TTGTCAGCGG TGATGAGTTC
GCAAACTGGC TTAATCAACA TCGTCAGGAG GGGATTATTA CTCTCGTGCT TTCGGTTGAC
CGCGATGAAG ATATCAACAG TCTCGCCATT CCGCCCGCAG ATGCCATCGA TCGTCAGGAG
CGTCTGGTGC TGATTCAGTA TCGTCCCAAA TGA
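
The gene sequence is printed in space-separated blocks of 10 bases, so the spacing has to be stripped before the sequence is analysed. The sketch below, in Python, removes the whitespace and computes a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed value describes that line and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks from a formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only (sample input).
first_line = "ATGAAATCGG TACGTTACCT TATCGGCCTC TTCGCGTTTA TTGCCTGCTA TTACCTGTTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting the full gene sequence into `clean_sequence` in place of `first_line` would give the gene-wide figure to compare against the record.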
 
Protein sequence
MKSVRYLIGL FAFIACYYLL PISTRLLWQP DETRYAEISR EMLASGDWIV PHLLGLRYFE 
KPIAGYWINS IGQWLFGANN FGVRAGVIFA TLLTAALVTW FTLRLWRDKR LALLATVIYL
SLFIVYAIGT YAVLDPFIAF WLVAGMCSFW LAMQAQTWRG KSAGFLLLGI TCGMGVMTKG
FLALAVPVLS VLPWVATQKR WKDLFIYGWL AVISCVLTVL PWGLAIAQRE PNFWHYFFWV
EHIQRFALDD AQHRAPFWYY VPVIIAGSLP WLGLLPGALY TGWKNRKHSA TVYLLSWTIM
PLLFFSVAKG KLPTYILSCF ASLAMLMAHY ALLAAKNNPL ALRINGWINI AFGVTGIIAT
FVVSPWGPMN TPVWQTFESY KVFCAWSIFS LWAFFGWYTL TNVEKTWPFA ALCPLGLALL
VGFSIPDRVM EGKHPQFFVE MTQESLQPSR YILTDSVGVA AGLAWSLQRD DIIMYRQTGE
LKYGLNYPDA KGRFVSGDEF ANWLNQHRQE GIITLVLSVD RDEDINSLAI PPADAIDRQE
RLVLIQYRPK
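
The protein sequence corresponds to the gene sequence read under translation table 11 (the bacterial, archaeal and plant plastid code), whose amino acid assignments are the same as the standard code and which differs only in the start codons it permits. A minimal Python sketch of that translation is given below. The names `CODON_TABLE` and `translate` are illustrative, and the function does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the table 11 amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene, then its last four codons.
print(translate("ATGAAATCGG TACGTTACCT TATCGGCCTC"))  # MKSVRYLIGL
print(translate("CGTCCCAAA TGA"))                     # RPK
```

The two sample outputs match the first block and the last three residues of the protein sequence.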