Gene EcolC_1392 details

Gene Information

Locus tag: EcolC_1392
Symbol: arnT
ID: 6067949
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1523960
End bp: 1525612
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 51%
IMG OID: 641600812
Product: 4-amino-4-deoxy-L-arabinose transferase
Protein accession: YP_001724383
Protein GI: 170019429
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
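
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1523960
END_BP = 1525612


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1653
print(protein_length(length_bp))  # 550
```

Both results agree with the listed Gene Length (1653 bp) and Protein Length (550 aa).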


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCGG TACGTTACCT TATCGGCCTC TTCGCGTTTA TTGCCTGCTA TTACCTGTTA 
CCGATCAGCA CGCGTCTGCT CTGGCAACCC GATGAAACGC GTTATGCGGA AATCAGTCGA
GAAATGCTGG CATCCGGCGA CTGGATTGTG CCCCATCTGT TAGGGCTACG TTATTTCGAA
AAACCCATTG CCGGATACTG GATTAACAGC ATTGGGCAAT GGCTATTTGG CGCGAATAAC
TTTGGTGTGC GGGCAGGCGT TATCTTTGCG ACCCTGTTAA CTGCCGCGCT GGTGACCTGG
TTTACTCTGC GCTTATGGCG CGATAAACGT CTGGCTCTAC TCGCCACAGT AATTTATCTC
TCATTGTTTA TTGTCTATGC CATCGGCACT TATGCCGTGC TCGATCCGTT TATTGCATTC
TGGCTGGTGG CGGGAATGTG CAGCTTCTGG CTGGCAATGC AGGCACAGAC GTGGAAAGGC
AAAAGCGCAG GATTTTTACT GCTGGGAATC ACCTGCGGCA TGGGGGTGAT GACCAAAGGT
TTTCTCGCCC TTGCCGTGCC GGTATTAAGC GTGCTGCCAT GGGTAGCGAC GCAAAAACGC
TGGAAAGATC TCTTTATTTA CGGCTGGCTG GCGGTTATCA GTTGCGTACT GACGGTTCTC
CCTTGGGGAC TGGCGATAGC GCAGCGGGAG CCTAACTTCT GGCACTATTT TTTCTGGGTT
GAGCATATTC AACGCTTTGC ACTGGATGAT GCCCAACATA GAGCTCCGTT CTGGTACTAC
GTGCCGGTCA TCATTGCCGG TAGCCTGCCG TGGCTGGGAT TACTCCCCGG TGCACTGTAC
ACAGGCTGGA AAAACCGCAA GCATTCCGCA ACCGTCTATT TGTTGAGCTG GACGATAATG
CCGCTGCTGT TTTTCTCCGT CGCTAAAGGT AAATTGCCCA CCTATATTCT TTCCTGCTTT
GCATCTCTGG CAATGCTGAT GGCGCATTAC GCTTTGCTGG CAGCAAAAAA TAATCCTCTG
GCGCTGCGGA TTAATGGCTG GATTAACATC GCTTTTGGCG TCACTGGCAT TATTGCCACA
TTTGTGGTCT CCCCGTGGGG ACCAATGAAC ACGCCGGTGT GGCAAACCTT CGAGAGCTAT
AAAGTCTTTT GTGCCTGGTC GATTTTTTCG CTATGGGCAT TTTTCGGCTG GTACACCTTA
ACAAACGTCG AAAAGACCTG GCCTTTTGCC GCGCTTTGCC CGCTGGGGCT GGCGTTGCTG
GTAGGATTTT CAATTCCTGA CAGAGTTATG GAAGGAAAAC ATCCGCAATT TTTTGTCGAG
ATGACACAAG AATCACTTCA GCCAAGCCGC TATATTCTTA CTGATAGCGT CGGTGTTGCC
GCAGGTCTGG CATGGAGCCT GCAACGCGAT GACATCATCA TGTATCGCCA GACAGGTGAG
TTGAAATACG GCCTTAATTA TCCGGATGCG AAAGGGAGAT TTGTCAGCGG TGATGAGTTC
GCAAACTGGC TTAATCAACA TCGTCAGGAG GGGATTATTA CTCTCGTGCT TTCGGTTGAC
CGCGATGAAG ATATCAACAG TCTCGCCATT CCGCCCGCAG ATGCCATCGA TCGTCAGGAG
CGTCTGGTGC TGATTCAGTA TCGTCCCAAA TGA
 
Protein sequence
MKSVRYLIGL FAFIACYYLL PISTRLLWQP DETRYAEISR EMLASGDWIV PHLLGLRYFE 
KPIAGYWINS IGQWLFGANN FGVRAGVIFA TLLTAALVTW FTLRLWRDKR LALLATVIYL
SLFIVYAIGT YAVLDPFIAF WLVAGMCSFW LAMQAQTWKG KSAGFLLLGI TCGMGVMTKG
FLALAVPVLS VLPWVATQKR WKDLFIYGWL AVISCVLTVL PWGLAIAQRE PNFWHYFFWV
EHIQRFALDD AQHRAPFWYY VPVIIAGSLP WLGLLPGALY TGWKNRKHSA TVYLLSWTIM
PLLFFSVAKG KLPTYILSCF ASLAMLMAHY ALLAAKNNPL ALRINGWINI AFGVTGIIAT
FVVSPWGPMN TPVWQTFESY KVFCAWSIFS LWAFFGWYTL TNVEKTWPFA ALCPLGLALL
VGFSIPDRVM EGKHPQFFVE MTQESLQPSR YILTDSVGVA AGLAWSLQRD DIIMYRQTGE
LKYGLNYPDA KGRFVSGDEF ANWLNQHRQE GIITLVLSVD RDEDINSLAI PPADAIDRQE
RLVLIQYRPK
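
The gene and protein sequences above can be related to each other, and to the listed GC content, with a short script. This is a minimal sketch, not the tool that produced this record. It assumes the sequence is pasted with the same space-separated 10-base blocks shown here, and it builds the codon table from the amino-acid string of NCBI translation table 11, whose codon assignments match the standard code. Alternative start codons are not handled, which does not matter for this gene because it begins with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order: each position cycles through T, C, A, G,
# with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGAAATCGG TACGTTACCT TATCGGCCTC"
print(translate(first_blocks))  # MKSVRYLIGL
```

The ten residues printed match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.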