Gene ECD_01950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01950 
SymbolwcaL 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2014939 
End bp2016159 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content54% 
IMG OID 
Productpredicted glycosyl transferase 
Protein accessionACT43801 
Protein GI253978131 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.144977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTCG GCTTCTTTTT ACTGAAATTT CCACTGTCGT CGGAAACCTT CGTCCTCAAT 
CAAATTACCG CGTTTATTGA TATGGGCTTT GAGGTGGAGA TTGTCGCGCT GCAAAAAGGC
GACACACAAA ACACCCACGC GGCATGGACA AAATATAACC TTGCCGCCAG AACCCGCTGG
TTACAGGACG AGCCACAAGG CAAAGTGGCG AAACTGCGCC ACCGCGCCAG CCAGACGTTG
CGCGGTATTC ATCGTAAAAA TACCTGGCAG GCGCTTAACC TCAAACGCTA TGGTGCTGAG
TCGCGGAACC TGATTTTGTC TGCCATTTGC GGCCAGGTCG CAACACCGTT TCATGCCGAT
GTGTTTATCG CTCATTTTGG CCCTGCTGGG GTAACCGCGG CAAAACTACG CGAACTGGGT
GTCATTCGCG GCAAAATTGC CACTATCTTC CACGGTATTG ATATCTCCAG TCGGGAAGTG
CTCAACCACT ACACTCCCGA ATATCAACAA CTGTTTCGCC GTGGCGACCT GATGTTACCA
ATAAGCAATC TGTGGGCCGG AAGGCTGCAA AAAATGGGCT GCCCGAGGGA AAAAATCGCC
GTATCGCGCA TGGGCGTGGA CATGACGCGT TTTAGCCCGC GTCCGGTGAA AGCGCCCGCA
ACGCCGCTGG AAATCATCTC CGTCGCACGC TTAACCGAGA AAAAAGGCCT GCATGTGGCG
ATTGAAGCCT GCCGCCAGTT GAAAGAGCAG GGCGTGGCAT TTCGCTATCG CATCCTCGGC
ATTGGCCCGT GGGAACGACG CCTGCGCACC CTCATCGAAC AATATCAACT GGAAGATGTG
ATAGAGATGC CGGGCTTTAA ACCGAGCCAT GAAGTGAAAG CGATGCTCGA TGACGCGGAT
GTCTTCCTGT TGCCATCGGT TACAGGTGCG GATGGCGATA TGGAAGGTAT TCCGGTGGCG
CTAATGGAAG CGATGGCGGT TGGCATTCCG GTTGTTTCTA CTCTGCACAG CGGAATACCG
GAACTGGTGG AGGCCGATAA ATCCGGTTGG CTGGTGCCTG AGAACGATGC CTGTGCACTG
GCGCAACGAC TGGCGGCGTT TAGCCAACTG GACACCGACG AACTTACTAC GGTTGTCAAA
CGTGCGCGCG AAAAAGTCGA ACACGATTTT AACCAGCAGG TGATCAATCG AGAACTCGCC
AGCTTGCTGC AGGCTTTATA G
 
Protein sequence
MKVGFFLLKF PLSSETFVLN QITAFIDMGF EVEIVALQKG DTQNTHAAWT KYNLAARTRW 
LQDEPQGKVA KLRHRASQTL RGIHRKNTWQ ALNLKRYGAE SRNLILSAIC GQVATPFHAD
VFIAHFGPAG VTAAKLRELG VIRGKIATIF HGIDISSREV LNHYTPEYQQ LFRRGDLMLP
ISNLWAGRLQ KMGCPREKIA VSRMGVDMTR FSPRPVKAPA TPLEIISVAR LTEKKGLHVA
IEACRQLKEQ GVAFRYRILG IGPWERRLRT LIEQYQLEDV IEMPGFKPSH EVKAMLDDAD
VFLLPSVTGA DGDMEGIPVA LMEAMAVGIP VVSTLHSGIP ELVEADKSGW LVPENDACAL
AQRLAAFSQL DTDELTTVVK RAREKVEHDF NQQVINRELA SLLQAL