Gene B21_01393 details

Gene Information

Locus tag: B21_01393
Symbol: mdoD
ID: 8114860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 1450327
End bp: 1451982
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 51%
IMG OID: 644847636
Product: hypothetical protein
Protein accession: YP_002999209
Protein GI: 251784905
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3131] Periplasmic glucans biosynthesis protein
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
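
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming that the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1450327, 1451982)
print(length_bp)                  # 1656
print(protein_length(length_bp))  # 551
```

Both values agree with the Gene Length and Protein Length fields.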


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCGTA GACGATTTAT TAAAGGTTCA ATGGCTATGG CCGCCGTGTG CGGTACCAGC 
GGCATTGCTT CTCTTTTTTC TCAGGCGGCA TTCGCGGCAG ATTCTGATAT TGCCGACGGG
CAAACCCAGC GTTTTGACTT CTCCATTCTA CAGTCAATGG CGCACGACTT AGCGCAAACA
GCGTGGCGTG GTGCGCCTCG TCCGTTACCT GACACGCTGG CGACAATGAC GCCGCAGGCT
TATAACAGTA TTCAATACGA CGCCGAAAAA TCGCTCTGGC ATAACGTTGA GAACCGTCAA
CTGGACGCTC AGTTCTTCCA TATGGGAATG GGATTCCGTC GCCGCGTTCG TATGTTTTCT
GTAGATCCAG CAACACATCT GGCGCGTGAA ATTCACTTTC GCCCGGAGTT GTTCAAATAC
AACGATGCAG GTGTTGATAC AAAACAATTA GAAGGGCAAA GCGATCTCGG CTTTGCCGGT
TTTCGCGTGT TTAAAGCCCC CGAACTGGCG CGCCGTGATG TAGTATCATT TCTCGGCGCG
AGTTATTTCC GCGCCGTTGA TGATACATAT CAATACGGTT TGTCGGCCCG CGGCCTGGCG
ATCGACACTT ACACCGACAG TAAAGAAGAG TTCCCCGACT TTACCGCCTT CTGGTTTGAT
ACGGTAAAAC CGGGGGCAAC TACCTTTACC GTTTATGCGT TGCTCGATAG CGCCAGCATT
ACTGGTGCCT ATAAGTTCAC TATCCATTGT GAGAAAAGTC AGGTGATTAT GGATGTGGAA
AATCACCTGT ATGCGCGCAA AGACATTAAA CAGCTGGGCA TTGCGCCGAT GACCAGTATG
TTCAGCTGCG GTACTAATGA ACGTCGGATG TGCGATACAA TTCATCCGCA AATTCATGAC
TCTGATCGTC TGTCCATGTG GCGGGGCAAC GGCGAGTGGA TTTGCCGTCC GCTGAATAAT
CCGCAAAAAT TGCAGTTCAA TGCTTACACC GACAACAACC CGAAAGGGTT TGGTTTATTG
CAACTGGATC GTGACTTCTC CCATTATCAG GACATTATGG GCTGGTATAA CAAACGCCCA
AGTCTGTGGG TGGAACCGCG TAACAAGTGG GGTAAGGGCA CCATCGGCCT GATGGAAATC
CCAACAACGG GCGAAACGCT GGATAACATT GTCTGCTTCT GGCAGCCAGA AAAAGCTGTA
AAAGCAGGTG ATGAGTTTGC ATTCCAGTAT CGTCTGTACT GGAGTGCGCA ACCGCCTGTT
CATTGCCCAT TAGCGCGCGT TATGGCGACG CGTACCGGCA TGGGCGGTTT CTCGGAAGGT
TGGGCGCCAG GTGAACACTA TCCCGAAAAA TGGGCGCGTC GTTTTGCCGT CGATTTCGTT
GGTGGTGATC TGAAAGCTGC CGCGCCAAAA GGCATTGAGC CGGTGATTAC GCTTTCCAGT
GGGGAAGCGA AGCAAATCGA AATTCTCTAT ATTGAACCCA TCGATGGTTA TCGTATTCAG
TTTGACTGGT ATCCGACTTC GGACTCCACT GATCCGGTCG ATATGCGGAT GTATCTACGT
TGTCAGGGGG ACGCTATCAG TGAAACATGG CTGTATCAGT ATTTCCCGCC AGCGCCGGAT
AAACGTCAGT ATGTTGACGA CCGCGTGATG AGTTAA
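
A GC content value such as the one reported for this gene is the fraction of G and C bases in the sequence. The sketch below shows one way to compute it from a sequence laid out in space-separated blocks of ten, as above. It assumes the sequence contains only unambiguous bases; `gc_content` is a hypothetical helper name, and the example call uses only the first two blocks of the gene sequence.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the gene sequence: 7 of 20 bases are G or C.
print(gc_content("ATGGATCGTA GACGATTTAT"))  # 0.35
```

Passing the full gene sequence to the same function gives the fraction for the whole gene, which can be compared with the rounded percentage in the record.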
 
Protein sequence
MDRRRFIKGS MAMAAVCGTS GIASLFSQAA FAADSDIADG QTQRFDFSIL QSMAHDLAQT 
AWRGAPRPLP DTLATMTPQA YNSIQYDAEK SLWHNVENRQ LDAQFFHMGM GFRRRVRMFS
VDPATHLARE IHFRPELFKY NDAGVDTKQL EGQSDLGFAG FRVFKAPELA RRDVVSFLGA
SYFRAVDDTY QYGLSARGLA IDTYTDSKEE FPDFTAFWFD TVKPGATTFT VYALLDSASI
TGAYKFTIHC EKSQVIMDVE NHLYARKDIK QLGIAPMTSM FSCGTNERRM CDTIHPQIHD
SDRLSMWRGN GEWICRPLNN PQKLQFNAYT DNNPKGFGLL QLDRDFSHYQ DIMGWYNKRP
SLWVEPRNKW GKGTIGLMEI PTTGETLDNI VCFWQPEKAV KAGDEFAFQY RLYWSAQPPV
HCPLARVMAT RTGMGGFSEG WAPGEHYPEK WARRFAVDFV GGDLKAAAPK GIEPVITLSS
GEAKQIEILY IEPIDGYRIQ FDWYPTSDST DPVDMRMYLR CQGDAISETW LYQYFPPAPD
KRQYVDDRVM S
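
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this in plain Python, under these assumptions: translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in the start codons they permit), the CDS begins with ATG as this one does, and the sequence holds only A, C, G and T. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to the first stop codon; whitespace is ignored.

    Alternative start codons (e.g. GTG or TTG read as Met) are not handled.
    """
    bases = "".join(cds.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(bases), 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGATCGTA GACGATTTAT TAAAGGTTCA"))  # MDRRRFIKGS
# Last two codons of the gene sequence: serine followed by a stop.
print(translate("AGTTAA"))  # S
```

The two outputs match the beginning and the end of the protein sequence listed above.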