Gene ECD_04071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_04071 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp4333179 
End bp4334456 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content47% 
IMG OID 
ProductHexuronate transporter 
Protein accessionACT45860 
Protein GI253980190 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA TAAAGAATTA CCGGTGGCAT ATGATTGCCC TCGTATGCTT TATCACTGTA 
ATCAATTATC TGGACAGAAC GGCATTAGGT ATTGCGGCTC CAACGATTAT GGAGACAACC
GGAATAACTA AAGAGCAATA TTCATGGATT GTCAGTGCAT TCCAGTTGGC CTATACATTA
GGGCAACCGG TAATGGGCTT CTTTATTGAT ACCGTGGGTC TGAAGTTAAG TTTTGCGATA
TGTGCCGCAA TTTGGGGCCT GGCGACAATG GGCCATGCAC TCACCGGAAC GTGGTCTGGT
CTGGCATTTA TGCGCGCCCT GATGGGTTTC AGCGAAGCGT CGGCAATTCC GGCGGGTGTA
AAAACCGCAT CAACATGGTT CCCGGCAAAA GAGCGTGGCG TGGCGACAGG TGTTTTCAAT
ATGGGCACCT CACTCGGCGC GATGCTTGCT CCACCGTTGA TTGCCTGGTG CATTATGTTT
CATAGCTGGC AATTTGCGTT TATTGTTTCA GGTAGCCTTG CTTTGCTCGC GGCTTTATTT
TGGTTCTTTT GTTATAAAGA TCCGAAAGAT GCCAAACGCC TTTCTGATGA AGAGCGCCAC
TATATTGAAT CAGGACAAGA ACAGCATCTT AAAACAGATA AGAAAGAAAA AACGTCAATC
AAGCATATCC TCAGCCAACG TAATTTCTGG GGGATTGGCA TCGCGCGTTT TCTCGCAGAC
CCGGCATGGG GAACCATTAA CTTCTGGGTG CCGATTTTCT TCGTCGAAAC GCTGCATTTT
AGCCTGAAAG AAATTGCCAT GTTCGTCTGG CTGCCTTTCC TGCTGGGCGA TCTCGGCTGT
TTAGCCAGTG GTTTTGTCGC GAAGTTCTTC CACGATCGCG GCGTGAGTTT AATTAACTCA
CGAAGAATTA CCTTCACTAT TGCAGCCGTC ATTATGATGA CGATTGGCCT GGTGAGTATT
GTCGAAAATC CCTACATTGC CGTATTACTG ATTAGTATTG GCGCGTTCTC GCATCAATGT
CTTTCTACTG TAGCAGCAAC TCTGGGTGGC GATCTGTTCA AAAAAGACGA AGTAGCTACC
GCAGTGGGTA TGGCAGGAGC CTGTGCGTGG AGCGGTCAGT TGATTTTCAA CCTGTTTATC
GGGGCATTCG TTCACATTAT CGGCTTCGCG CCGTTCTTTA TTGCCCTGGC TTTCTTTGAC
ATTATTGGCG CCATTGCGCT GTGGACGCTT ATCAAAGTTA AAGATGAAGA ACCGCAAGTA
CAGTTAGCGA CAAGCTAA
 
Protein sequence
MNKIKNYRWH MIALVCFITV INYLDRTALG IAAPTIMETT GITKEQYSWI VSAFQLAYTL 
GQPVMGFFID TVGLKLSFAI CAAIWGLATM GHALTGTWSG LAFMRALMGF SEASAIPAGV
KTASTWFPAK ERGVATGVFN MGTSLGAMLA PPLIAWCIMF HSWQFAFIVS GSLALLAALF
WFFCYKDPKD AKRLSDEERH YIESGQEQHL KTDKKEKTSI KHILSQRNFW GIGIARFLAD
PAWGTINFWV PIFFVETLHF SLKEIAMFVW LPFLLGDLGC LASGFVAKFF HDRGVSLINS
RRITFTIAAV IMMTIGLVSI VENPYIAVLL ISIGAFSHQC LSTVAATLGG DLFKKDEVAT
AVGMAGACAW SGQLIFNLFI GAFVHIIGFA PFFIALAFFD IIGAIALWTL IKVKDEEPQV
QLATS