Gene ECD_00072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_00072 
SymbolsetA 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp80426 
End bp81604 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content48% 
IMG OID 
Productbroad specificity sugar efflux system 
Protein accessionACT41973 
Protein GI253976303 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCTGGA TAATGACGAT GGCTCGCCGT ATGAACGGTG TTTACGCGGC ATTTATGCTG 
GTCGCTTTTA TGATGGGGGT GGCCGGGGCG CTACAGGCTC CTACATTGAG CTTATTTCTG
AGTCGTGAGG TTGGCGCGCA ACCTTTCTGG ATCGGACTCT TTTATACGGT GAATGCTATT
GCTGGGATCG GCGTAAGCCT CTGGTTGGCA AAACGTTCTG ACAGTCAGGG CGATCGGCGA
AAACTGATTA TATTTTGCTG TTTGATGGCT ATCGGCAATG CGCTATTGTT TGCATTTAAT
CGTCATTATC TGACGCTTAT CACCTGTGGT GTGCTTCTGG CATCTCTGGC CAATACGGCA
ATGCCACAGT TATTTGCTCT GGCGCGGGAA TATGCGGATA ACTCGGCGCG AGAAGTGGTG
ATGTTTAGCT CGGTGATGCG TGCGCAGCTT TCTCTGGCAT GGGTTATCGG TCCACCGTTG
GCCTTTATGC TGGCGTTGAA TTACGGCTTT ACGGTGATGT TTTCGATTGC CGCCGGGATA
TTCACACTCA GTCTGGTATT GATTGCATTT ATGCTTCCGT CTGTGGCGCG GGTAGAACTG
CCGTCGGAAA ATGCTTTATC AATGCAAGGT GGCTGGCAGG ATAGTAACGT ACGGATGTTA
TTTGTCGCCT CGACGTTAAT GTGGACCTGC AACACCATGT ACATTATTGA TATGCCGTTG
TGGATCAGTA GCGAGTTAGG ATTGCCAGAC AAACTGGCGG GTTTCCTGAT GGGGACGGCA
GCTGGTCTGG AAATACCAGC AATGATTCTG GCTGGCTACT ATGTCAAACG TTATGGTAAG
CGGCGAATGA TGGTCATAGC AGTGGCGGCA GGAGTACTGT TTTACACCGG ATTGATTTTA
TTTCATAGCC GTCTGGCGTT GATGACGCTG CAACTTTTTA ACGCTGTATT TATCGGCATT
GTTGCGGGTA TTGGGATGCT ATGGTTTCAG GATTTAATGC CTGGAAGAGC GGGGGCAGCT
ACCACCTTAT TTACTAACAG TATTTCTACC GGGGTAATTC TGGCTGGCGT TATTCAGGGA
GCAATTGCAC AAAGTTGGGG GCACTTTGCT GTCTACTGGG TAATTGCGGT TATTTCTGTT
ATCGCATTAT TTTTAACCGC AAAGGTTAAA GACGTTTGA
 
Protein sequence
MIWIMTMARR MNGVYAAFML VAFMMGVAGA LQAPTLSLFL SREVGAQPFW IGLFYTVNAI 
AGIGVSLWLA KRSDSQGDRR KLIIFCCLMA IGNALLFAFN RHYLTLITCG VLLASLANTA
MPQLFALARE YADNSAREVV MFSSVMRAQL SLAWVIGPPL AFMLALNYGF TVMFSIAAGI
FTLSLVLIAF MLPSVARVEL PSENALSMQG GWQDSNVRML FVASTLMWTC NTMYIIDMPL
WISSELGLPD KLAGFLMGTA AGLEIPAMIL AGYYVKRYGK RRMMVIAVAA GVLFYTGLIL
FHSRLALMTL QLFNAVFIGI VAGIGMLWFQ DLMPGRAGAA TTLFTNSIST GVILAGVIQG
AIAQSWGHFA VYWVIAVISV IALFLTAKVK DV