Gene ECD_04131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_04131 
SymbolidnT 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp4402798 
End bp4404117 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content51% 
IMG OID 
ProductL-idonate and D-gluconate transporter 
Protein accessionACT45919 
Protein GI253980249 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATTAA TCATTATTGC GGCAGGCGTC GCGCTGCTTC TTATCCTGAT GATCGTCTTT 
AAAGTTAACG GCTTTATTGC CCTCGTTCTG GTAGCTGCCG TCGTCGGATT TGCCGAAGGG
ATGGATGCAC AGGCCGTCCT GCACTCTATA CAAAATGGTA TCGGCAGCAC GCTCGGCGGG
CTGGCAATGA TCCTCGGTTT CGGGGCCATG TTAGGCAAGC TGATTTCTGA TACGGGTGCG
GCACAACGTA TCGCCACTAC GCTGATTGCT ACTTTTGGTA AAAAACGCGT GCAATGGGCG
CTAGTGATCA CCGGTCTGGT TGTGGGCCTC GCCATGTTTT TTGAAGTGGG TTTTGTCCTG
CTGTTGCCGT TGGTATTTAC CATCGTAGCA TCATCAGGAT TACCCCTGTT GTATGTTGGC
GTACCAATGG TAGCAGCGCT CTCTGTAACC CACTGTTTTC TGCCGCCACA TCCAGGGCCT
ACTGCCATCG CGACTATCTT TGAGGCTAAT CTCGGAACGA CTTTACTGTA TGGATTTATC
ATTACCATTC CGACAGTTAT TGTCGCAGGA CCGCTGTTTT CTAAACTGCT AACTCGCTTT
GAGAAAGCAC CACCGGAAGG CTTATTTAAT CCTCATCTGT TTAGCGAAGA GGAGATGCCC
TCCTTCTGGA ACAGTATTTT CGCTGCCGTG ATCCCGGTCA TCCTGATGGC TATCGCCGCC
GTTTGTGAAA TTACGTTACC GAAAACTAAC ACCGTGCGCC TCTTCTTTGA ATTTGTCGGT
AACCCTGCCG TTGCGCTGTT TATTGCCATT GTTATTGCGA TTTTCACACT GGGCCGACGT
AATGGACGCA CCATCGAGCA AATCATGGAT ATCATTGGGG ATTCTATAGG CGCTATCGCG
ATGATTGTGT TTATTATCGC TGGCGGCGGC GCGTTTAAGC AGGTATTAGT AGATAGCGGT
GTCGGGCACT ATATTTCACA CTTAATGACC GGAACTACAC TTTCGCCGTT ATTGATGTGC
TGGACTGTTG CGGCGCTGTT GCGTATCGCT CTGGGCTCTG CCACCGTCGC GGCCATTACC
ACCGCGGGTG TGGTGTTGCC GATTATCAAC GTTACCCATG CCGATCCCGC TTTAATGGTA
CTGGCAACCG GTGCGGGCAG CGTGATCGCG TCACACGTAA ACGACCCTGG CTTCTGGCTA
TTTAAAGGGT ATTTTAATCT GACGGTTGGT GAAACGTTGC GTACCTGGAC GGTGATGGAA
ACCCTTATTT CTATTATGGG TTTGCTGGGC GTGTTAGCCA TTAACGCCGT ATTGCACTGA
 
Protein sequence
MPLIIIAAGV ALLLILMIVF KVNGFIALVL VAAVVGFAEG MDAQAVLHSI QNGIGSTLGG 
LAMILGFGAM LGKLISDTGA AQRIATTLIA TFGKKRVQWA LVITGLVVGL AMFFEVGFVL
LLPLVFTIVA SSGLPLLYVG VPMVAALSVT HCFLPPHPGP TAIATIFEAN LGTTLLYGFI
ITIPTVIVAG PLFSKLLTRF EKAPPEGLFN PHLFSEEEMP SFWNSIFAAV IPVILMAIAA
VCEITLPKTN TVRLFFEFVG NPAVALFIAI VIAIFTLGRR NGRTIEQIMD IIGDSIGAIA
MIVFIIAGGG AFKQVLVDSG VGHYISHLMT GTTLSPLLMC WTVAALLRIA LGSATVAAIT
TAGVVLPIIN VTHADPALMV LATGAGSVIA SHVNDPGFWL FKGYFNLTVG ETLRTWTVME
TLISIMGLLG VLAINAVLH