Gene EcHS_A4521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4521 
SymbolidnT 
ID5593984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4527332 
End bp4528651 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content51% 
IMG OID640923617 
ProductGnt-II system L-idonate transporter 
Protein accessionYP_001461058 
Protein GI157163740 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG2610] H+/gluconate symporter and related permeases 
TIGRFAM ID[TIGR00791] gluconate transporter 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.00247449 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATTAA TCATTATTGC GGCAGGCGTC GCGCTGCTTC TTATCCTGAT GATCGGCTTT 
AAAGTTAACG GCTTTATTGC CCTCGTTCTG GTAGCTGCCG TCGTCGGATT TGCCGAAGGG
ATGGATGCAC AGGCCGTCCT GCACTCTATA CAAAATGGTA TCGGCAGCAC GCTCGGCGGG
CTGGCAATGA TTCTCGGTTT CGGGGCCATG TTAGGCAAGC TGATTTCTGA TACGGGTGCG
GCACAACGTA TCGCCACTAC GCTGATTGCT ACTTTTGGTA AAAAACGCGT GCAATGGGCG
CTAGTGATCA CCGGTCTGGT TGTGGGCCTC GCCATGTTTT TTGAAGTGGG TTTTGTCCTG
CTGTTGCCGT TGGTATTTAC CATCGTAGCA TCATCAGGAT TACCCCTGTT GTATGTTGGC
GTACCAATGG TAGCAGCGCT CTCTGTAACC CACTGTTTTC TGCCGCCACA TCCAGGGCCT
ACTGCCATCG CGACTATCTT TGAGGCTAAT CTCGGAACGA CTTTACTGTA TGGATTTATC
ATTACCATTC CGACAGTTAT TGTCGCAGGA CCGCTGTTTT CTAAACTGCT AACTCGCTTT
GAGAAAGCAC CACCGGAAGG CTTATTTAAT CCTCATCTGT TTAGCGAAGA GGAGATGCCC
TCCTTCTGGA ACAGTATTTT CGCTGCCGTG ATCCCGGTCA TCCTGATGGC TATCGCCGCC
GTTTGTGAAA TTACGTTACC GAAAACTAAC ACCGTGCGCC TCTTCTTTGA ATTTGTCGGT
AACCCTGCCG TTGCGCTGTT TATTGCCATT GTTATTGCGA TTTTCACACT GGGCCGACGT
AATGGACGCA CCATCGAGCA AATCATGGAT ATCATTGGGG ATTCTATAGG CGCTATCGCG
ATGATTGTGT TTATTATCGC TGGCGGCGGC GCGTTTAAGC AGGTATTAGT AGATAGCGGT
GTCGGGCACT ATATTTCACA CTTAATGACC GGAACTACGC TTTCGCCGTT ATTGATGTGC
TGGACTGTTG CGGCGCTGTT GCGTATCGCT CTGGGCTCTG CCACCGTCGC GGCCATTACC
ACCGCGGGTG TGGTGTTGCC GATTATCAAC GTTACCCATG CCGATCCCGC TTTAATGGTA
CTGGCAACCG GTGCGGGCAG CGTGATCGCG TCACACGTAA ACGACCCTGG CTTCTGGCTA
TTTAAAGGGT ATTTTAATCT GACGGTTGGT GAAACGTTGC GTACCTGGAC GGTGATGGAA
ACCCTTATTT CTATTATGGG TTTGCTGGGC GTGTTAGCCA TTAACGCCGT ATTGCACTGA
 
Protein sequence
MPLIIIAAGV ALLLILMIGF KVNGFIALVL VAAVVGFAEG MDAQAVLHSI QNGIGSTLGG 
LAMILGFGAM LGKLISDTGA AQRIATTLIA TFGKKRVQWA LVITGLVVGL AMFFEVGFVL
LLPLVFTIVA SSGLPLLYVG VPMVAALSVT HCFLPPHPGP TAIATIFEAN LGTTLLYGFI
ITIPTVIVAG PLFSKLLTRF EKAPPEGLFN PHLFSEEEMP SFWNSIFAAV IPVILMAIAA
VCEITLPKTN TVRLFFEFVG NPAVALFIAI VIAIFTLGRR NGRTIEQIMD IIGDSIGAIA
MIVFIIAGGG AFKQVLVDSG VGHYISHLMT GTTLSPLLMC WTVAALLRIA LGSATVAAIT
TAGVVLPIIN VTHADPALMV LATGAGSVIA SHVNDPGFWL FKGYFNLTVG ETLRTWTVME
TLISIMGLLG VLAINAVLH