Gene EcHS_A2441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2441 
Symbol 
ID5591779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2451091 
End bp2452923 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content52% 
IMG OID640921564 
Productputative transporter 
Protein accessionYP_001459098 
Protein GI157161780 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACGGTG AATTGATTTG GGTTCTTTCA TTACTGGCCG TTGCCATCGT CTTGTTTGCG 
ACGGGCAGAG TGCGTATGGA TGCGGTCGCT TTGTTTGTTA TTGTCGCGTT TGCATTAAGC
GGAACGCTGA CAGTCCCAGA AGTATTTTCC GGCTTTTCTG ATCCTAACGT TGTCCTGATT
GCCGCCTTGT TTATTATTGG CGATGGTTTG GTCCGTACCG GTGTTGCCAC CGTAATGGGA
ACATGGCTGG TCAAAGTTGC GGGCAATAGT GAAATCAAAA TGTTGGTTTT GTTGATGCTG
ACCGTCGCGG GGCTTGGCGC GTTTATGAGT TCAACCGGCG TTGTCGCTAT CTTTATTCCC
GTGGTGTTAA GCGTTGCCAT GCGTATGCAA ACGTCGCCGT CACGTCTGAT GATGCCGTTA
AGTTTTGCCG GGCTGATTAG CGGCATGATG ACGCTGGTGG CGACGCCGCC GAACCTGGTA
GTCAACAGTG AATTGCTGCG TGAAGGCTAT CACGGCTTCA GTTTCTTTAG CGTAACACCT
ATTGGCCTGG TCGTGCTGGT GCTGGGTATT TTGTATATGT TAGTGATGCG TTTCATGCTG
AAAGGGGATA CCCAGACCCC GCAGCGCGAA GGCTGGACGC GTCGAACCTT TCGCGATCTT
ATCCGTGAAT ATCGACTGAC CGGGCGTGCG CGACGTCTGG CTATTCGCCC CGGATCGCCA
ATGATTGGTC AACGGCTGGA TGATCTCAAA TTACGTGAGC GTTATGGCGC TAACGTCATC
GGTGTTGAAC GCTGGCGGCG TTTTCGTCGC GTTATCGTGA ACGTTAATGG GGTTTCTGAA
TTTCGCGCGC GTGACGTTTT GCTTATTGAT ATGTCTGCGG CTGATGTCGA TCTCCGGCAA
TTTTGTAGTG AGCAATTGCT GGAGCCGATG GTACTGCGCG GCGAGTATTT TTCTGACCAG
GCCCTTGATG TGGGCATGGC AGAGATTTCA TTAATTCCTG AGTCAGAACT GATTGGTAAA
TCGGTGCGCG AAATTGGTTT TCGTACCCGC TACGGACTGA ATGTGGTGGG GCTAAAGCGC
AATGGCGTGG CGCTGGAAGG TTCGCTGGCG GATGAGCCTC TGCTGCTGGG CGATATCATC
CTGGTTGTGG GTAACTGGAA ACTGATCGGT ATGCTGGCCA AACAGGGCCG CGACTTCGTA
GCGCTGAACT TACCGGAAGA GGTGAGTGAA GCATCGCCCG CGCACAGCCA GGCACCCCAT
GCCATTTTCT GTCTGGTGCT AATGGTGGCG TTAATGCTGA CAGATGAAAT TCCTAATCCT
GTTGCCGCTA TCATCGCCTG CCTGCTGATG GGGAAATTCC GCTGTATAGA TGCTGAAAGC
GCCTATAAAT CCATTCACTG GCCGAGCATT ATTTTGATCG TTGGGATGAT GCCGTTTGCT
GTGGCATTAC AGAAAACGGG AGGTGTCGCG CTGGCGGTGA AAGGGCTGAT GGACATTGGC
GGTGGTTACG GGCCACATAT GATGCTGGGG TGTTTGTTTG TCTTGTCGGC GGTTATTGGG
CTATTTATCT CTAATACCGC GACGGCGGTG TTGATGGCTC CGATTGCGCT GGCTGCTGCC
AAAACGATGG GGGTGTCGCC TTATCCATTC GCGATGATCG TGGCGATGGC AGCATCCGCC
GCCTTTATGA CACCGGTTTC TTCACCTGTT AACACACTGG TTTTAGGTCC GGGAAATTAC
AGCTTCAGTG ACTTTGTGAA GTTGGGGGTA CCGTTCACCA TTATCGTGAT GGCGGTTTGT
GTGGTGATGA TCCCGATGCT GTTTCCGTTT TGA
 
Protein sequence
MNGELIWVLS LLAVAIVLFA TGRVRMDAVA LFVIVAFALS GTLTVPEVFS GFSDPNVVLI 
AALFIIGDGL VRTGVATVMG TWLVKVAGNS EIKMLVLLML TVAGLGAFMS STGVVAIFIP
VVLSVAMRMQ TSPSRLMMPL SFAGLISGMM TLVATPPNLV VNSELLREGY HGFSFFSVTP
IGLVVLVLGI LYMLVMRFML KGDTQTPQRE GWTRRTFRDL IREYRLTGRA RRLAIRPGSP
MIGQRLDDLK LRERYGANVI GVERWRRFRR VIVNVNGVSE FRARDVLLID MSAADVDLRQ
FCSEQLLEPM VLRGEYFSDQ ALDVGMAEIS LIPESELIGK SVREIGFRTR YGLNVVGLKR
NGVALEGSLA DEPLLLGDII LVVGNWKLIG MLAKQGRDFV ALNLPEEVSE ASPAHSQAPH
AIFCLVLMVA LMLTDEIPNP VAAIIACLLM GKFRCIDAES AYKSIHWPSI ILIVGMMPFA
VALQKTGGVA LAVKGLMDIG GGYGPHMMLG CLFVLSAVIG LFISNTATAV LMAPIALAAA
KTMGVSPYPF AMIVAMAASA AFMTPVSSPV NTLVLGPGNY SFSDFVKLGV PFTIIVMAVC
VVMIPMLFPF