Gene EcHS_A2544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2544 
Symbol 
ID5595117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2554510 
End bp2555508 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content52% 
IMG OID640921665 
Productbile acid/Na+ symporter family protein 
Protein accessionYP_001459192 
Protein GI157161874 
COG category[R] General function prediction only 
COG ID[COG0385] Predicted Na+-dependent transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000983747 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTTT TTCGTATCCT CGATCCTTTC ACCTTAACCC TGATCACGGT GGTGTTGCTG 
GCCTCTTTCT TTCCGGCCAG AGGCGATTTC GTCCCCTTCT TTGAAAATCT GACCACCGCA
GCTATTGCCC TGCTGTTCTT TATGCACGGC GCGAAGTTGT CGCGTGAGGC GATTATTGCT
GGCGGTGGTC ACTGGCGACT GCATTTGTGG GTAATGTGCA GCACCTTCGT GCTGTTTCCG
ATTCTGGGTG TACTGTTTGC CTGGTGGAAA CCGGTAAATG TCGACCCGAT GCTCTACTCC
GGTTTTCTCT ACTTGTGCAT TCTCCCGGCT ACCGTGCAGT CTGCAATCGC CTTCACGTCA
ATGGCGGGCG GTAACGTCGC GGCAGCGGTT TGTTCTGCGT CGGCATCCAG CCTGCTGGGG
ATTTTCCTTT CACCATTGCT GGTTGGTCTG GTGATGAATG TTCACGGTGC AGGGGGCAGC
CTTGAGCAGG TCGGTAAAAT TATGCTGCAA CTGCTGCTGC CGTTTGTGTT GGGGCATCTT
TCCCGGCCGT GGATTGGTGA CTGGGTGTCG CGCAATAAAA AATGGATTGC GAAAACTGAC
CAGACGTCCA TTCTGTTGGT GGTTTATACA GCGTTCAGCG AAGCCGTCGT TAATGGTATC
TGGCATAAAG TTGGCTGGGG ATCATTGCTG TTTATCGTGG TGGTCAGCTG CGTTCTTCTG
GCTATCGTGA TTGTAGTTAA CGTCTTTATG GCACGCCGAC TGAGCTTCAA TAAGGCAGAT
GAAATTACTA TCGTCTTTTG TGGTTCGAAA AAGAGTCTGG CAAATGGCAT CCCGATGGCA
AACATTCTGT TCCCCACATC GGTGATCGGT ATGATGGTGC TGCCCCTGAT GATTTTCCAT
CAGATCCAAT TGATGGTCTG TGCGGTGCTG GCGCGTCGAT ACAAACGCCA GACCGAACAG
TTACAGGCGC AGCAGGAAAG CAGCGCCGAT AAAGCTTAA
 
Protein sequence
MKLFRILDPF TLTLITVVLL ASFFPARGDF VPFFENLTTA AIALLFFMHG AKLSREAIIA 
GGGHWRLHLW VMCSTFVLFP ILGVLFAWWK PVNVDPMLYS GFLYLCILPA TVQSAIAFTS
MAGGNVAAAV CSASASSLLG IFLSPLLVGL VMNVHGAGGS LEQVGKIMLQ LLLPFVLGHL
SRPWIGDWVS RNKKWIAKTD QTSILLVVYT AFSEAVVNGI WHKVGWGSLL FIVVVSCVLL
AIVIVVNVFM ARRLSFNKAD EITIVFCGSK KSLANGIPMA NILFPTSVIG MMVLPLMIFH
QIQLMVCAVL ARRYKRQTEQ LQAQQESSAD KA