Gene EcE24377A_2697 details


Gene Information

Locus tag: EcE24377A_2697
Symbol:
ID: 5586161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 2684953
End bp: 2685951
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 52%
IMG OID: 640926352
Product: bile acid/Na+ symporter family protein
Protein accession: YP_001463739
Protein GI: 157157547
COG category: [R] General function prediction only
COG ID: [COG0385] Predicted Na+-dependent transporter
TIGRFAM ID:
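
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive, 1-based coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Hypothetical helpers for illustration only.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2684953, 2685951)
print(length)                     # 999
print(protein_length_aa(length))  # 332
```

Both values agree with the Gene Length and Protein Length fields of this record.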


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 2.09356e-12
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTTT TTCGTATCCT CGATCCTTTC ACCTTAACCC TGATCACGGT GGTGTTGCTG 
GCCTCTTTCT TTCCGGCCAG AGGCGATTTC GTCCCCTTCT TTGAAAATCT GACCACCGCA
GCGATTGCCC TGCTGTTCTT TATGCACGGC GCGAAGTTGT CGCGTGAGGC GATTATTGCT
GGCGGTGGTC ACTGGCGACT GCATTTGTGG GTAATTTGCA GCACCTTCGT GCTGTTTCCG
ATTCTGGGTG TACTGTTTGC CTGGTGGAAA CCGGTAAATG TCGACCCGAT GCTCTACTCC
GGTTTTCTCT ACTTGTGCAT TCTCCCGGCT ACCGTGCAGT CTGCAATCGC CTTCACGTCA
ATGGCGGGCG GTAACGTCGC GGCGGCGGTT TGTTCTGCGT CAGCATCCAG CCTGCTGGGG
ATTTTCCTTT CACCATTGCT GGTTGGTCTG GTGATGAATG TTCACGGTGC AGGGGGCAGC
CTTGAGCAGG TCGGTAAAAT TATGCTGCAA CTGCTGCTGC CGTTTGTGTT GGGGCATCTT
TCCCGGCCGT GGATTGGTGA CTGGGTGTCG CGCAATAAAA AATGGATTGC GAAAACTGAC
CAGACGTCCA TTCTGTTGGT GGTTTATACA GCGTTCAGCG AAGCCGTCGT TAATGGTATC
TGGCATAAAG TTGGCTGGGG ATCATTGCTG TTTATCGTGG TGGTCAGCTG CGTTCTTCTG
GCTATCGTGA TTGTAGTTAA CGTCTTTATG GCACGCCGAC TGGGCTTCAA TAAGGCAGAT
GAAATTACTA TCGTCTTTTG TGGTTCGAAA AAGAGTCTGG CAAATGGCAT CCCGATGGCA
AACATTCTGT TCCCCACATC GGTGATCGGT ATGATGGTGC TGCCCCTAAT GATTTTCCAT
CAGATCCAAT TGATGGTCTG TGCGGTGCTG GCGCGTCGAT ACAAACGCCA GACCGAACAG
TTACAGGCAC AGCAGGAAAG CAGCGCCGAT AAAGCTTAA
 
Protein sequence
MKLFRILDPF TLTLITVVLL ASFFPARGDF VPFFENLTTA AIALLFFMHG AKLSREAIIA 
GGGHWRLHLW VICSTFVLFP ILGVLFAWWK PVNVDPMLYS GFLYLCILPA TVQSAIAFTS
MAGGNVAAAV CSASASSLLG IFLSPLLVGL VMNVHGAGGS LEQVGKIMLQ LLLPFVLGHL
SRPWIGDWVS RNKKWIAKTD QTSILLVVYT AFSEAVVNGI WHKVGWGSLL FIVVVSCVLL
AIVIVVNVFM ARRLGFNKAD EITIVFCGSK KSLANGIPMA NILFPTSVIG MMVLPLMIFH
QIQLMVCAVL ARRYKRQTEQ LQAQQESSAD KA
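
The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before use. The following is a minimal sketch, not the database's own tooling, of how the blocks could be joined, the GC percentage computed, and the gene translated. All names are hypothetical. It assumes the sequence contains only A, C, G and T (an ambiguous base such as N would raise a KeyError) and uses the amino-acid assignments of the standard genetic code, which translation table 11 shares. Table 11's alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
# Codon table built in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGAAACTTT TTCGTATCCT CGATCCTTTC"
print(translate(first_blocks))  # MKLFRILDPF
```

The ten residues printed match the first block of the protein sequence. Passing the full gene sequence to the same functions would allow the listed protein sequence and GC content to be compared against the record.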