Gene EcSMS35_2565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2565 
Symbol 
ID6143329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2617924 
End bp2618922 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content51% 
IMG OID641617435 
Productbile acid/Na+ symporter family protein 
Protein accessionYP_001744600 
Protein GI170680783 
COG category[R] General function prediction only 
COG ID[COG0385] Predicted Na+-dependent transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000897855 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTT TTCGTATCCT CGATCCTTTC ACCTTAACCC TGATCACGGT GGTGCTGCTG 
GCCTCTTTCT TTCCGGCCAG AGGCGATTCC GTTCCCTTCT TTGAAAATCT GACTACCGCA
GCCATTGCTC TGCTGTTCTT TATGCACGGC GCGAAGTTAT CGCGAGAAGC CATTATTGCA
GGCGGTGGTC ATTGGCGACT GCATTTATGG GTGATGTGTA GCACCTTCGT GCTGTTCCCA
ATTTTGGGCG TGCTGTTTGC CTGGTGGAAA CCGGTAAATG TCGCCCCGAT GCTCTATTCC
GGTTTTCTCT ACTTGTGCAT TCTTCCTGCT ACCGTGCAGT CTGCAATCGC CTTCACGTCA
ATGGCGGGCG GTAACGTCGC GGCGGCGGTT TGTTCTGCGT CAGCATCCAG CCTGCTGGGG
ATTTTCCTTT CACCATTGCT GGTTGGTCTG GTGATGAATG TTCACGGTGC AGGGGGCAGC
CTTGAGCAGG TCGGTAAAAT TATGCTGCAA CTGCTGCTGC CGTTTGTGTT GGGGCATCTT
TCCCGGCCGT GGATTGGTGA CTGGGTGTCG CGCAATAAAA AATGGATTGC GAAAACTGAC
CAGACGTCCA TTCTGTTGGT GGTTTATACG GCATTCAGCG AAGCCGTCGT TAACGGTATC
TGGCACAAAG TTGGCTGGGG ATCATTGCTG TTTATCGTGG TGGTCAGTTG CGTTCTTCTG
GCTATCGTGA TTGTAGTTAA CGTCTTTATG GCACGCCGAC TGGGCTTCAA TAAGGCAGAT
GAAATTACTA TCGTCTTTTG TGGTTCGAAA AAGAGTCTGG CAAATGGCAT CCCGATGGCA
AACATTCTGT TCCCCACATC GGTGATCGGT ATGATGGTGC TGCCTCTGAT GATTTTCCAT
CAGATCCAAT TGATGGTCTG TGCGGTGCTG GCGCGTCGAT ACAAACGCCA GACCGAACAG
TTACAGGCGC AGCAGGAAAG CAGCGCCGAT AAAGCTTAA
 
Protein sequence
MKLFRILDPF TLTLITVVLL ASFFPARGDS VPFFENLTTA AIALLFFMHG AKLSREAIIA 
GGGHWRLHLW VMCSTFVLFP ILGVLFAWWK PVNVAPMLYS GFLYLCILPA TVQSAIAFTS
MAGGNVAAAV CSASASSLLG IFLSPLLVGL VMNVHGAGGS LEQVGKIMLQ LLLPFVLGHL
SRPWIGDWVS RNKKWIAKTD QTSILLVVYT AFSEAVVNGI WHKVGWGSLL FIVVVSCVLL
AIVIVVNVFM ARRLGFNKAD EITIVFCGSK KSLANGIPMA NILFPTSVIG MMVLPLMIFH
QIQLMVCAVL ARRYKRQTEQ LQAQQESSAD KA