Gene EcolC_1269 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1269 
Symbol 
ID6067064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1387291 
End bp1388289 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content52% 
IMG OID641600684 
Productbile acid:sodium symporter 
Protein accessionYP_001724262 
Protein GI170019308 
COG category[R] General function prediction only 
COG ID[COG0385] Predicted Na+-dependent transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000908139 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000128844 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACTTT TTCGTATCCT CGATCCTTTC ACCTTAACCC TGATCACGGT GGTGTTGCTG 
GCCTCTTTCT TTCCGGCCAG AGGCGATTTC GTCCCCTTCT TTGAAAATCT GACCACCGCA
GCTATTGCCC TGCTGTTCTT TATGCACGGC GCGAAGTTGT CGCGTGAGGC GATTATTGCT
GGCGGTGGTC ACTGGCGACT GCATTTGTGG GTAATGTGCA GCACCTTCGT GCTGTTTCCG
ATTCTGGGTG TACTGTTTGC CTGGTGGAAA CCGGTAAATG TCGACCCGAT GCTCTACTCC
GGTTTTCTCT ACTTGTGCAT TCTCCCGGCT ACCGTGCAGT CTGCAATCGC CTTCACGTCA
ATGGCGGGCG GTAACGTCGC GGCAGCGGTT TGTTCTGCGT CGGCATCCAG CCTGCTGGGG
ATTTTCCTTT CACCATTGCT GGTTGGTCTG GTGATGAATG TTCACGGTGC AGGGGGCAGC
CTTGAGCAGG TCGGTAAAAT TATGCTGCAA CTGCTGCTGC CGTTTGTGTT GGGGCATCTT
TCCCGGCCGT GGATTGGTGA CTGGGTGTCG CGCAATAAAA AATGGATTGC GAAAACTGAC
CAGACGTCCA TTCTGTTGGT GGTTTATACA GCGTTCAGCG AAGCCGTCGT TAATGGTATC
TGGCATAAAG TTGGCTGGGG ATCATTGCTG TTTATCGTGG TGGTCAGCTG CGTTCTTCTG
GCTATCGTGA TTGTAGTTAA CGTCTTTATG GCACGCCGAC TGAGCTTCAA TAAGGCAGAT
GAAATTACTA TCGTCTTTTG TGGTTCGAAA AAGAGTCTGG CAAATGGCAT CCCGATGGCA
AACATTCTGT TCCCCACATC GGTGATCGGT ATGATGGTGC TGCCCCTGAT GATTTTCCAT
CAGATCCAAT TGATGGTCTG TGCGGTGCTG GCGCGTCGAT ACAAACGCCA GACCGAACAG
TTACAGGCGC AGCAGGAAAG CAGCGCCGAT AAAGCTTAA
 
Protein sequence
MKLFRILDPF TLTLITVVLL ASFFPARGDF VPFFENLTTA AIALLFFMHG AKLSREAIIA 
GGGHWRLHLW VMCSTFVLFP ILGVLFAWWK PVNVDPMLYS GFLYLCILPA TVQSAIAFTS
MAGGNVAAAV CSASASSLLG IFLSPLLVGL VMNVHGAGGS LEQVGKIMLQ LLLPFVLGHL
SRPWIGDWVS RNKKWIAKTD QTSILLVVYT AFSEAVVNGI WHKVGWGSLL FIVVVSCVLL
AIVIVVNVFM ARRLSFNKAD EITIVFCGSK KSLANGIPMA NILFPTSVIG MMVLPLMIFH
QIQLMVCAVL ARRYKRQTEQ LQAQQESSAD KA