Gene ECD_02958 details


Gene Information

Locus tag: ECD_02958
Symbol: ygjU
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 3106243
End bp: 3107487
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: sodium:serine/threonine symporter
Protein accession: ACT44762
Protein GI: 253979092
COG category:
COG ID:
TIGRFAM ID:
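
The start and end coordinates, gene length, and protein length listed in this record agree with one another, which a little arithmetic confirms. The Python sketch below makes two assumptions: the coordinates are 1-based and inclusive at both ends, and the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 3106243
end_bp = 3107487

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1245 414
```

Under these assumptions the computed values match the listed Gene Length (1245 bp) and Protein Length (414 aa).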


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTACGC AACGTTCACC GGGGCTATTC CGGCGTCTGG CTCATGGCAG CCTGGTAAAA 
CAAATCCTGG TCGGCCTTGT TCTGGGGATT CTTCTGGCAT GGATCTCAAA ACCCGCGGCG
GAAGCTGTTG GTCTGTTAGG TACTTTGTTC GTCGGCGCAC TGAAAGCTGT TGCCCCCATC
CTGGTGCTGA TGCTGGTAAT GGCATCTATT GCTAACCACC AGCACGGGCA GAAAACCAAT
ATCCGTCCTA TTTTGTTCCT CTATCTGCTG GGCACCTTCT CTGCAGCTCT GGCCGCAGTA
GTCTTCAGCT TTGCCTTCCC TTCTACCCTG CACTTGTCCA GTAGCGCGGG TGATATTTCG
CCGCCGTCAG GCATTGTCGA AGTGATGCGC GGACTGGTAA TGAGCATGGT TTCCAACCCC
ATTGACGCGC TGCTGAAAGG TAACTACATC GGGATCCTGG TGTGGGCAAT TGGCCTCGGC
TTCGCACTGC GTCACGGTAA CGAGACCACC AAAAACCTGG TCAACGATAT GTCGAATGCC
GTCACCTTTA TGGTGAAACT GGTCATTCGC TTCGCACCGA TCGGTATTTT TGGGCTGGTT
TCTTCTACCC TGGCAACCAC CGGTTTCTCC ACACTGTGGG GCTACGCGCA ACTGCTGGTC
GTGCTGGTTG GCTGTATGTT ACTGGTGGCG CTGGTGGTTA ACCCATTGCT GGTGTGGTGG
AAAATTCGTC GTAACCCGTT CCCGCTGGTG CTGCTGTGCC TGCGCGAAAG CGGCGTGTAT
GCCTTCTTCA CCCGCAGCTC TGCAGCTAAC ATTCCGGTGA ATATGGCGCT GTGTGAAAAG
CTGAATCTGG ATCGCGATAC CTATTCCGTT TCTATTCCGC TGGGTGCCAC CATCAATATG
GCGGGCGCAG CAATCACCAT TACCGTGTTG ACGCTGGCTG CGGTTAATAC GCTGGGTATT
CCGGTCGATC TGCCCACAGC GCTGCTGTTG AGCGTAGTGG CTTCTCTGTG TGCCTGTGGC
GCATCCGGCG TGGCAGGGGG GTCTCTGCTG CTGATCCCAC TGGCCTGTAA TATGTTCGGT
ATTTCGAACG ATATCGCCAT GCAGGTGGTT GCCGTCGGCT TTATCATCGG CGTATTGCAG
GACTCTTGCG AAACCGCGCT GAACTCTTCA ACTGACGTGC TGTTCACTGC GGCAGCTTGC
CAGGCAGAAG ACGGTCGTCT GGCAAATAGC GCCCTGCGTA ATTAA
 
Protein sequence
MTTQRSPGLF RRLAHGSLVK QILVGLVLGI LLAWISKPAA EAVGLLGTLF VGALKAVAPI 
LVLMLVMASI ANHQHGQKTN IRPILFLYLL GTFSAALAAV VFSFAFPSTL HLSSSAGDIS
PPSGIVEVMR GLVMSMVSNP IDALLKGNYI GILVWAIGLG FALRHGNETT KNLVNDMSNA
VTFMVKLVIR FAPIGIFGLV SSTLATTGFS TLWGYAQLLV VLVGCMLLVA LVVNPLLVWW
KIRRNPFPLV LLCLRESGVY AFFTRSSAAN IPVNMALCEK LNLDRDTYSV SIPLGATINM
AGAAITITVL TLAAVNTLGI PVDLPTALLL SVVASLCACG ASGVAGGSLL LIPLACNMFG
ISNDIAMQVV AVGFIIGVLQ DSCETALNSS TDVLFTAAAC QAEDGRLANS ALRN
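
As a sanity check, the start of the gene sequence can be translated and compared with the start of the protein sequence. The Python sketch below builds the standard codon-to-amino-acid assignments (translation table 11 shares these with the standard code and differs in which start codons are permitted), translates the first 30 bases of the gene sequence, and computes the GC fraction of that prefix. The function names `translate` and `gc_fraction` are hypothetical helpers written for this illustration, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order: bases cycle through T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = dna.replace(" ", "").upper()
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
prefix = "ATGACTACGC AACGTTCACC GGGGCTATTC"
print(translate(prefix))  # prints: MTTQRSPGLF
print(gc_fraction(prefix))
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to the same functions would be expected to reproduce the 414-residue protein and a GC fraction close to the listed 55%, although that has not been run here.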