Gene EcSMS35_2160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2160 
Symbol 
ID6145516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2163691 
End bp2165853 
Gene Length2163 bp 
Protein Length720 aa 
Translation table11 
GC content52% 
IMG OID641617035 
Producthypothetical protein 
Protein accessionYP_001744209 
Protein GI170679898 
COG category[S] Function unknown 
COG ID[COG1289] Predicted membrane protein 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
[TIGR01666] hypothetical membrane protein, TIGR01666
[TIGR01667] integral membrane protein, YccS/YhfK family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00328985 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.533884 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTTTA TGCTAAGTCC TTTGCTCAAA CGCTATACCT GGAACAGCGC CTGGCTGTAT 
TACGCGCGTA TTTTTATTGC GCTTTGTGGA ACCACAGCGT TTCCGTGGTG GCTGGGTGAT
GTAAAACTGA CGATTCCGCT GACGCTGGGG ATGGTGGCAG CGGCGCTGAC CGATCTCGAT
GACCGACTGG CGGGACGTTT GCGTAACCTC ATCATTACGC TGTTCTGCTT TTTTATCGCC
TCGGCCTCAG TAGAACTGCT GTTTCCCTGG CCCTGGCTAT TTGCGATTGG CTTAACGCTC
TCAACCAGCG GCTTCATTTT GCTCGGCGGT CTGGGTCAAC GCTATGCAAC AATTGCCTTC
GGTGCGTTAC TGATCGCCAT TTACACTATG TTGGGGACGT CACTGTATGA GCACTGGTAT
CAGCAGCCGA TGTATCTGCT GGCCGGTGCC GTCTGGTACA ACGTCCTGAC ACTTATTGGT
CATCTACTGT TCCCGGTCCG CCCGCTGCAG GACAACCTGG CGCGTTGCTA TGAACAACTG
GCGCGTTATC TTGAGCTCAA GTCGCGCATG TTTGATCCTG ATATTGAAGA TGAAAGCCAG
GCACCGCTGT ACGATTTGGC TCTCGCCAAC GGTCAGCTGA TGGCGACATT GAATCAGACG
AAACTCTCGC TGCTGACCCG CTTACGTGGC GATCGTGGTC AACGGGGAAC GCGTCGCACA
CTGCATTATT ACTTTGTCGC GCAGGATATT CACGAGCGTG CCAGCTCTTC TCATATTCAG
TATCAAACAT TGCGTGAACA TTTTCGCCAC AGCGACGTGC TGTTCCGTTT TCAGCGGCTG
ATGTCGATGC AGGGCCAGGC GTGCCAGCAA CTGTCACGCT GTATTTTGCT GCGTCAGCCT
TATCAACATG ATCCGCATTT TGAGCGCGCT TTTACGCATA TTGATGCTGC GCTGGAGCGG
ATGCGCGATA ACGGCGCGCC TGCTGATTTA CTCAAAACAC TGGGATTTTT ACTGAACAAT
TTACGTGCCA TTGATGCCCA ACTGGCAACA ATTGAATCAG AACAGGCCCA GGCACTGCCC
CATAATGATG ACGAAAATGA GCTCGCTGAT GACAGCCCGC ACGGATTGAG TGATATCTGG
CTGCGTCTTA GCCGTCACTT CACGCCGGAA TCCGCCCTCT TCCGTCATGC GGTAAGAATG
TCGCTGGTGT TGTGCTTCGG CTACGCCATC ATTCAGATAA CCGGACTGCA TCACGGGTAT
TGGATCTTGC TGACAAGTTT GTTTGTCTGC CAGCCAAACT ATAACGCCAC GCGCCACCGC
CTGAAATTAA GGATTATTGG TACGCTGGTA GGTATCGCGA TTGGCATTCC TGTGCTGTGG
TTTGTGCCGT CACTGGAAGG GCAGCTGGTG CTGCTGGTTA TTACCGGCGT GCTCTTTTTT
GCCTTCCGTA ACGTGCAATA TGCCCATGCA ACGATGTTCA TCACACTTTT GGTGCTACTT
TGTTTTAACT TGCTGGGTGA AGGTTTTGAA GTGGCGTTAC CTCGCGTAAT CGATACGCTG
ATTGGTTGTG CCATTGCGTG GGCGGCGGTG AGCTACATCT GGCCTGACTG GAAGTTTCGC
AATCTGCCGC GCATGCTCGA ACGTGCCACC GAGGCCAACT GTCGCTATCT CGATGCCATA
CTGGAGCAAT ACCATCAGGG GCGTGATAAC CGTCTGGCGT ATCGTATTGC CCGCCGCGAT
GCACACAACC GTGATGCTGA ACTGGCGTCG GTGGTATCAA ATATGTCCAG CGAGCCGAAC
GTTACCCCGC AAATTCGCGA GGCCGCGTTT CGGTTGCTGT GCCTTAACCA TACGTTTACC
AGCTATATCT CAGCCCTCGG TGCTCACCGG GAGCAGTTAA CTAATCCTGA AATTCTGGCG
TTTCTTGATG ACGCAGTTTG CTATGTTGAT GACGCGTTAC ATCATCAACC TGCTGATGAA
GAACGCGTCA ATCAGGCATT AGCTGGCCTG AAACAGCGGA TGCAGCAACT TGAACCACGG
GCAGACAGCA AAGAACCTCT GGTCGTACAA CAAGTTGGGT TATTGATTGC ATTACTGCCA
GAGATTGGTC GTCTGCAACG CCAGATTACT CAAGTTCCGC AGGAAACTCC TGTTTCGGCG
TAA
 
Protein sequence
MAFMLSPLLK RYTWNSAWLY YARIFIALCG TTAFPWWLGD VKLTIPLTLG MVAAALTDLD 
DRLAGRLRNL IITLFCFFIA SASVELLFPW PWLFAIGLTL STSGFILLGG LGQRYATIAF
GALLIAIYTM LGTSLYEHWY QQPMYLLAGA VWYNVLTLIG HLLFPVRPLQ DNLARCYEQL
ARYLELKSRM FDPDIEDESQ APLYDLALAN GQLMATLNQT KLSLLTRLRG DRGQRGTRRT
LHYYFVAQDI HERASSSHIQ YQTLREHFRH SDVLFRFQRL MSMQGQACQQ LSRCILLRQP
YQHDPHFERA FTHIDAALER MRDNGAPADL LKTLGFLLNN LRAIDAQLAT IESEQAQALP
HNDDENELAD DSPHGLSDIW LRLSRHFTPE SALFRHAVRM SLVLCFGYAI IQITGLHHGY
WILLTSLFVC QPNYNATRHR LKLRIIGTLV GIAIGIPVLW FVPSLEGQLV LLVITGVLFF
AFRNVQYAHA TMFITLLVLL CFNLLGEGFE VALPRVIDTL IGCAIAWAAV SYIWPDWKFR
NLPRMLERAT EANCRYLDAI LEQYHQGRDN RLAYRIARRD AHNRDAELAS VVSNMSSEPN
VTPQIREAAF RLLCLNHTFT SYISALGAHR EQLTNPEILA FLDDAVCYVD DALHHQPADE
ERVNQALAGL KQRMQQLEPR ADSKEPLVVQ QVGLLIALLP EIGRLQRQIT QVPQETPVSA