Gene Information |
Locus tag | EcSMS35_2160 |
Symbol | |
ID | 6145516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2163691 |
End bp | 2165853 |
Gene Length | 2163 bp |
Protein Length | 720 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617035 |
Product | hypothetical protein |
Protein accession | YP_001744209 |
Protein GI | 170679898 |
COG category | [S] Function unknown |
COG ID | [COG1289] Predicted membrane protein |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type; [TIGR01666] hypothetical membrane protein, TIGR01666; [TIGR01667] integral membrane protein, YccS/YhfK family |
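The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2163691
end_bp = 2165853
protein_length_aa = 720

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is the stop codon and is not translated.
coding_codons = codons - 1

print(gene_length_bp)  # 2163, matches "Gene Length"
print(remainder)       # 0, the CDS is a whole number of codons
print(coding_codons)   # 720, matches "Protein Length"
```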
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00328985 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.533884 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTTTA TGCTAAGTCC TTTGCTCAAA CGCTATACCT GGAACAGCGC CTGGCTGTAT TACGCGCGTA TTTTTATTGC GCTTTGTGGA ACCACAGCGT TTCCGTGGTG GCTGGGTGAT GTAAAACTGA CGATTCCGCT GACGCTGGGG ATGGTGGCAG CGGCGCTGAC CGATCTCGAT GACCGACTGG CGGGACGTTT GCGTAACCTC ATCATTACGC TGTTCTGCTT TTTTATCGCC TCGGCCTCAG TAGAACTGCT GTTTCCCTGG CCCTGGCTAT TTGCGATTGG CTTAACGCTC TCAACCAGCG GCTTCATTTT GCTCGGCGGT CTGGGTCAAC GCTATGCAAC AATTGCCTTC GGTGCGTTAC TGATCGCCAT TTACACTATG TTGGGGACGT CACTGTATGA GCACTGGTAT CAGCAGCCGA TGTATCTGCT GGCCGGTGCC GTCTGGTACA ACGTCCTGAC ACTTATTGGT CATCTACTGT TCCCGGTCCG CCCGCTGCAG GACAACCTGG CGCGTTGCTA TGAACAACTG GCGCGTTATC TTGAGCTCAA GTCGCGCATG TTTGATCCTG ATATTGAAGA TGAAAGCCAG GCACCGCTGT ACGATTTGGC TCTCGCCAAC GGTCAGCTGA TGGCGACATT GAATCAGACG AAACTCTCGC TGCTGACCCG CTTACGTGGC GATCGTGGTC AACGGGGAAC GCGTCGCACA CTGCATTATT ACTTTGTCGC GCAGGATATT CACGAGCGTG CCAGCTCTTC TCATATTCAG TATCAAACAT TGCGTGAACA TTTTCGCCAC AGCGACGTGC TGTTCCGTTT TCAGCGGCTG ATGTCGATGC AGGGCCAGGC GTGCCAGCAA CTGTCACGCT GTATTTTGCT GCGTCAGCCT TATCAACATG ATCCGCATTT TGAGCGCGCT TTTACGCATA TTGATGCTGC GCTGGAGCGG ATGCGCGATA ACGGCGCGCC TGCTGATTTA CTCAAAACAC TGGGATTTTT ACTGAACAAT TTACGTGCCA TTGATGCCCA ACTGGCAACA ATTGAATCAG AACAGGCCCA GGCACTGCCC CATAATGATG ACGAAAATGA GCTCGCTGAT GACAGCCCGC ACGGATTGAG TGATATCTGG CTGCGTCTTA GCCGTCACTT CACGCCGGAA TCCGCCCTCT TCCGTCATGC GGTAAGAATG TCGCTGGTGT TGTGCTTCGG CTACGCCATC ATTCAGATAA CCGGACTGCA TCACGGGTAT TGGATCTTGC TGACAAGTTT GTTTGTCTGC CAGCCAAACT ATAACGCCAC GCGCCACCGC CTGAAATTAA GGATTATTGG TACGCTGGTA GGTATCGCGA TTGGCATTCC TGTGCTGTGG TTTGTGCCGT CACTGGAAGG GCAGCTGGTG CTGCTGGTTA TTACCGGCGT GCTCTTTTTT GCCTTCCGTA ACGTGCAATA TGCCCATGCA ACGATGTTCA TCACACTTTT GGTGCTACTT TGTTTTAACT TGCTGGGTGA AGGTTTTGAA GTGGCGTTAC CTCGCGTAAT CGATACGCTG ATTGGTTGTG CCATTGCGTG GGCGGCGGTG AGCTACATCT GGCCTGACTG GAAGTTTCGC AATCTGCCGC GCATGCTCGA ACGTGCCACC GAGGCCAACT GTCGCTATCT CGATGCCATA CTGGAGCAAT ACCATCAGGG GCGTGATAAC CGTCTGGCGT ATCGTATTGC CCGCCGCGAT GCACACAACC GTGATGCTGA ACTGGCGTCG GTGGTATCAA ATATGTCCAG CGAGCCGAAC 
GTTACCCCGC AAATTCGCGA GGCCGCGTTT CGGTTGCTGT GCCTTAACCA TACGTTTACC AGCTATATCT CAGCCCTCGG TGCTCACCGG GAGCAGTTAA CTAATCCTGA AATTCTGGCG TTTCTTGATG ACGCAGTTTG CTATGTTGAT GACGCGTTAC ATCATCAACC TGCTGATGAA GAACGCGTCA ATCAGGCATT AGCTGGCCTG AAACAGCGGA TGCAGCAACT TGAACCACGG GCAGACAGCA AAGAACCTCT GGTCGTACAA CAAGTTGGGT TATTGATTGC ATTACTGCCA GAGATTGGTC GTCTGCAACG CCAGATTACT CAAGTTCCGC AGGAAACTCC TGTTTCGGCG TAA
|
Protein sequence | MAFMLSPLLK RYTWNSAWLY YARIFIALCG TTAFPWWLGD VKLTIPLTLG MVAAALTDLD DRLAGRLRNL IITLFCFFIA SASVELLFPW PWLFAIGLTL STSGFILLGG LGQRYATIAF GALLIAIYTM LGTSLYEHWY QQPMYLLAGA VWYNVLTLIG HLLFPVRPLQ DNLARCYEQL ARYLELKSRM FDPDIEDESQ APLYDLALAN GQLMATLNQT KLSLLTRLRG DRGQRGTRRT LHYYFVAQDI HERASSSHIQ YQTLREHFRH SDVLFRFQRL MSMQGQACQQ LSRCILLRQP YQHDPHFERA FTHIDAALER MRDNGAPADL LKTLGFLLNN LRAIDAQLAT IESEQAQALP HNDDENELAD DSPHGLSDIW LRLSRHFTPE SALFRHAVRM SLVLCFGYAI IQITGLHHGY WILLTSLFVC QPNYNATRHR LKLRIIGTLV GIAIGIPVLW FVPSLEGQLV LLVITGVLFF AFRNVQYAHA TMFITLLVLL CFNLLGEGFE VALPRVIDTL IGCAIAWAAV SYIWPDWKFR NLPRMLERAT EANCRYLDAI LEQYHQGRDN RLAYRIARRD AHNRDAELAS VVSNMSSEPN VTPQIREAAF RLLCLNHTFT SYISALGAHR EQLTNPEILA FLDDAVCYVD DALHHQPADE ERVNQALAGL KQRMQQLEPR ADSKEPLVVQ QVGLLIALLP EIGRLQRQIT QVPQETPVSA
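The gene and protein sequences above are related through translation table 11, as listed in the Gene Information table. The following is a minimal sketch of that translation in plain Python, not the database's own pipeline. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is built from the standard amino-acid string for table 11 in TCAG order. Only the first and last few codons of the gene are used here, to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 33 bases of the gene sequence shown above.
gene_start = "ATGGCCTTTA TGCTAAGTCC TTTGCTCAAA"
gene_end = "CAAGTTCCGC AGGAAACTCC TGTTTCGGCG TAA"

print(translate(gene_start))  # MAFMLSPLLK
print(translate(gene_end))    # QVPQETPVSA
```

Passing the complete gene sequence to `translate` should give the full protein under the same assumptions, and `gc_percent` can be applied to it for comparison with the GC content field.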
|
| |