Gene Information |
Locus tag | EcSMS35_4879 |
Symbol | |
ID | 6147367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4994469 |
End bp | 4996442 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641619683 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001746790 |
Protein GI | 170681566 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| |
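The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length_bp(4994469, 4996442)
    print(length)                     # 1974
    print(protein_length_aa(length))  # 657
```

Both values agree with the Gene Length and Protein Length rows of the record.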
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.511853 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTCAA CATTTAACAT ATACCAGGAT ATTCTCCCAG CATTCAACAT GTATTCGGGA CTAAAACCTT GCCATGAAAA AAATAACCAA CCATTTGATA TTAACACGGA AATTGAAACC ATACAAAAAC AAATTAATTA TGATATAAAT CATTTGAATG ACGGTTTGAT TAAGCGTGTA CTGAATCTTT TTATTCACCT TATCTCTAAT CCCGATAATC TCGAATTAAC CTTAAATCGA TATTCATCAA CAACAGAGCA AATCATCGGC AGAACCAAAA GAAATAGTTT ACATGAGTTT GAAGTTGGCG ATCTAAAAAT AATATTTAAT CGACAAGATG ATAATGAAAG CGTATTAACT ATTAAACACA AAGATATAAG TCATGGTTGC AATGTTAAAA CCGAGCAACT GCAGCAGTTT ATTAAAATAA TGGAACAAAA AGCGCAACTA CCAATCTATA TTGACAAGAA CAATTTGAAA GAGAGTATTT TCTCTGTTTT GCGAAATGAC CCACAGCACG TAGATAAAGA GCAATACCTT CCCTGTGATA AGTTTTTAAC ACATGCCTGC AAAAATTCAA ATTCATTTGA AGTGAAATTA GATGCCACTC ATCAATATCA ACATCTGAAT AACTTCATGA TTTCTTTTGA CCCAGTAGAA AATCAATTAA CAATACGGGA TAACAATAAC GAGACTGAAA CTATCTCGTT GACAAACTTA CAATGGGAAA ATGTGCTGCA ATACTACAGA GAAAACCACC AGCAGCCAAA TATAGCAGGA TCACGAAATC TCACGGATAA TATAGATAAA ATTAAAAATA CAATATCCAC CTCTGAAATT ATAGAGTGCG CCTCTCCTGA AATAAGAAGT AGCGTCCTGA ACGATCTTTA TAGCATTGCT AATTTCCTCC CGGACAAAAA TCTGACACCA AATGAGAGCT GGAAAAGATT TTGCCATACA TGCGAGCGCT TTTATGTTGC TCAGAAGAGT ATCACTGGAG ATAACAGTGA ACGTCTTACG CGAAAAGTCT CTATCTCTGA TGCAGGAATT ACAATGACCT TCAAGATAGG TGATGTTGTC ATCAATACTA TTAGCACTGC TATCCCTGAA GATGAATCGG GTCAACGGTG TATAGAAGGG TTGAATTTAG CAGAGATGGA TTTAACCGGC ATAGACTTGT CGAAAATGGC GCTAAGGAAT GTCAATTTTA ATGGCAGCAT TCTTAGAAAT GCGGATTTCT CCGGTACGAT CTGTGAAGGC GTGGATTTTA CCGATTGTGA TCTCCGTTAT GCAACCTTTA TTGACGCCTC ATTAGAAAAA ATCGATTTTC GTAAAGTTCG CCACTTGTTT AATATAAATT TTACAAATGC AAATCTACGG AACAGCAACT TCAGCGGAAA AGTTCTCACT GGCGTTAATT TTACTGGAAG TGACCTTAGT AACGCTTATC TTGAACACAT AGATTTCACA AAAGTGATTT TCTTTCCGTC ATTAATTATT GGAGCAGTAT TTGATAATTC CAATCTATCA GAGAAAAATC TTTCAGACAA AGATCTTACT AATATTAGCT GCATGTATAC CAATTTTACT AACGCTAATT TAACAAAATG CAAACTCTTA AATACAAACT TTTCGGCTGC AAAATTTGAC AATACTAATT TCACTGGTAC AAAAGGTTCT AATATTCTGT TTAACCATGC ATGGTTGTTC AATACGATAT TTATAGATAC GATTTTTAAA AATGCCTGTT TTTTCAATGC CAAAGTGAAT AATGTTTCTC TTAAGAAAGC ATATATTTAC 
AATGATAATA TCGATAAAAA AGCCAATGAC AGTACCGATA AACAAGCCAA GAACAGTACC GAGCAACAGG ACAGTACCAG TTTTAATCAA GCCCGTTTAA AGAAAGAAGT GAATAGCAGT TTTTCCATTC CGGGTTTAAC GTCTTATCAG CCAACGTATA TAGTTGACGA GTAG
|
Protein sequence | MDSTFNIYQD ILPAFNMYSG LKPCHEKNNQ PFDINTEIET IQKQINYDIN HLNDGLIKRV LNLFIHLISN PDNLELTLNR YSSTTEQIIG RTKRNSLHEF EVGDLKIIFN RQDDNESVLT IKHKDISHGC NVKTEQLQQF IKIMEQKAQL PIYIDKNNLK ESIFSVLRND PQHVDKEQYL PCDKFLTHAC KNSNSFEVKL DATHQYQHLN NFMISFDPVE NQLTIRDNNN ETETISLTNL QWENVLQYYR ENHQQPNIAG SRNLTDNIDK IKNTISTSEI IECASPEIRS SVLNDLYSIA NFLPDKNLTP NESWKRFCHT CERFYVAQKS ITGDNSERLT RKVSISDAGI TMTFKIGDVV INTISTAIPE DESGQRCIEG LNLAEMDLTG IDLSKMALRN VNFNGSILRN ADFSGTICEG VDFTDCDLRY ATFIDASLEK IDFRKVRHLF NINFTNANLR NSNFSGKVLT GVNFTGSDLS NAYLEHIDFT KVIFFPSLII GAVFDNSNLS EKNLSDKDLT NISCMYTNFT NANLTKCKLL NTNFSAAKFD NTNFTGTKGS NILFNHAWLF NTIFIDTIFK NACFFNAKVN NVSLKKAYIY NDNIDKKAND STDKQAKNST EQQDSTSFNQ ARLKKEVNSS FSIPGLTSYQ PTYIVDE
|
| |
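The sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The sketch below is one possible way to strip the spacing, estimate GC content, and translate the coding strand. It assumes the gene sequence is already given in coding orientation, and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ in permitted start codons). All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three and last three blocks of the gene sequence in this record.
    head = clean_sequence("ATGGACTCAA CATTTAACAT ATACCAGGAT")
    tail = clean_sequence("CCAACGTATA TAGTTGACGA GTAG")
    print(translate(head))  # MDSTFNIYQD
    print(translate(tail))  # PTYIVDE
```

The two translated fragments match the start and end of the protein sequence listed in the record. Running `gc_fraction` on the full cleaned gene sequence gives a value to compare against the GC content field.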