Gene Information |
Locus tag | EcSMS35_0289 |
Symbol | |
ID | 6146989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 296996 |
End bp | 298135 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615186 |
Product | hypothetical protein |
Protein accession | YP_001742395 |
Protein GI | 170682374 |
COG category | [S] Function unknown |
COG ID | [COG1690] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR03073] release factor H-coupled RctB family protein |
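The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a gene length that includes the stop codon; the function names `feature_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
gene_len = feature_length(296996, 298135)
print(gene_len)                  # 1140
print(protein_length(gene_len))  # 379
```

Both results agree with the "Gene Length" and "Protein Length" rows of the record.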
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.15887 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAAAT ATATTCGTCC CTTATCTGAT GCGGTACTTA CCATCGCATC TGATGACCTG TGGATCGAGA GTTCAGCGAT CCAACAATTA CACACCACGG CAAATTTACC CGACATGCAA CGCGTAGTAG GGATGCCAGA TTTACACCCC GGACGCGGCT ATCCGATTGG CGCAGCGTTC TTCTCCGTAG GCCGTTTTTA CCCGGCACTG GTCGGCAATG ATATCGGCTG CGGTATGGCG CTATGGCAAA CAGATATTTT CGCGCGCAAA TACAACGCCG ATAAGTTTGA AAAGCGATTA TCTGCGCTGG ATGACGTTGC TGAAGAAAGC TGGCTGGAGG AAAACCTGCC GTCAGCGTTA GCACAGCATC CGTGGCGCAG CTCGCTGGGT TCCATCGGTG GCGGAAACCA CTTCGCAGAA CTGCAACAAG TTGATGAAAT TATCGACGCT GAACTGTTTG CACTGACAGG TCTGGATGCG CAGCATCTGC AACTGCTGGT TCATAGCGGC TCGCGGGGTT TGGGCCAGTC TATTTTACAG CGGCATATTG CCTCGTTTTC GCATCATGGT TTGCCTGAAG GCAGTGACGA CGCGCTAAGT TATATTGTGG AACATGATGA TGCGCTGGCG TTTGCGCGTA TTAATCGCCA GCTGATCGCT TTGCGCATAA TGCAACAGAT TAAGGCCAAC GGTAATTCGG TTCTGGATGT GGCGCATAAC TTTGTTAGCG CGTGTCGAAT CGGTGATCAA CAGGGCTGGT TGCATCGTAA AGGTGCCACA CCGGATGACA ACGGTCTGGT GATTATTCCC GGTTCACGCG GTGATTACTC CTGGCTGGTT CAGCCCGTCA GGAGTGAGGA AACATTGCAT TCGCTGGCGC ATGGGGCTGG GCGTAAATGG GGGCGCACCG AGTGTAAAGG GCGTCTGGCA GCGAAATACA CAGCGACGCA GCTCTCACGT ACTGAACTTG GCAGCCGGGT AATTTGTCGC GATAAACAAC TCATCTTTGA AGAAGCGCCA CAAGCTTATA AATCGGCTGA AAGCGTGGTG CAATGTCTGG TGCAGGCTGG GTTAATTATT CCTGTCGCGC GACTGCGTCC GGTGCTAACG CTCAAAAACA GTGGAGGGAA AAAAGGATGA
|
Protein sequence | MGKYIRPLSD AVLTIASDDL WIESSAIQQL HTTANLPDMQ RVVGMPDLHP GRGYPIGAAF FSVGRFYPAL VGNDIGCGMA LWQTDIFARK YNADKFEKRL SALDDVAEES WLEENLPSAL AQHPWRSSLG SIGGGNHFAE LQQVDEIIDA ELFALTGLDA QHLQLLVHSG SRGLGQSILQ RHIASFSHHG LPEGSDDALS YIVEHDDALA FARINRQLIA LRIMQQIKAN GNSVLDVAHN FVSACRIGDQ QGWLHRKGAT PDDNGLVIIP GSRGDYSWLV QPVRSEETLH SLAHGAGRKW GRTECKGRLA AKYTATQLSR TELGSRVICR DKQLIFEEAP QAYKSAESVV QCLVQAGLII PVARLRPVLT LKNSGGKKG
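The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. The sketch below is an assumption-laden illustration rather than the database's own method: it hard-codes the standard codon assignments (which translation table 11 shares for internal codons), strips the blank-separated 10-character blocks used in the display, and uses the hypothetical helper names `clean`, `translate`, and `gc_percent`.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for internal codons;
# it differs from table 1 only in the set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character display blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased here;
    the gene in this record starts with ATG, so that is not needed.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three display blocks of the gene sequence above.
prefix = "ATGGGCAAAT ATATTCGTCC CTTATCTGAT"
print(translate(prefix))  # MGKYIRPLSD
```

Passing the full "Gene sequence" field to `translate` and `gc_percent` gives a way to compare the result against the "Protein sequence" and "GC content" rows.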