Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0664 |
Symbol | |
ID | 6146372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 676175 |
End bp | 677152 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641615555 |
Product | hypothetical protein |
Protein accession | YP_001742761 |
Protein GI | 170682713 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.575154 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTTTA CGTCAAGTTG CTGCGATAAT TTATCAATAG AAAAGATTGT TAAACGTGCT GAAAAAGGCG ATAGCGAGGC TCAATATATT GTTGGGTTTT ATTATAATCG CGATAGCGCA ATTGACTCTC CGGACGACGA AAAAGCCTTC TACTGGCTGA AGCTTGCCGC TGAACAAGGC CATTGTGAAG CACAATATTC CTTAGGGCGG AAATACTCCG AGGATAAAAG CCGTCATAAA GATAATGAGC AAGCCATCTT TTGGCTGAAA AAAGCTGCCC TACAAGGCCA TACTTTCGCT TCCAACGCCC TTGGCTGGAT ACTGGATCGT GGAGAAGACC CCAACTATAA AGAAGCGGTT GCCTGGTATC AGATAGCCGC GGAGAGCGGA ATGCCTTATG CGCAAAATAA TCTTGGATGG ATGTACAGAA ATGGTAACGG AGTCGCACAA GACTATGCGC TGGCATTTTT CTGGTACAAA CAAGCTGCAT TACAAGGCCA TAGTTACGCG CAAGACAACC TGGCCGATCT CTATAAAGAC GGAGAAGGCG TTGCTCAAAA CAAGACACTC GCCGCATTCT GGTATTTGAA AAGCGCACAG CAGGGTAATC GGCATGCCCA GTTTCAAATT GCATGGGATT ATAACGCTGG CGAAGGGGTG GACCAGGACT ATAAGCAAGC GATGTACTGG TATCTGAAGG CTGCCGCTCA GGAAAGCGTC GGCGCTTACG TCAACATCGG TTATATGTAT AAACATGGAC AAGGCGTTGA GAAAGATTAT CAGGCCGCCT TTGAATGGTT TATGAAAGCC GCTGAATGCA ATGACGCCAC TGCCTGGTAT AACCTGGCCA TTATGTATCA CTACGGAGAA GGAAGACCTG TCGATCTCCG ACAGGCTCTC GACCTGTATC GTAAAGTTCA GGCATCCGGT ACCAGGGATG TCAGTCAAGA AATCCGTGAG ATTGAAGATT TACTGTAA
|
Protein sequence | MIFTSSCCDN LSIEKIVKRA EKGDSEAQYI VGFYYNRDSA IDSPDDEKAF YWLKLAAEQG HCEAQYSLGR KYSEDKSRHK DNEQAIFWLK KAALQGHTFA SNALGWILDR GEDPNYKEAV AWYQIAAESG MPYAQNNLGW MYRNGNGVAQ DYALAFFWYK QAALQGHSYA QDNLADLYKD GEGVAQNKTL AAFWYLKSAQ QGNRHAQFQI AWDYNAGEGV DQDYKQAMYW YLKAAAQESV GAYVNIGYMY KHGQGVEKDY QAAFEWFMKA AECNDATAWY NLAIMYHYGE GRPVDLRQAL DLYRKVQASG TRDVSQEIRE IEDLL
|
| |