Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2322 |
Symbol | |
ID | 6144317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2353487 |
End bp | 2354473 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617196 |
Product | CobW/P47K family protein |
Protein accession | YP_001744369 |
Protein GI | 170681990 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.366233 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.00913363 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACCAGAA CCAACCTCAT CACCGGTTTT CTCGGCAGCG GGAAAACTAC GTCGATTCTT CATCTGTTAG CCCATAAAGA TCCCAACGAA AAATGGGCGG TACTGGTTAA TGAATTTGGG GAAGTCGGAA TTGATGGTGC TTTGCTCGCC GATAGCGGCG CATTGTTGAA AGAGATCCCC GGCGGCTGTA TGTGCTGCGT TAATGGTTTA CCCATGCAGG TAGGGTTGAA TACCTTACTG CGTCAGGGAA AACCAGACCG CTTGTTGATA GAGCCGACCG GGCTGGGCCA TCCGAAACAG ATCCTCGATC TCTTAACCGC GCCCGTGTAT GAACCGTGGA TAGATCTGCG CGCCACGTTG TGCATTCTCG ATCCACGCCT GCTGCTGGAC GAAAAAAGCG CCAGCAATGA AAACTTCCGT GACCAGCTGG CTGCCGCAGA CATTATTGTC GCCAATAAAT CCGACCGTGC GACGCCCGAA AGTGAGCAAG CGCTACAGCG CTGGTGGCAG CAAAATGGTG GCGATCGGCA ATTAATTCAC AGCGCGCATG GGAAAGTTGA CGGTCATCTT CTGGATTTGC CGCGTCGCAA TTTAGCCGAG TTGCCCGCCA GCGCCGCGCA TTCTCATCAG CATGTCGTGA AAAAAGGGTT AGCAGCGTTA AGCCTGCCAG AGCATCAACG CTGGCGTCGC AGTCTGAACA GCGGGCAAGG ATATCAGGCC TGCGGCTGGA TATTCGACGC TGATACGGTA TTCGACACCA TTGGCATTCT GGAATGGGCG CGACTTGCAC CGGTAGAACG CGTCAAAGGC GTGCTGCGTA TTCCCGAAGG GCTGGTGCGA ATCAACCGTC AGGGCGATGA CCTGCACATT GAAACGCAAA ACGTTGCGCC ACCGGACAGC CGTATTGAGC TGATTTCCAG CAGCGAAGCT GACTGGAATG CCTTGCAGAG CGCGCTGTTG AAGCTTCGTT TAGCGACTAC CGCGTAA
|
Protein sequence | MTRTNLITGF LGSGKTTSIL HLLAHKDPNE KWAVLVNEFG EVGIDGALLA DSGALLKEIP GGCMCCVNGL PMQVGLNTLL RQGKPDRLLI EPTGLGHPKQ ILDLLTAPVY EPWIDLRATL CILDPRLLLD EKSASNENFR DQLAAADIIV ANKSDRATPE SEQALQRWWQ QNGGDRQLIH SAHGKVDGHL LDLPRRNLAE LPASAAHSHQ HVVKKGLAAL SLPEHQRWRR SLNSGQGYQA CGWIFDADTV FDTIGILEWA RLAPVERVKG VLRIPEGLVR INRQGDDLHI ETQNVAPPDS RIELISSSEA DWNALQSALL KLRLATTA
|
| |