Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1629 |
Symbol | |
ID | 6146024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1619035 |
End bp | 1619925 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641616505 |
Product | hypothetical protein |
Protein accession | YP_001743683 |
Protein GI | 170681992 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2199] FOG: GGDEF domain |
TIGRFAM ID | [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00134354 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAAGA AGACAACGGA AATTGATGCC ATCTTGTTAA ATCTCAATAA GGCTATCGAT GCCCACTACC AGTGGCTGGT AAGTATGTTT CACAGCGTGG TCGCGAGAGA TGCCAGTAAG CCAGAAATAA CGGATAACCA TTCTTATGGA CTGTGCCAGT TTGGTCGGTG GATTGATCAT CTGGGGCCAC TCGATAACGA TGAATTACCT TACGTTCGGC TAATGGATTC TGCCCATCAA CATATGCATA ACTGTGGTCG GGAATTAATG TTGGCTATTG TTGAAAATCA CTGGCAGGAC GCGCATTTCG ACGCTTTTCA GGAAGGGTTG CTTTCTTTTA CTGCGGCATT AACCGATTAC AAAATTTATT TACTGACGAT CCGTAGCAAT ATGGATGTTT TGACGGGATT GCCGGGTCGT CGGGTTCTTG ATGAATCCTT CGATCATCAG TTACGCAACG CTGAGCCTCT GAATCTTTAT TTAATGTTGT TGGATATTGA CCGATTTAAA TTGGTTAATG ATACCTACGG GCATTTAATC GGCGATGTAG TATTACGCAC CCTGGCAACT TACTTAGCCA GTTGGACGCG TGATTACGAA ACGGTTTATC GCTACGGGGG CGAAGAATTT ATCATTATTG TCAAAGCGGA TAATGATGAA GAAGCATGTC GTGCAGGTAT CAGAATTTGC CAGTTAGTCG ATAACCATGC CATCACACAT TCTGAAGGGC ATATCAACAT TACCGTGACA GCAGGTGTGA GTCGCGCATT TCCTGAAGAG CCTCTGGATG TGGTCATTGG AAGAGCGGAC CGGGCAATGT ATGAGGGTAA GCAAACCGGA AGAAATCGCT GTATGTTTAT TGACGAACAA AACGTGATTC ACCGAGTTTA A
|
Protein sequence | MIKKTTEIDA ILLNLNKAID AHYQWLVSMF HSVVARDASK PEITDNHSYG LCQFGRWIDH LGPLDNDELP YVRLMDSAHQ HMHNCGRELM LAIVENHWQD AHFDAFQEGL LSFTAALTDY KIYLLTIRSN MDVLTGLPGR RVLDESFDHQ LRNAEPLNLY LMLLDIDRFK LVNDTYGHLI GDVVLRTLAT YLASWTRDYE TVYRYGGEEF IIIVKADNDE EACRAGIRIC QLVDNHAITH SEGHINITVT AGVSRAFPEE PLDVVIGRAD RAMYEGKQTG RNRCMFIDEQ NVIHRV
|
| |