Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3749 |
Symbol | |
ID | 6068088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4100661 |
End bp | 4102163 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641603164 |
Product | protein of unknown function DUF853 NPT hydrolase putative |
Protein accession | YP_001726683 |
Protein GI | 170021729 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0662864 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAC CCCTGTTAAT TGCCCGCACG CCGGACACAG AACTGTTTTT ACTGCCGGGA ATGGCTAACC GTCACGGGCT GATTACTGGC GCAACGGGGA CGGGTAAAAC CGTTACGCTG CAAAAACTGG CAGAGTCATT GTCGGAAATC GGCGTGCCGG TGTTTATGGC TGATGTGAAA GGCGATCTGA CCGGTATCGC GCAGGCAGGA ACGGCGTCGG AAAAACTGCT CACAAGGCTT AAAAATATCG GCGTCAATGA CTGGCAACCG CATGCCAATC CGGTGGTGGT GTGGGATATC TTTGGCGAGA AAGGCCATCC GGTGCGGGCG ACGGTTTCAG ACCTGGGGCC GCTGTTGCTG GCGCGGCTGT TGAATCTCAA CGATGTGCAG TCTGGCGTGC TGAATATCAT CTTCCGCATT GCTGACGATC AGGGGCTGTT ACTGCTCGAC TTTAAAGATT TGCGGGCGAT TACCCAGTAC ATCGGCGATA ACGCCAAATC TTTCCAGAAT CAGTACGGAA ATATCAGTAG CGCATCGGTT GGTGCCATCC AGCGCGGATT ACTGTCGCTG GAGCAACAAG GTGCGGAGCA TTTCTTTGGC GAGCCGATGC TGGATATCAA AGACTGGATG CGCACCGATG CCAACGGTAA AGGCGTTATC AATATCCTCA GCGCCGAAAA GCTTTATCAG ATGCCGAAAC TATATGCCGC CAGCCTGTTG TGGATGCTCT CGGAGTTGTA TGAACAATTG CCGGAAGCAG GCGATCTGGA GAAGCCGAAA CTGGTGTTTT TCTTCGACGA AGCACATCTG CTGTTTAATG ACGCACCGCA GGTACTGCTG GATAAGATTG AGCAGGTGAT AAGGCTTATT CGCTCAAAAG GCGTGGGCGT CTGGTTCGTT TCGCAAAACC CGTCTGATAT TCCGGATAAC GTGCTCGGGC AGCTCGGTAA TCGCGTTCAA CACGCTTTGC GTGCTTTTAC GCCCAAAGAT CAGAAAGCGG TAAAAGCTGC GGCGCAAACC ATGCGGGCCA ATCCGGCGTT TGATACCGAA AAGGCGATTC AGGAACTGGG CACCGGCGAG GCGTTAATCT CGTTTCTTGA TGTGAAAGGA AGTCCTTCAG TGGTGGAGCG GGCGATGGTG ATCGCGCCTT GTTCGCGAAT GGGGCCGGTG ACGGAAGATG AGCGTAATGG CCTGATTAAT CACTCCCCGG TGTATGGCAA ATATGAGGAT GAGGTGGACC GAGAATCCGC CTATGAGATG TTGCAAAAAG GCTTTCAGGC CAGTACCGAG CAGCAAAATA ATCCTGCCGC GAAAGGGAAA GAGGTGGCGG TGGATGACGG TATTCTTGGT GGATTGAAGG ATATTTTGTT TGGCACTACC GGACCACGCG GCGGGAAGAA AGATGGTGTG GTGCAAACAA TGGCCAAAAG CGCCGCTCGC CAGGTGACGA ATCAGATTGT ACGCGGGATG TTGGGGAGTT TGCTGGGGGG GAGAAGAAGG TAA
|
Protein sequence | MSEPLLIART PDTELFLLPG MANRHGLITG ATGTGKTVTL QKLAESLSEI GVPVFMADVK GDLTGIAQAG TASEKLLTRL KNIGVNDWQP HANPVVVWDI FGEKGHPVRA TVSDLGPLLL ARLLNLNDVQ SGVLNIIFRI ADDQGLLLLD FKDLRAITQY IGDNAKSFQN QYGNISSASV GAIQRGLLSL EQQGAEHFFG EPMLDIKDWM RTDANGKGVI NILSAEKLYQ MPKLYAASLL WMLSELYEQL PEAGDLEKPK LVFFFDEAHL LFNDAPQVLL DKIEQVIRLI RSKGVGVWFV SQNPSDIPDN VLGQLGNRVQ HALRAFTPKD QKAVKAAAQT MRANPAFDTE KAIQELGTGE ALISFLDVKG SPSVVERAMV IAPCSRMGPV TEDERNGLIN HSPVYGKYED EVDRESAYEM LQKGFQASTE QQNNPAAKGK EVAVDDGILG GLKDILFGTT GPRGGKKDGV VQTMAKSAAR QVTNQIVRGM LGSLLGGRRR
|
| |