Gene EcolC_3749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3749 
Symbol 
ID6068088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4100661 
End bp4102163 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content53% 
IMG OID641603164 
Productprotein of unknown function DUF853 NPT hydrolase putative 
Protein accessionYP_001726683 
Protein GI170021729 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0662864 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAC CCCTGTTAAT TGCCCGCACG CCGGACACAG AACTGTTTTT ACTGCCGGGA 
ATGGCTAACC GTCACGGGCT GATTACTGGC GCAACGGGGA CGGGTAAAAC CGTTACGCTG
CAAAAACTGG CAGAGTCATT GTCGGAAATC GGCGTGCCGG TGTTTATGGC TGATGTGAAA
GGCGATCTGA CCGGTATCGC GCAGGCAGGA ACGGCGTCGG AAAAACTGCT CACAAGGCTT
AAAAATATCG GCGTCAATGA CTGGCAACCG CATGCCAATC CGGTGGTGGT GTGGGATATC
TTTGGCGAGA AAGGCCATCC GGTGCGGGCG ACGGTTTCAG ACCTGGGGCC GCTGTTGCTG
GCGCGGCTGT TGAATCTCAA CGATGTGCAG TCTGGCGTGC TGAATATCAT CTTCCGCATT
GCTGACGATC AGGGGCTGTT ACTGCTCGAC TTTAAAGATT TGCGGGCGAT TACCCAGTAC
ATCGGCGATA ACGCCAAATC TTTCCAGAAT CAGTACGGAA ATATCAGTAG CGCATCGGTT
GGTGCCATCC AGCGCGGATT ACTGTCGCTG GAGCAACAAG GTGCGGAGCA TTTCTTTGGC
GAGCCGATGC TGGATATCAA AGACTGGATG CGCACCGATG CCAACGGTAA AGGCGTTATC
AATATCCTCA GCGCCGAAAA GCTTTATCAG ATGCCGAAAC TATATGCCGC CAGCCTGTTG
TGGATGCTCT CGGAGTTGTA TGAACAATTG CCGGAAGCAG GCGATCTGGA GAAGCCGAAA
CTGGTGTTTT TCTTCGACGA AGCACATCTG CTGTTTAATG ACGCACCGCA GGTACTGCTG
GATAAGATTG AGCAGGTGAT AAGGCTTATT CGCTCAAAAG GCGTGGGCGT CTGGTTCGTT
TCGCAAAACC CGTCTGATAT TCCGGATAAC GTGCTCGGGC AGCTCGGTAA TCGCGTTCAA
CACGCTTTGC GTGCTTTTAC GCCCAAAGAT CAGAAAGCGG TAAAAGCTGC GGCGCAAACC
ATGCGGGCCA ATCCGGCGTT TGATACCGAA AAGGCGATTC AGGAACTGGG CACCGGCGAG
GCGTTAATCT CGTTTCTTGA TGTGAAAGGA AGTCCTTCAG TGGTGGAGCG GGCGATGGTG
ATCGCGCCTT GTTCGCGAAT GGGGCCGGTG ACGGAAGATG AGCGTAATGG CCTGATTAAT
CACTCCCCGG TGTATGGCAA ATATGAGGAT GAGGTGGACC GAGAATCCGC CTATGAGATG
TTGCAAAAAG GCTTTCAGGC CAGTACCGAG CAGCAAAATA ATCCTGCCGC GAAAGGGAAA
GAGGTGGCGG TGGATGACGG TATTCTTGGT GGATTGAAGG ATATTTTGTT TGGCACTACC
GGACCACGCG GCGGGAAGAA AGATGGTGTG GTGCAAACAA TGGCCAAAAG CGCCGCTCGC
CAGGTGACGA ATCAGATTGT ACGCGGGATG TTGGGGAGTT TGCTGGGGGG GAGAAGAAGG
TAA
 
Protein sequence
MSEPLLIART PDTELFLLPG MANRHGLITG ATGTGKTVTL QKLAESLSEI GVPVFMADVK 
GDLTGIAQAG TASEKLLTRL KNIGVNDWQP HANPVVVWDI FGEKGHPVRA TVSDLGPLLL
ARLLNLNDVQ SGVLNIIFRI ADDQGLLLLD FKDLRAITQY IGDNAKSFQN QYGNISSASV
GAIQRGLLSL EQQGAEHFFG EPMLDIKDWM RTDANGKGVI NILSAEKLYQ MPKLYAASLL
WMLSELYEQL PEAGDLEKPK LVFFFDEAHL LFNDAPQVLL DKIEQVIRLI RSKGVGVWFV
SQNPSDIPDN VLGQLGNRVQ HALRAFTPKD QKAVKAAAQT MRANPAFDTE KAIQELGTGE
ALISFLDVKG SPSVVERAMV IAPCSRMGPV TEDERNGLIN HSPVYGKYED EVDRESAYEM
LQKGFQASTE QQNNPAAKGK EVAVDDGILG GLKDILFGTT GPRGGKKDGV VQTMAKSAAR
QVTNQIVRGM LGSLLGGRRR