Gene EcolC_1202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1202 
Symbol 
ID6067451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1316881 
End bp1318896 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content55% 
IMG OID641600617 
Productprotein of unknown function DUF699 ATPase putative 
Protein accessionYP_001724195 
Protein GI170019241 
COG category[R] General function prediction only 
COG ID[COG1444] Predicted P-loop ATPase fused to an acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.322003 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAAC TGACTGCGCT TCACACATTA ACAGCGCAAA TGAAACGTGA AGGGGTCCGC 
CGCTTGCTGG TGTTGAGCGG GGAAGAGGGT TGGTGTTTTG ATCATGCGCT TAAGTTACGT
GATGCCTTAC CTGGCGACTG GCTGTGGATT TCGCCGCAGC CAGATGCTGA AAACCACTGT
TCTCCCTCGG CACTACAAAC TTTACTTGGG CGCGAGTTCC GGCATGCGGT ATTCGACGCC
CGCCACGGCT TTGATGCCGC TGCCTTTGCA GCACTTAGCG GAACGTTGAA AGCGGGAAGC
TGGCTGGTTT TGTTACTCCC TGTATGGGAA GAGTGGGAAA ACCAACCTGA TGCCGACTCG
CTGCGCTGGA GTGATTGCCC TGACCCTATT GCGACGCCGC ATTTTGTCCA GCATCTCAAA
CGCGTACTTA CGGCGGATAA CGACGCTATC CTCTGGCGGC AAAACCAGCC GTTCTCGTTG
GCGCATTTTA CTCCCCGTAC TGACTGGCAC CCCGCGACCG GCGCACCACA ACCAGAACAA
CAGCAACTCT TACAGCAGCT ACTGACCATG CCGCCGGGCG TGGCAGCGGT AACGGCTGCG
CGTGGGCGCG GTAAGTCGGC GCTGGCAGGG CAACTCATTT CTCGTATTGC GGGCAGTGCG
ATTGTCACCG CGCCCGCAAA AGCGGCAACG GATGTACTGG CACAATTTGC GGGCGAGAAG
TTTCGCTTTA TTGCGCCGGA TGCCTTGTTA GCCAGCGATG AGCAAGCCGA CTGGCTGGTG
GTCGACGAAG CCGCAGCCAT ACCTGCGCCG TTGTTGCATC AACTGGTATC GCGTTTTCCT
CGAACGTTGT TAACCACTAC GGTGCAGGGC TACGAAGGCA CCGGACGTGG TTTTTTGCTG
AAATTTTGCG CTCGCTTTCC GCATTTACAC CGTTTTGAAC TGCAACAGCC GATCCGCTGG
GCACAGGGAT GCCCGCTGGA AAAAATGGTT AGTAATGCAC TGGTTTTTGA CGATGAAAAC
TTCACCCATA CACCACAAGG CAATATTGTC ATTTCCGCAT TTGAACAGAC GTTATGGCGA
ATCGAGCCAG AAACGCCGTT AAAGGTTTAT CAGTTATTGT CTGGTGCGCA CTACCGGACT
TCGCCACTGG ATTTACGCCG CATGATGGAT GCACCAGGGC AACATTTTTT ACAGGCGGCT
GGCGAAAACG AGATTGCCGG AGCGCTGTGG CTGGTGGATG AGGGTGGATT ATCTCAAGAA
CTCAGTCAGG CGGTATGGGC AGGTTTTCGT CGCCCGCGGG GTAATCTGGT GGCCCAGTCG
CTGGCGGCGC ACGGCAGCAA TCCACTGGCG GCGACATTGC GTGGACGGCG GGTCAGCCGG
ATTGCAGTCC ATCCGGCGCG TCAGCGCGAA GGCGTTGGGC AACAGCTCAT TGCCAGCGCT
TTGCAATATA GGCCTGGCCT CGACTATCTT TCGGTGAGTT TTGGTTACAC CGGGGAGTTA
TGGCGTTTCT GGCAACGCTG CGGTTTTGTG CTGGTGCGAA TGGGTAATCA TCGTGAAGCC
AGCAGCGGTT GCTATACGGC GATGGCACTG TTACCGATGA GTGATGCGGG TAAACAGCTG
GCTGAACGTG AGCATTACCG TTTACGTCGC GATGCGCAAG CTCTCGCAAA GTGGAATGGC
GAAACGCTTC CTGTTGATCC ACTAAACGAT GCCGTCCTTT CTGACGACGA CTGGCTTGAA
CTGGCCGGTT TTGCTTTCGC TCATCGTCCG CTATTAACAT CGTTAGGTTG CTTATTGCGT
CTGCTACAAA CCAGTGAACT GGCATTACCG GCGCTGCGTG GGCGTTTACA GAAAAACGTC
AGCGACGCGC AGTTATGTAC CACACTTAAA CTTTCAGGCC GCAAGATGTT ACTGGTCCGT
CAGCGGGAAG AGGCCGCGCA GGCGCTGTTC GCACTTAATG AGGTTCGCAC TGAACGTCTG
CGCGATCGCA TAACGCAATG GCAATTTTTT CACTGA
 
Protein sequence
MAELTALHTL TAQMKREGVR RLLVLSGEEG WCFDHALKLR DALPGDWLWI SPQPDAENHC 
SPSALQTLLG REFRHAVFDA RHGFDAAAFA ALSGTLKAGS WLVLLLPVWE EWENQPDADS
LRWSDCPDPI ATPHFVQHLK RVLTADNDAI LWRQNQPFSL AHFTPRTDWH PATGAPQPEQ
QQLLQQLLTM PPGVAAVTAA RGRGKSALAG QLISRIAGSA IVTAPAKAAT DVLAQFAGEK
FRFIAPDALL ASDEQADWLV VDEAAAIPAP LLHQLVSRFP RTLLTTTVQG YEGTGRGFLL
KFCARFPHLH RFELQQPIRW AQGCPLEKMV SNALVFDDEN FTHTPQGNIV ISAFEQTLWR
IEPETPLKVY QLLSGAHYRT SPLDLRRMMD APGQHFLQAA GENEIAGALW LVDEGGLSQE
LSQAVWAGFR RPRGNLVAQS LAAHGSNPLA ATLRGRRVSR IAVHPARQRE GVGQQLIASA
LQYRPGLDYL SVSFGYTGEL WRFWQRCGFV LVRMGNHREA SSGCYTAMAL LPMSDAGKQL
AEREHYRLRR DAQALAKWNG ETLPVDPLND AVLSDDDWLE LAGFAFAHRP LLTSLGCLLR
LLQTSELALP ALRGRLQKNV SDAQLCTTLK LSGRKMLLVR QREEAAQALF ALNEVRTERL
RDRITQWQFF H