Gene EcolC_3887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3887 
Symbol 
ID6064354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4262336 
End bp4265695 
Gene Length3360 bp 
Protein Length1119 aa 
Translation table11 
GC content39% 
IMG OID641603301 
Producthypothetical protein 
Protein accessionYP_001726816 
Protein GI170021862 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTTGTGA AAACTCCAGT TAACCCACTT TTGCAATGGC TTAACATGTT TTTTAGTCGC 
CGTTCATTAT CCGGAGCTGA TGGACGGGCG TTATATGCTT ACCGTTGTAC TGATACAGAG
TACGAAAGTT TGGCAGAACT ACTGCGTACA TATGCCCCCC GTAGTTATCC AAGAACGATA
TTCATTTCCT ATAGCGATGT TCTATTTAGC ATATATGCCG CAGAGTTTAT CCGCCGGACT
CATACGGTCG GACACCCTAA ATGGGACACA ATTTTAGATT CTATTAACTG GAAAGTTCCG
TATGTTCATC GACAGAAGCT GGTTAATGAT GGCATTCGCT ACTGGAAAAG AAAGATAAGA
AACCTGGGGC AAGCTTCGGG TTATCTCCAT ACACTGGCTT GTGAAGGTGG TTTACCTATC
CGCATGATTG AAAATGAGAG CGGTTATTTG ATTACCTATT TCAGAAGGAT ATATCAGGCG
CTTCGTGGGC AATCATCTCA ATATCCTGCC GCTAAAATTG CACAAGAATT AGGCGATACG
ATTCCTGTCA CAATGCAAAA CGAACTGGTC TATGAAATAG CCGGTGAATT CTGCGAGACT
CTCTGCCGAT TACTTAGTGA GCATCCTCCT CATAGCAGTG ATCCCGTTTC TGCATTACGT
AAGCTCTCTC CAGACTGGCA TCTTCAACTT CCACTCGTTC TCCCCGAAGC GAATGCTGCA
GAGATTGTAA GGCGATTACT TTCTCAATCT TCTGAAATAC GCAGTGCAAG TAGTCTTCAG
GTTGAAAGAA TCTGGGTCGA TGTTGATGAC AGTTGGTATT GCGATGCCCG GTTCCGATTC
CCGGCCACAA TGCGTACAGA ACAGTTAACC TCTTTGTTTG AATGCCATAT TCAGCCGGAG
CAGACCCGAC TCATCATTTC AGGAAAATGG AAAAATGGCG GGGCAAGGTT GGCAATGCTA
AGCCGTTATG AGCAGCAAGA TTGGCGGGTC GAGTTATTAC CTATTGCGAT GCAAAAACTC
TCTGGTGCAG ATGCAATGGC CGAAATCTCA TTGTCGCTTC ATGAAGGTCC AATTCTGTTA
GGTCACACAA TTCCAAAAGG CGGTTATGAA CTTACTGAAG AACTTCCCTG GGTTTTTGAA
GCGATGAATG AGAGTGAATC GCAGCTAAAA CTTGTTGGTA TGGGGTCTGT GAGTTCCAGG
CTAAATGCTC TGTTTATTTC GCTACCGAAA AATAGCCATT TAGATATTAG TGGAGAAGGT
GAGTTTGATA TCCCAAGGTT GCTAAAAAAC AGTGAACGCA GCTTAACTAA AATAAGTGGA
GTATTTAGCG TTGTTTTACA TGATGGTGCT GTTTGTACAA TTCGCACCCA GCAACTTTAT
GATTCTGCGA TTGAATATTA TATTAAATCG ACAGAAGTTG AACTGGTTAA ATCGGATTAT
CCTGTCCATC GAGCCTGGCC GAAGATCGGT TGGAAAAAGG ATCTGCAATA TGGCATTGTA
CCGGAGAAAG AACTTTTTTG GCGTTCCATT CGTTCAGGTA ATAATGCCTG GTATTCTGTC
GCATCAGAAA TGCCAAAAGG ACAGATAGAA GTCCGGCGAA TAGTTAATGA TGAGGTTTTA
TTTAGCGGTA AAGTTGTAGT TTTACCAGCA GACTTCGATA TTAATATTAT ACCTGAGAGT
GCTCAGCAAG GCATTATTAT GCTTTCTGGT ATAACGGATA CTAGAATTGA TAAATATTCA
AACAATGAAA AAGTAACACT TAAGTCCGAT TATTCACAAA ACGAGTGTGC TATTTATTAT
AATTCATCGC TGATGCTGGA AAATACCGTT GATTTACGGG TCTCCTGGAA AGATGGTTCT
AATCTTAAAC TATTATTACC TAAGCCGGTT AGCGGTGGCC GATTTGTAAC TAATGATGGT
TCTGTTCATT TTGATGGTGT GGCATCTATA GCACATTTGC ATGGAATAGA TGCTGAGTTA
TTAACCATAT CATGTGCTGG AAGAGGATAT CTTAATATCG AGTTATTGGA TGAAAATCCA
GTAGCTGAAA AATTCCGTTA TTTACATGCC GACCTTCCTC TTTTATCAGG ACGCAATGAC
AAATTACAAC AAATTTCACT TTATGAGAAC TATAATTTGC TAAATGCTAT GCTGGCATGT
GCATGGAACA GCAATAGTAC ATTATGTGTT GATTTTTACT CTGACAGATT TGGAAAGGAT
AAAGCAACAC TCAATATTAA ACGCTACGAT GGTAGTTTTA TTGAACACGA TCAAGGGTTA
CTGGTTGATA TAAAAAATTC TGTTGTTTTT CCTGCAAATA GAATAGACGA ACTGGTTGTT
GATGCTATTT CTCTTAAGAA TCCTGGCTTA CACATATCAT TGTTAAAAAA AGATGAGTTT
GCTTATGATC TTTCAGCTCT GAATGTTCAG GATAGTCCAT GGTTAATTGT GGGAAAACTT
GATGGTACAG CTCGCATTGC ACCAGTAATT AAATGGATGC TACCTGTATT GCAGACAAAT
GATTTATTAC TAAATGCTCT ATGCGAAGCA GACCCAGAAC AGCGCAAAAA AAATTTTAAT
GAGCTAATTT TTGAAATAGA TAACAACCCA TTGCAAAATT ATTGCTGTTT ATTAACAGAG
TATATTAAGA AATACAAAAT GAATAATGGC TTATCCTTGC TGGATCTGGA CTTGTTCAGA
TGTATTTCGA GTAATTACCG CGTGGTTGTT CAATTGTTAA TATCATCATG TCTTTCTGGT
GATAGCGATA CGATTTATGA TATACAGGAA GAATTACCCT TTTCATGGGG ATGGATTCCC
GTTTCAATCT GGAAAGATGT TTTCCAAAAA TGTTGGACTT ATCTGGAGAA ACAGATTAAC
GATAAAACAT TAGCATTACA TATATTGCAA CCCTTTATTG CTTTTATGAA CCATCGTGCA
CATATCGATC GTCGGCTGGC TCCGATTGCG AATATGTTAC TTACATATAG TGAGAGCCTA
CCAACCGGTT GTGATGTATT GCCAACTGTT AGTCGTGAGC AGTTTAATGA AGCTAAACAG
ATGCTATTAA GGAACCCCGA CAGCTTTGGG CGTATCAGTA TCTTCCCTAA AGAACTTTGG
TCTAGTGCTA TTACTCCAGA GTTAAAATCT GTTTTTAATA AGCTTTGGAT TAAAAATAAA
TATCACTCAC GGCTTGAAAA ACGTTTTAAT TTGATGTTAG TCGCAGCGCT GTTAACCCAA
AAAGATAATA ACCTGATACA TCAACTGTCT GCGCTTTTTG AATTTCACTA TCAGCAAGCC
CCGCAGCAAT TAGGGGTAAT CTATCAATAT TATTTTGAAC AAGCAGGAGT ATGTCATTGA
 
Protein sequence
MLVKTPVNPL LQWLNMFFSR RSLSGADGRA LYAYRCTDTE YESLAELLRT YAPRSYPRTI 
FISYSDVLFS IYAAEFIRRT HTVGHPKWDT ILDSINWKVP YVHRQKLVND GIRYWKRKIR
NLGQASGYLH TLACEGGLPI RMIENESGYL ITYFRRIYQA LRGQSSQYPA AKIAQELGDT
IPVTMQNELV YEIAGEFCET LCRLLSEHPP HSSDPVSALR KLSPDWHLQL PLVLPEANAA
EIVRRLLSQS SEIRSASSLQ VERIWVDVDD SWYCDARFRF PATMRTEQLT SLFECHIQPE
QTRLIISGKW KNGGARLAML SRYEQQDWRV ELLPIAMQKL SGADAMAEIS LSLHEGPILL
GHTIPKGGYE LTEELPWVFE AMNESESQLK LVGMGSVSSR LNALFISLPK NSHLDISGEG
EFDIPRLLKN SERSLTKISG VFSVVLHDGA VCTIRTQQLY DSAIEYYIKS TEVELVKSDY
PVHRAWPKIG WKKDLQYGIV PEKELFWRSI RSGNNAWYSV ASEMPKGQIE VRRIVNDEVL
FSGKVVVLPA DFDINIIPES AQQGIIMLSG ITDTRIDKYS NNEKVTLKSD YSQNECAIYY
NSSLMLENTV DLRVSWKDGS NLKLLLPKPV SGGRFVTNDG SVHFDGVASI AHLHGIDAEL
LTISCAGRGY LNIELLDENP VAEKFRYLHA DLPLLSGRND KLQQISLYEN YNLLNAMLAC
AWNSNSTLCV DFYSDRFGKD KATLNIKRYD GSFIEHDQGL LVDIKNSVVF PANRIDELVV
DAISLKNPGL HISLLKKDEF AYDLSALNVQ DSPWLIVGKL DGTARIAPVI KWMLPVLQTN
DLLLNALCEA DPEQRKKNFN ELIFEIDNNP LQNYCCLLTE YIKKYKMNNG LSLLDLDLFR
CISSNYRVVV QLLISSCLSG DSDTIYDIQE ELPFSWGWIP VSIWKDVFQK CWTYLEKQIN
DKTLALHILQ PFIAFMNHRA HIDRRLAPIA NMLLTYSESL PTGCDVLPTV SREQFNEAKQ
MLLRNPDSFG RISIFPKELW SSAITPELKS VFNKLWIKNK YHSRLEKRFN LMLVAALLTQ
KDNNLIHQLS ALFEFHYQQA PQQLGVIYQY YFEQAGVCH