Gene EcolC_1798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1798 
Symbol 
ID6065174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1995771 
End bp1998404 
Gene Length2634 bp 
Protein Length877 aa 
Translation table11 
GC content54% 
IMG OID641601213 
Producthypothetical protein 
Protein accessionYP_001724775 
Protein GI170019821 
COG category[R] General function prediction only 
COG ID[COG3008] Paraquat-inducible protein B 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.604041 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0674591 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAGG AAACGCCCGC TTCGACAACT GAAGCGCAGA TTAAAAATAA ACGCCGTATC 
TCACCTTTCT GGCTGCTGCC TTTCATCGCG CTAATGATTG CCGGTTGGCT GATTTGGGAC
AGTTATCAGG ACCGGGGTAA TACCGTCACC ATCGACTTTA TGTCGGCGGA TGGTATTGTT
CCGGGCCGTA CGCCTGTTCG TTATCAGGGC GTTGAAGTCG GAACAGTGCA GGATATCAGC
CTCAGCGACG ATCTTCGTAA GATTGAAGTC AAGGTCAGCA TCAAGTCCGA TATGAAAGAT
GCGCTGCGCG AAGAGACTCA GTTCTGGCTG GTGACGCCAA AAGCATCGTT GGCAGGTGTC
TCCGGGCTGG ACGCCCTCGT CGGTGGTAAC TATATCGGCA TGATGCCGGG TAAAGGTAAA
GAGCAGGATC ACTTTGTCGC ACTCGATACC CAACCGAAAT ATCGGCTGGA CAATGGCGAT
CTGATGATCC ACCTGCAAGC CCCCGATCTC GGTTCGCTGA ACAGCGGTTC ATTGGTCTAT
TTCCGCAAGA TCCCGGTGGG AAAAGTCTAC GACTATGCCA TCAATCCCAA CAAGCAAGGC
GTGGTGATTG ATGTCCTGAT CGAGCGGCGT TTTACCGACC TGGTGAAAAA AGGTAGCCGT
TTCTGGAACG TTTCCGGCGT TGATGCCAAC GTCAGTATCA GTGGCGCGAA GGTGAAACTG
GAAAGTCTGG CGGCACTGGT TAACGGTGCG ATTGCCTTCG ATTCACCAGA AGAGTCGAAA
CCTGCCGAGG CGGAAGATAC CTTTGGTCTG TATGAAGATC TGGCCCACAG CCAGCGTGGC
GTAATAATAA AACTGGAACT GCCGAGTGGG GCAGGATTAA CCGCCGACTC GACGCCGTTA
ATGTATCAGG GGCTGGAAGT CGGACAGCTG ACTAAACTGG ATTTAAATCC TGGTGGTAAA
GTCACCGGAG AAATGACCGT TGATCCCAGC GTCGTTACGC TGTTACGGGA AAATACCCGC
ATCGAATTAC GCAACCCGAA ATTATCCCTT AGCGATGCCA ATCTCAGCGC CCTGCTGACT
GGCAAAACCT TCGAGTTGGT ACCCGGCGAT GGCGAGCCAC GCAAAGAGTT CGTTGTTGTG
CCAGGCGAAA AAGCACTGCT GCATGAACCT GATGTTCTGA CGCTGACCCT GACCGCACCG
GAAAGTTACG GTATTGATGC GGGTCAGCCG CTCATTCTTC ACGGCGTGCA GGTAGGCCAG
GTTATCGATC GTAAACTCAC CAGCAAAGGC GTCACCTTTA CCGTCGCCAT CGAGCCTCAG
CATCGAGAAC TGGTAAAAGG CGATAGCAAA TTTGTCGTCA ACAGCCGTGT CGACGTGAAG
GTGGGGCTGG ATGGCGTTGA GTTTCTCGGT GCCAGCGCCT CAGAATGGAT TAACGGCGGG
ATACGTATTC TGCCGGGCGA TAAAGGCGAG ATGAAAGCCA GCTATCCACT GTATGCCAAT
CTGGAAAAAG CGCTGGAGAA CAGCCTTAGC GATTTACCCA CCACAACCGT GAGTTTGAGT
GCAGAGACGC TGCCGGATGT GCAGGCAGGA TCGGTAGTGC TCTACCGTAA ATTTGAAGTT
GGTGAAGTTA TTACCGTCCG TCCGCGAGCT AACGCGTTTG ATATCGATCT GCATATTAAG
CCGGAGTATC GCAACCTTCT GACCAGCAAT AGCGTGTTCT GGGCAGAAGG CGGGGCGAAA
GTTCAGCTGA ATGGTAGTGG CCTGACCGTA CAGGCATCCC CGCTCTCCAG AGCATTAAAG
GGAGCCATTA GCTTCGATAA CCTCAGCGGT GCCAGCGCCA GTCAGCGTAA AGGCGACAAA
CGAATTCTGT ATGCTTCCGA AACAGCGGCC CGTGCGGTTG GTGGGCAGAT TACGCTTCAC
GCTTTCGATG CCGGAAAACT GGCGGTCGGG ATGCCAATTC GCTATCTCGG TATTGATATC
GGGCAAATCC AGACGCTGGA TCTGATTACC ACGCGCAATG AAGTACAGGC AAAGGCGGTG
CTCTATCCGG AATATGTCCA GACCTTTGCT CGCGGTGGTA CGCGCTTCTC AGTGGTCACA
CCGCAAATTT CGGCAGCTGG CGTTGAGCAT CTTGATACTA TCCTCCAGCC GTATATCAAC
GTCGAACCAG GCCGGGGCAA TCCTCGCCGC GACTTTGAAT TACAAGAGGC CACCATTACT
GATTCGCGTT ACCTGGATGG CTTAAGCATT ATTGTTGAAG CGCCGGAAGC CGGTTCGTTA
GGCATCGGTA CGCCTGTGCT GTTCCGTGGT CTGGAAGTCG GTACGGTTAC AGGAATGACG
CTGGGGACAT TGTCAGATCG CGTGATGATT GCGATGCGCA TCAGTAAACG CTATCAACAC
CTGGTGCGTA ACAATTCCGT CTTCTGGTTG GCATCGGGTT ACAGTCTGGA CTTTGGTCTG
ACGGGCGGCG TAGTGAAAAC CGGCACCTTT AACCAGTTTA TCCGTGGCGG CATCGCCTTC
GCCACGCCTC CGGGGACGCC ACTGGCACCG AAAGCCCAGG AAGGCAAGCA CTTCCTGTTG
CAGGAAAGTG AACCGAAAGA GTGGCGTGAA TGGGGTACTG CGCTTCCCAA ATAA
 
Protein sequence
MSQETPASTT EAQIKNKRRI SPFWLLPFIA LMIAGWLIWD SYQDRGNTVT IDFMSADGIV 
PGRTPVRYQG VEVGTVQDIS LSDDLRKIEV KVSIKSDMKD ALREETQFWL VTPKASLAGV
SGLDALVGGN YIGMMPGKGK EQDHFVALDT QPKYRLDNGD LMIHLQAPDL GSLNSGSLVY
FRKIPVGKVY DYAINPNKQG VVIDVLIERR FTDLVKKGSR FWNVSGVDAN VSISGAKVKL
ESLAALVNGA IAFDSPEESK PAEAEDTFGL YEDLAHSQRG VIIKLELPSG AGLTADSTPL
MYQGLEVGQL TKLDLNPGGK VTGEMTVDPS VVTLLRENTR IELRNPKLSL SDANLSALLT
GKTFELVPGD GEPRKEFVVV PGEKALLHEP DVLTLTLTAP ESYGIDAGQP LILHGVQVGQ
VIDRKLTSKG VTFTVAIEPQ HRELVKGDSK FVVNSRVDVK VGLDGVEFLG ASASEWINGG
IRILPGDKGE MKASYPLYAN LEKALENSLS DLPTTTVSLS AETLPDVQAG SVVLYRKFEV
GEVITVRPRA NAFDIDLHIK PEYRNLLTSN SVFWAEGGAK VQLNGSGLTV QASPLSRALK
GAISFDNLSG ASASQRKGDK RILYASETAA RAVGGQITLH AFDAGKLAVG MPIRYLGIDI
GQIQTLDLIT TRNEVQAKAV LYPEYVQTFA RGGTRFSVVT PQISAAGVEH LDTILQPYIN
VEPGRGNPRR DFELQEATIT DSRYLDGLSI IVEAPEAGSL GIGTPVLFRG LEVGTVTGMT
LGTLSDRVMI AMRISKRYQH LVRNNSVFWL ASGYSLDFGL TGGVVKTGTF NQFIRGGIAF
ATPPGTPLAP KAQEGKHFLL QESEPKEWRE WGTALPK