Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1798 |
Symbol | |
ID | 6065174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 1995771 |
End bp | 1998404 |
Gene Length | 2634 bp |
Protein Length | 877 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641601213 |
Product | hypothetical protein |
Protein accession | YP_001724775 |
Protein GI | 170019821 |
COG category | [R] General function prediction only |
COG ID | [COG3008] Paraquat-inducible protein B |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.604041 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0674591 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAGG AAACGCCCGC TTCGACAACT GAAGCGCAGA TTAAAAATAA ACGCCGTATC TCACCTTTCT GGCTGCTGCC TTTCATCGCG CTAATGATTG CCGGTTGGCT GATTTGGGAC AGTTATCAGG ACCGGGGTAA TACCGTCACC ATCGACTTTA TGTCGGCGGA TGGTATTGTT CCGGGCCGTA CGCCTGTTCG TTATCAGGGC GTTGAAGTCG GAACAGTGCA GGATATCAGC CTCAGCGACG ATCTTCGTAA GATTGAAGTC AAGGTCAGCA TCAAGTCCGA TATGAAAGAT GCGCTGCGCG AAGAGACTCA GTTCTGGCTG GTGACGCCAA AAGCATCGTT GGCAGGTGTC TCCGGGCTGG ACGCCCTCGT CGGTGGTAAC TATATCGGCA TGATGCCGGG TAAAGGTAAA GAGCAGGATC ACTTTGTCGC ACTCGATACC CAACCGAAAT ATCGGCTGGA CAATGGCGAT CTGATGATCC ACCTGCAAGC CCCCGATCTC GGTTCGCTGA ACAGCGGTTC ATTGGTCTAT TTCCGCAAGA TCCCGGTGGG AAAAGTCTAC GACTATGCCA TCAATCCCAA CAAGCAAGGC GTGGTGATTG ATGTCCTGAT CGAGCGGCGT TTTACCGACC TGGTGAAAAA AGGTAGCCGT TTCTGGAACG TTTCCGGCGT TGATGCCAAC GTCAGTATCA GTGGCGCGAA GGTGAAACTG GAAAGTCTGG CGGCACTGGT TAACGGTGCG ATTGCCTTCG ATTCACCAGA AGAGTCGAAA CCTGCCGAGG CGGAAGATAC CTTTGGTCTG TATGAAGATC TGGCCCACAG CCAGCGTGGC GTAATAATAA AACTGGAACT GCCGAGTGGG GCAGGATTAA CCGCCGACTC GACGCCGTTA ATGTATCAGG GGCTGGAAGT CGGACAGCTG ACTAAACTGG ATTTAAATCC TGGTGGTAAA GTCACCGGAG AAATGACCGT TGATCCCAGC GTCGTTACGC TGTTACGGGA AAATACCCGC ATCGAATTAC GCAACCCGAA ATTATCCCTT AGCGATGCCA ATCTCAGCGC CCTGCTGACT GGCAAAACCT TCGAGTTGGT ACCCGGCGAT GGCGAGCCAC GCAAAGAGTT CGTTGTTGTG CCAGGCGAAA AAGCACTGCT GCATGAACCT GATGTTCTGA CGCTGACCCT GACCGCACCG GAAAGTTACG GTATTGATGC GGGTCAGCCG CTCATTCTTC ACGGCGTGCA GGTAGGCCAG GTTATCGATC GTAAACTCAC CAGCAAAGGC GTCACCTTTA CCGTCGCCAT CGAGCCTCAG CATCGAGAAC TGGTAAAAGG CGATAGCAAA TTTGTCGTCA ACAGCCGTGT CGACGTGAAG GTGGGGCTGG ATGGCGTTGA GTTTCTCGGT GCCAGCGCCT CAGAATGGAT TAACGGCGGG ATACGTATTC TGCCGGGCGA TAAAGGCGAG ATGAAAGCCA GCTATCCACT GTATGCCAAT CTGGAAAAAG CGCTGGAGAA CAGCCTTAGC GATTTACCCA CCACAACCGT GAGTTTGAGT GCAGAGACGC TGCCGGATGT GCAGGCAGGA TCGGTAGTGC TCTACCGTAA ATTTGAAGTT GGTGAAGTTA TTACCGTCCG TCCGCGAGCT AACGCGTTTG ATATCGATCT GCATATTAAG CCGGAGTATC GCAACCTTCT GACCAGCAAT AGCGTGTTCT GGGCAGAAGG CGGGGCGAAA GTTCAGCTGA ATGGTAGTGG CCTGACCGTA CAGGCATCCC CGCTCTCCAG AGCATTAAAG GGAGCCATTA GCTTCGATAA CCTCAGCGGT GCCAGCGCCA GTCAGCGTAA AGGCGACAAA CGAATTCTGT ATGCTTCCGA AACAGCGGCC CGTGCGGTTG GTGGGCAGAT TACGCTTCAC GCTTTCGATG CCGGAAAACT GGCGGTCGGG ATGCCAATTC GCTATCTCGG TATTGATATC GGGCAAATCC AGACGCTGGA TCTGATTACC ACGCGCAATG AAGTACAGGC AAAGGCGGTG CTCTATCCGG AATATGTCCA GACCTTTGCT CGCGGTGGTA CGCGCTTCTC AGTGGTCACA CCGCAAATTT CGGCAGCTGG CGTTGAGCAT CTTGATACTA TCCTCCAGCC GTATATCAAC GTCGAACCAG GCCGGGGCAA TCCTCGCCGC GACTTTGAAT TACAAGAGGC CACCATTACT GATTCGCGTT ACCTGGATGG CTTAAGCATT ATTGTTGAAG CGCCGGAAGC CGGTTCGTTA GGCATCGGTA CGCCTGTGCT GTTCCGTGGT CTGGAAGTCG GTACGGTTAC AGGAATGACG CTGGGGACAT TGTCAGATCG CGTGATGATT GCGATGCGCA TCAGTAAACG CTATCAACAC CTGGTGCGTA ACAATTCCGT CTTCTGGTTG GCATCGGGTT ACAGTCTGGA CTTTGGTCTG ACGGGCGGCG TAGTGAAAAC CGGCACCTTT AACCAGTTTA TCCGTGGCGG CATCGCCTTC GCCACGCCTC CGGGGACGCC ACTGGCACCG AAAGCCCAGG AAGGCAAGCA CTTCCTGTTG CAGGAAAGTG AACCGAAAGA GTGGCGTGAA TGGGGTACTG CGCTTCCCAA ATAA
|
Protein sequence | MSQETPASTT EAQIKNKRRI SPFWLLPFIA LMIAGWLIWD SYQDRGNTVT IDFMSADGIV PGRTPVRYQG VEVGTVQDIS LSDDLRKIEV KVSIKSDMKD ALREETQFWL VTPKASLAGV SGLDALVGGN YIGMMPGKGK EQDHFVALDT QPKYRLDNGD LMIHLQAPDL GSLNSGSLVY FRKIPVGKVY DYAINPNKQG VVIDVLIERR FTDLVKKGSR FWNVSGVDAN VSISGAKVKL ESLAALVNGA IAFDSPEESK PAEAEDTFGL YEDLAHSQRG VIIKLELPSG AGLTADSTPL MYQGLEVGQL TKLDLNPGGK VTGEMTVDPS VVTLLRENTR IELRNPKLSL SDANLSALLT GKTFELVPGD GEPRKEFVVV PGEKALLHEP DVLTLTLTAP ESYGIDAGQP LILHGVQVGQ VIDRKLTSKG VTFTVAIEPQ HRELVKGDSK FVVNSRVDVK VGLDGVEFLG ASASEWINGG IRILPGDKGE MKASYPLYAN LEKALENSLS DLPTTTVSLS AETLPDVQAG SVVLYRKFEV GEVITVRPRA NAFDIDLHIK PEYRNLLTSN SVFWAEGGAK VQLNGSGLTV QASPLSRALK GAISFDNLSG ASASQRKGDK RILYASETAA RAVGGQITLH AFDAGKLAVG MPIRYLGIDI GQIQTLDLIT TRNEVQAKAV LYPEYVQTFA RGGTRFSVVT PQISAAGVEH LDTILQPYIN VEPGRGNPRR DFELQEATIT DSRYLDGLSI IVEAPEAGSL GIGTPVLFRG LEVGTVTGMT LGTLSDRVMI AMRISKRYQH LVRNNSVFWL ASGYSLDFGL TGGVVKTGTF NQFIRGGIAF ATPPGTPLAP KAQEGKHFLL QESEPKEWRE WGTALPK
|
| |