Gene Information |
Locus tag | Cwoe_2366 |
Symbol | |
ID | 8732809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2504716 |
End bp | 2508099 |
Gene Length | 3384 bp |
Protein Length | 1127 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646502983 |
Product | HtaA domain protein |
Protein accession | YP_003394165 |
Protein GI | 284043825 |
COG category | |
COG ID | |
TIGRFAM ID | |
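The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2504716
end_bp = 2508099

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which is not translated into an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the reported gene length of 3384 bp and protein length of 1127 aa.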
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.527032 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0458127 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGGAA TGGCACGTGC GCCGCGGTCG CTGCCGCGCG CGCTGACGGC CCGCACCCGC GTGCGCCGAG GTGCACTGGC GGTGCTGGCG GCGATCGCCT GCGGGCCGGT CGCGGGCACG GTGGCGGCGC CGGCGGCGCA CGCCGCGCCG ACGGCGATCG AGAGCGGCGA CGGGCTCGAC TGGGGCCTGC GCGAGTCGTG GCGCAGATAC ATCGGAGCGG GCGGCACGAC GGCGAGCGAC GGCGCGACCG TCAACCCCGA CGGCACGTTC CACTTCCCGA TCGGCGGCGG CTCCTACGAC CCCGACACGA GGACGACGGT CGTCAGATTC GGCGGCCGGG TGCAGTTCCT CGGCCACTGC GACGGCGGCG GCTTCGAGCG TCCGTGCGCG CTCGACCTGA CGCTCGCGAA CCCGCGCGTC GAGCTGACCG AGGAGGGCGG CTTCCTCTAC GCGAAGATGG CGAGCCGCCC GATCGAGGGC GGCGAGATCG TCGACCTGCC GGATGTCAGA CTCGCGGCGG TCGACATCGA GGACGCCCAG CCCGCGATCG GCGGCGGGGT CACACGCTGG AGCGGCCTGC CCGCGACGAT CACCGCCGAG GGCTCGACCG TCTTCACCTA CCCGGTCGGC TCGGCGCTCG ACCTGCTCAC GTTCGCCTAC GACGGCCCCG GCGGCAAGCC GGCCGGCGAG ACCTGGAGCG AGCCCGGCAT CGCCCTCTTC GACCCGACTC CGCTGCCGGG CGCGAGCTTC GCGCCGACGC GCCTCTACCC GGGCTTCGAC GCCGGGACGA CGGTCGCGCG CGACGGTCAG CGGATCGCCG TCGTCGACCG TGCGACGCTT GCCCCGCTGT CGGCGAACGC GGTCGTCGAG CCGGTCGGTG GGCGCGACAG CATCGCCGTC GACCCGAGAT CGAGAACGAT CTTCGGGATC GAGACGTTCG GCAGAAAGCG GCTGTTCGCC TACACCTGGG ACGGCGCCGC GCTGACCGGT GGTCCGCTCG CCGGCACCGA CGCCGGCGGC GTCACGACCG ATCCGGCGCT GCAGGGCGCC GGGGTCTGGG ACCCGCACGG GAGACGCTAC CTCGCGGTCC GCGTCTACCC GAACCGCCAG GAGCTGTGGC AGGTCGCGCA GGACGGCGGC GCGTGGGTCG CGAGCAGAAT CGACACGATC CGCGGCATCG ACGGCGTCCC GTTCACGGTC CCGATCGCGA GCCTCGCCGC CGTGCCGAGC GGCGATTTCC GCAACCCGCA GATGTTCGTC GCGGCGACGT GGGCCGGCGG CCCGCTGCTG AAGCTCGCGA CCGACGGCGT CGTCGCCGGC GTCTCGCCGC TGCCGCAGGG CGGGAGCATC CGTGGCGAGG AGCTGATCCG CGTCAGAAAC GGCCTCTATG CCTTCGACGC CGAGGACGGG ATCACGTTCT TTCCGCTCAC CGGCGAGAAC TCGTGGGACA TGATGGAGAA GCCCCGCCCG TCGATCAGAC CGACGGGCCT CCCGCTCGCC GACCTCAACC GCTGGCGCAG CTGGATCGCC GGCGACCGCT CCGCCAGCGT CTTCTACGCG GTCATCACCG GCGGCACTCA GGTGGCGCGG ATCGAGGGCG GCGCCGTCAC GGGCACGTTC GCGCTGCCCG GTGGGCGGAG AGAGCTGAAA GCCTGGGACG ACCGGCTCGC CGGCGTGGCG GCGAACGGCG ACCTGCTGCT CGGCTCCGCC CCCGACGGCG GTCCGGAGAC CGTGACGCGG CTCGCCTACG CGCGCACGAC GCCGTCGATC GCGAGAGCGC CGCAGGACAC CGCGGTCACG 
CTCGCCGCGG CTGACGGAAG CGGGACGGCG AGATTCGCGG TCGCGGCGGC CGGCGACCCG GCGCCCGCGC TGCGCTGGCA GTCGCGCGTG CCGGGCGGGA GCTGGACCGA CCTGGCCGAC GGCACAAGCG TCGCCGGCGC GAGCACCGCG GCGCTGACGG TGACGGTCGG CGCCGCCGAC TCCGGCCGCC AGTTCCGCGC GATCGCGCAG AACACGGCCG GCGAGGTCGC CGGCGCGCGG GCGACGCTGA CCGTCAAGAC GCCGCCGAGC GTCGTCGTCC AGCCGGACCC CGTGACGGTG CTCGACGGCG CGAGCGCGCG CTTCAGAGTG ATGCCGTCCG GCACGCCGGA GCCCGCGGTC GACTGGCAGC AGCGCGTCAA CGGCTTCTGG CGGCAGGTCG ACCCGCAGTC CGGTGACGTC GTCGCCGACG GCGGGACACT GACGATCCCC TCCGCCACGG TCGCGATGTC TGGCGCGCAG TTCCGCGCGC GCCTGCGCAA CGACGCCGGC ACCGCCTTCT CGCGGCCGGT CACGCTGACC GTCGAGCAGG CGCTGACGCA GCCGGTCCGC TTCGGCGGCG GCCATGTCGA CTGGGGCGTC TCCGAGCGGT GGCGCTGCTA CGTCGTCGGC AACGTCGCAC GCGGCGCGAT CGAGCCCGAG GCCGGCGTCG AGCGGATCCC CGGCACGCTC GCGAGCGGCC AGCTCTGCAA CGGCCGCAAC GCCGGCTCGG AGGCGCTGCG CTTCCCGGTC CGCAGCGGCA GCTTCGACCC GCGTGACGGC AGACTCGAGC TGTCGCTGCG CGGCTCGGTG CGCTTCCGCG GCCACGACTA CCACCGGCCC GGCGACCCGC GGCCGCAGCT CGACACGCGC TTCTCGAACC TGCGGATCGT CGCCGACGGG ACGACCGGCA CGCTCTACGC CGACGCGGTC GGCGCGACGA TGGATCGGCC GGACCCGGTC ACGCGCACGA ACGTCCCGCT CGTGACCGTC GACCTCGCCG GCACCGGCCC GGTGCGACGG CCCGACGGCC TCGACTGGAG CGCGCTCCCG ACGGTCTTGA CGGCCCAGGG CGGCGAGGTC TTCGGCAGCT ACAGAGCCGG TGAGGCGTTC GACCCGCTGA CGCTCGCGCC GGTCTACGGC GAGCCGCAGC CGGATCCGGT CCCCGCGCCG AAGCCCGCCC CGGCCCCGGC CCCGGCTGCG GCGCCCGCGC CGAAGCCGAC GCCGAAGCCC GCCGCGCGTG CGAGCGTCAC CGTCGCGAAG ACGGCGGTCC GGCTCGATCG CCGGCGGACC GCGCGCGTCG CGACGGTCGC CTGCCCGGCG CGCGCGAGAT CCGCGTGCCG GATCGCGGCG CCCGCGCGCG TGCGCGTGCG CGCTGCCGGG CGCAGCTTCG CCGTCCGCGT GCTCGCGCCG AAGACGGTCC GCGCCGGCAG ACGGGCGGCC GTGCGCGTGC GGCTGCCGGC CGTGGCCGCG ACGCGCCTGG CGGGGCACAG AGCGAGCGTG CGGATCGCGC TCGTCACGAC CGTCGACGGC GCGCGCGAGC GCCACACGCT GAACGCGACG ATCACGGCGA AGCGGAGACG CTGA
|
Protein sequence | MHGMARAPRS LPRALTARTR VRRGALAVLA AIACGPVAGT VAAPAAHAAP TAIESGDGLD WGLRESWRRY IGAGGTTASD GATVNPDGTF HFPIGGGSYD PDTRTTVVRF GGRVQFLGHC DGGGFERPCA LDLTLANPRV ELTEEGGFLY AKMASRPIEG GEIVDLPDVR LAAVDIEDAQ PAIGGGVTRW SGLPATITAE GSTVFTYPVG SALDLLTFAY DGPGGKPAGE TWSEPGIALF DPTPLPGASF APTRLYPGFD AGTTVARDGQ RIAVVDRATL APLSANAVVE PVGGRDSIAV DPRSRTIFGI ETFGRKRLFA YTWDGAALTG GPLAGTDAGG VTTDPALQGA GVWDPHGRRY LAVRVYPNRQ ELWQVAQDGG AWVASRIDTI RGIDGVPFTV PIASLAAVPS GDFRNPQMFV AATWAGGPLL KLATDGVVAG VSPLPQGGSI RGEELIRVRN GLYAFDAEDG ITFFPLTGEN SWDMMEKPRP SIRPTGLPLA DLNRWRSWIA GDRSASVFYA VITGGTQVAR IEGGAVTGTF ALPGGRRELK AWDDRLAGVA ANGDLLLGSA PDGGPETVTR LAYARTTPSI ARAPQDTAVT LAAADGSGTA RFAVAAAGDP APALRWQSRV PGGSWTDLAD GTSVAGASTA ALTVTVGAAD SGRQFRAIAQ NTAGEVAGAR ATLTVKTPPS VVVQPDPVTV LDGASARFRV MPSGTPEPAV DWQQRVNGFW RQVDPQSGDV VADGGTLTIP SATVAMSGAQ FRARLRNDAG TAFSRPVTLT VEQALTQPVR FGGGHVDWGV SERWRCYVVG NVARGAIEPE AGVERIPGTL ASGQLCNGRN AGSEALRFPV RSGSFDPRDG RLELSLRGSV RFRGHDYHRP GDPRPQLDTR FSNLRIVADG TTGTLYADAV GATMDRPDPV TRTNVPLVTV DLAGTGPVRR PDGLDWSALP TVLTAQGGEV FGSYRAGEAF DPLTLAPVYG EPQPDPVPAP KPAPAPAPAA APAPKPTPKP AARASVTVAK TAVRLDRRRT ARVATVACPA RARSACRIAA PARVRVRAAG RSFAVRVLAP KTVRAGRRAA VRVRLPAVAA TRLAGHRASV RIALVTTVDG ARERHTLNAT ITAKRRR
|
| |
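The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis such as recomputing the GC content. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical and are not taken from any particular library, and the example string is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, used only as a short demo.
example = "ATGCACGGAA TGGCACGTGC"
joined = clean_sequence(example)

print(joined)
print(f"GC fraction of the demo fragment: {gc_fraction(joined):.2f}")
```

Applying the same two functions to the complete gene sequence would give a value to compare against the reported GC content of 75%.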