Gene Information |
Locus tag | Cwoe_5836 |
Symbol | |
ID | 8736312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 6248573 |
End bp | 6252853 |
Gene Length | 4281 bp |
Protein Length | 1426 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646506463 |
Product | Collagen triple helix repeat protein |
Protein accession | YP_003397612 |
Protein GI | 284047272 |
COG category | [R] General function prediction only |
COG ID | [COG1409] Predicted phosphohydrolases |
TIGRFAM ID | |
| |
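The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 6248573
end_bp = 6252853
gene_length_bp = 4281
protein_length_aa = 1426

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which is not translated.
codon_count = span_bp // 3
translated_aa = codon_count - 1

print("span (bp):", span_bp)
print("whole codons:", span_bp % 3 == 0)
print("translated residues:", translated_aa)
```

Under those assumptions the computed span agrees with the listed Gene Length, and the codon count minus the stop codon agrees with the listed Protein Length.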
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.278516 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACCTCCT CCTCCGCGCC GCGCAAGCGG CGCTGGTCGA GCCTGCCCGC GGCGCTCGCC GCGGCGGCGC TGATCGTCGC GCTGCCGCCG GCCGCGCACG CCGGCGGACT CGACCTGATC GACGAGACCG AGCAGATCGG GCCGGGCGTC AGCATGCGCC ACCTCAAGAC GCTCGAAGCC GGCGGCTGGT TCGACTACCA GCTGCTGACG GCCAGACTGC AGGGCGGCGT CGTCACGAGC GACCTGCTCT CCGGCGACAG CGTGACCGAG GCCGGGCCGA TCTCGAAGAA GGCCGACAGA GCCGGCGCGG TCGCGGGCGT CAACGGCGAC TTCTTCGACA TCAACAACTC CAACGCGCCG CAGAACCTCG CCGTCAGAGG CGGTGAGCTG CTCAAGAGCG CGAACTTCGG GCTGACCGCC CCGGCCACCG GCGTCACGAG AGACGGCATC GGCCAGCTGC TCAGCACGAC GCTCGACGCG AAGGCGACGT TCGGCGGCGC CGACCTTCCC GTCGCCGCCC TCAACGCCGC CGGCGCCGTT CCCGCCGACG GCTACGTCGC CTTCACCCCG AAGTGGGGCA GATCCAGCCG CGCGCGCAGC CTCGCCGGCG TGGCGAACGT CGCCGAGGCG CTCGTCACCG ACGGCAGAGT CGTCGCCGTC TCCGACGGCG TGGGCGCGGG CGAGATCCCC GCCGGCAGCT TCTACCTCGT CGGCCGCGAG AGCGCCGCCG ACGCGATCCG TGCGCTGAGA GCCGGCGACG AGGTCAGACT CGCGTACGGT CTCAGCGGCG ACGTCGCGCA GCAGCTCCAG TTCGCGATCG GCGGCAACGA AGTGCTCGTC CGCGACGGGC AGGTCGTCGG CAGCGACCAG TCGGTCCACC CGCGCACGGC GATCGGCTTC AAGGACGGCG GCAGAACGCT GCTGCTGTTC GTCGCTGACG GCCGCCAGAC GCAGGTGCTC GGCATGACGA CGCAGAAGGT CGCGCAGCTG CTGCGCGACG CGGGCGCCGA GACCGCGATG AACCTCGACG GCGGCGGCTC GACGACACTC GTCGCCCGCC CGCTCGGCGA CAGCAGAGCG GCGGTCCGCA ACACGCCCTC CGACGGCGCC GAGCGCCACG ACCCCAACGG CGTCGGCCTG TTCGTCAGCG CCGGCAACGG CCAGGTCGAG CAGCTCGTGC TCTCGCCCTC CGGCGAGCAG GCGCGCGTCT TCCCGGGCCT GCACCGCACG CTGCGCGTGA AGGCGGTCGA CGACCACCAG ACGCCGGTCG CGCTCGCGCG CGGCGATGTC CGCTGGAGCA GCAACGAGGG CAGTGTCGAC GGCGGCCTGC TGCGGGCGCC CGAGAACGTC TTCGGGCAGA TCCGCGTGAA GGCGACGACC GACACCGCGC AGGAGGAGGC CGTCGTCAGA GTGCTCGGCG AGCTGCACGC GCTCGAGCTG TCCTCGCAGC GGCTCTCGTT CTCGGCGACC GGCGCCGAGC AGGCGCGGAC GCTGAAGGTC ACCGGCCGCG ACGCGCACGG CTTCACCGCC CCGGTCGAGT CGGCCGACCT GGAGCTCGAC TACGACGACG CGGTCGTCAA GGTCACCCCC TCCGGCGACG CGCTGAAGAT CACGCCGCTG CAGGCGGGCG GCACGGTGCT GACCGTCTCC GCCGGATCGC AGTCGGTCAA GCTGCCGATC TCCGTCGGCG TCCAGACGCA CGTCAACGAC TACTTCCTGA CGACCAGCCC GATCGCCAGC GGCAACTGGA TGTGGACGGG CACGGGGACG ATCAGACGGA CGCTCACCGA CACCGCCGAG 
GGCGTCAGAG TCGACTACAC CGCCGGCCGC AACATAGGGA TCACCTCCAG AGGCACGGTC GGGCAGTTCG CGCTGCCGGG CGCGCCGCTG AGAGTTCGCG TGAGAGTGAA GTCGAGCGCG AACGTCAGCC TCACCTACGG TGCCTTCGCG CAGGCGGACG GCACCTACAA GAACCTCTAC GGCGCGGCGC TGAAGCCGGG CTGGAACGAC GTCGAGTTCG CGCTCCCGGC CGACACGAAG TACCCGATCA AGCTCGACAC CTTCCAGGCG ATCGAGACGA CCGTCGCTGC GCAGAGAGAC GGCTCGATCG TCGTCGGCGA GGTCTCGGCC GACGTGCCGA GCGAGGTCGA GCTGCCCGAG CAGGAGCCGC TGCGCTCCGA CCGGCTGCTG TCCGCCGACG GCGACCTGCA GGAGGGCGCG GACTTCAGCT TCGCGACGCT CTCCGACGTC CAGTTCACGC ACGTCAACCA GGAGATGGTC CCGGTCGCGG TCCAGGCGCT GCGGCGGATC CGCGCGACGA GACCCGACCT CGTCGTGCTC AACGGCGACA TCGTCGACCT CGGCGCCGCC GAGGACATGA CGCTCGCGCG CAGAACGCTG GAGGCAGGCG GCTGCCAGCT CGTCCCGCTC GGCACGCCGG ACGTCCCCAC GCCGACCGCA GACACGGTTC CGTGCCTCTA CGTCCCCGGC AACCACGAGT CGTACGTCGC CGGCGGCCAG GGCACGCTCG ACGCGTTCAA GGCCGAGTTC GGCAGACCGT ACGGCTACAC CGACCACAAG GGCACCCGCT TCATCACGCT CAACAGCTCC TACGGGTCGC TGCGCGGCTC GGACTTCGCG CAGCTGCCGA TGCTGCAGGA GGCGCTGGAG CAGGCGGCGG GCGAAGACGG CATCGACAAC GTGATGGTCT TCGCCCACCA CCCGGTCGAC GATCCCGCCG AGACGAAGTC GAGCCAGCTC GGCGACCGCA CCGAGGTCCA GCTCGTCAAG AAGCTGTTGT CGGACTTCCG CGACGACAGC GGCAAGGGCG TCGCCATGGT CGGCTCGCAC GCGCAGATCA TGAACGTCCG CCGCGAGGAG GGCGTCCCGT ACGTCGTGCT CCCGTCGTCG GGCAAGGCGC CGTACGGCAC GCCCGACCGC GGCGGCATCA CCGGCTGGGT CCGCTGGGGC GTCGACAGCG ACGAGAACGC CGCGGGCGAC TGGCTCGAAG GCGACGTGCG CCCGTTCGCG CAGACGATCG ACCTGCAGGC ACCGGCGACG CTCGAGGTCG GCAACAGTGC GCCGCTCGGC GGCTCGCTCG TGCAGCCGAG CGGGGTCAGA AACGGCAGCC GCACCGTGCC GCTGCGCTAC CCGCTGTCGC TGCGCTGGTC CGGCAGCGAG CAGCTCGCGA TCGGCAGCGG CGACGACGCG GTCGAGGCGG CCCGCGAGGC CGACAAGACC GCGATCCTCG ACCCGCAGAC GGGCGCGCTG ACGGCGCTCT CGACCGGCAC GGTCGAGGTC ACGGTCGCGG CCGACTCGAT GCGCGAGGGA GACGACCTCG CGCCGATCAC GGCGACGAAG ACGATCGCAG TCGCGCCGTC GACCGCTCCC GGCCCGAAGG CGTGGATCAG CGCGCCGGTC TTTCCCGACC AGGCTGCCAC GACGATCGGC GCCGGGCAGC CGGTGACGGT CGCCAACACC GGTGAGGAGC CGCTCCAGGT CGGGCTCGAC AGAGTGCTCG CGCTCGAAGG CCCGCAGGGC GACTTCGTCG TCGCCGACGA CAGCTGCAGC GCGGCGCCCG TCGCGCCGGG CGCGTCGTGC ACGGTGCTCG 
TGCGCTTCGC CCCGTCGCGC GAGAACGCGA GATCGTCGGC GCGGTTCGTC TTCCGCGACA ACACCGCCGA GCAGCGTCAC ACGGTGACGA TCAACGCGAC CTCGACCGGT CTGCCGAGAG GCGACAAGGG CGACCAGGGC CAGCCCGGCA TCCCGGGCGA GGACGGTCCC GCCGGCCCGC AGGGTCCGCA GGGCCCCGCC GGCAGAGACG GGCCGCAGGG TCCGGCCGGC AGAGACGGGC CGCAAGGCGA GTCCGGCCCG ATCGGCCCGC AGGGTCCGGC CGGCGGCGAC GGCGCGCAGG GCCCGGCCGG CGCCAAGGGC GACACCGGCG CGCCCGGTGC GACCGGCGCC AAGGGCGAGA CGGGCGAGCG CGGCGAGAAG GGCGCCAAGG GCGACCGCGG CGCGGCCGGC CGGGACGCGC TCGTCACGTG CACGGTCAGA GGCGGCGTCT ACCAGAGAGT CACGTGCGTC GTGACGTACT CGAGCAGAAG CGCGCTGCGC AACGCGAAGG TCAAGGGCAC GTCGAAGGCG CGCCTGACGC GCGCCGGGCG CACCTACGCC AGCGGCCGCG TCGGCTCGCT CAGAGCGAGC CGCACGGTCA GCCGCGGGCG CTACACGCTG CTCGTCGGCA GCGGCAAGAA CGCGGCGAAG GTGGCCGTGA CCGTCCGCTA G
|
Protein sequence | MTSSSAPRKR RWSSLPAALA AAALIVALPP AAHAGGLDLI DETEQIGPGV SMRHLKTLEA GGWFDYQLLT ARLQGGVVTS DLLSGDSVTE AGPISKKADR AGAVAGVNGD FFDINNSNAP QNLAVRGGEL LKSANFGLTA PATGVTRDGI GQLLSTTLDA KATFGGADLP VAALNAAGAV PADGYVAFTP KWGRSSRARS LAGVANVAEA LVTDGRVVAV SDGVGAGEIP AGSFYLVGRE SAADAIRALR AGDEVRLAYG LSGDVAQQLQ FAIGGNEVLV RDGQVVGSDQ SVHPRTAIGF KDGGRTLLLF VADGRQTQVL GMTTQKVAQL LRDAGAETAM NLDGGGSTTL VARPLGDSRA AVRNTPSDGA ERHDPNGVGL FVSAGNGQVE QLVLSPSGEQ ARVFPGLHRT LRVKAVDDHQ TPVALARGDV RWSSNEGSVD GGLLRAPENV FGQIRVKATT DTAQEEAVVR VLGELHALEL SSQRLSFSAT GAEQARTLKV TGRDAHGFTA PVESADLELD YDDAVVKVTP SGDALKITPL QAGGTVLTVS AGSQSVKLPI SVGVQTHVND YFLTTSPIAS GNWMWTGTGT IRRTLTDTAE GVRVDYTAGR NIGITSRGTV GQFALPGAPL RVRVRVKSSA NVSLTYGAFA QADGTYKNLY GAALKPGWND VEFALPADTK YPIKLDTFQA IETTVAAQRD GSIVVGEVSA DVPSEVELPE QEPLRSDRLL SADGDLQEGA DFSFATLSDV QFTHVNQEMV PVAVQALRRI RATRPDLVVL NGDIVDLGAA EDMTLARRTL EAGGCQLVPL GTPDVPTPTA DTVPCLYVPG NHESYVAGGQ GTLDAFKAEF GRPYGYTDHK GTRFITLNSS YGSLRGSDFA QLPMLQEALE QAAGEDGIDN VMVFAHHPVD DPAETKSSQL GDRTEVQLVK KLLSDFRDDS GKGVAMVGSH AQIMNVRREE GVPYVVLPSS GKAPYGTPDR GGITGWVRWG VDSDENAAGD WLEGDVRPFA QTIDLQAPAT LEVGNSAPLG GSLVQPSGVR NGSRTVPLRY PLSLRWSGSE QLAIGSGDDA VEAAREADKT AILDPQTGAL TALSTGTVEV TVAADSMREG DDLAPITATK TIAVAPSTAP GPKAWISAPV FPDQAATTIG AGQPVTVANT GEEPLQVGLD RVLALEGPQG DFVVADDSCS AAPVAPGASC TVLVRFAPSR ENARSSARFV FRDNTAEQRH TVTINATSTG LPRGDKGDQG QPGIPGEDGP AGPQGPQGPA GRDGPQGPAG RDGPQGESGP IGPQGPAGGD GAQGPAGAKG DTGAPGATGA KGETGERGEK GAKGDRGAAG RDALVTCTVR GGVYQRVTCV VTYSSRSALR NAKVKGTSKA RLTRAGRTYA SGRVGSLRAS RTVSRGRYTL LVGSGKNAAK VAVTVR
|
| |
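The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any computation. The following is a minimal sketch of that step together with a GC fraction calculation. The helper names `normalize` and `gc_fraction` are hypothetical, and only the first three blocks of the gene sequence are used to keep the example short, so the result describes that prefix and not the whole gene.

```python
def normalize(seq_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence, copied from the record.
prefix = normalize("TTGACCTCCT CCTCCGCGCC GCGCAAGCGG")

print(len(prefix), "bases")
print(f"GC fraction of prefix: {gc_fraction(prefix):.3f}")
```

To compare against the GC content field, the same two functions could be applied to the full gene sequence text in place of the short prefix.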