Gene Cwoe_5836 details


Gene Information

Locus tag: Cwoe_5836
Symbol:
ID: 8736312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 6248573
End bp: 6252853
Gene Length: 4281 bp
Protein Length: 1426 aa
Translation table: 11
GC content: 73%
IMG OID: 646506463
Product: Collagen triple helix repeat protein
Protein accession: YP_003397612
Protein GI: 284047272
COG category: [R] General function prediction only
COG ID: [COG1409] Predicted phosphohydrolases
TIGRFAM ID:
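
The coordinates and lengths in this record agree with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(6248573, 6252853)
print(length, protein_length_aa(length))  # prints: 4281 1426
```

Both values match the Gene Length and Protein Length fields above.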


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.278516
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCTCCT CCTCCGCGCC GCGCAAGCGG CGCTGGTCGA GCCTGCCCGC GGCGCTCGCC 
GCGGCGGCGC TGATCGTCGC GCTGCCGCCG GCCGCGCACG CCGGCGGACT CGACCTGATC
GACGAGACCG AGCAGATCGG GCCGGGCGTC AGCATGCGCC ACCTCAAGAC GCTCGAAGCC
GGCGGCTGGT TCGACTACCA GCTGCTGACG GCCAGACTGC AGGGCGGCGT CGTCACGAGC
GACCTGCTCT CCGGCGACAG CGTGACCGAG GCCGGGCCGA TCTCGAAGAA GGCCGACAGA
GCCGGCGCGG TCGCGGGCGT CAACGGCGAC TTCTTCGACA TCAACAACTC CAACGCGCCG
CAGAACCTCG CCGTCAGAGG CGGTGAGCTG CTCAAGAGCG CGAACTTCGG GCTGACCGCC
CCGGCCACCG GCGTCACGAG AGACGGCATC GGCCAGCTGC TCAGCACGAC GCTCGACGCG
AAGGCGACGT TCGGCGGCGC CGACCTTCCC GTCGCCGCCC TCAACGCCGC CGGCGCCGTT
CCCGCCGACG GCTACGTCGC CTTCACCCCG AAGTGGGGCA GATCCAGCCG CGCGCGCAGC
CTCGCCGGCG TGGCGAACGT CGCCGAGGCG CTCGTCACCG ACGGCAGAGT CGTCGCCGTC
TCCGACGGCG TGGGCGCGGG CGAGATCCCC GCCGGCAGCT TCTACCTCGT CGGCCGCGAG
AGCGCCGCCG ACGCGATCCG TGCGCTGAGA GCCGGCGACG AGGTCAGACT CGCGTACGGT
CTCAGCGGCG ACGTCGCGCA GCAGCTCCAG TTCGCGATCG GCGGCAACGA AGTGCTCGTC
CGCGACGGGC AGGTCGTCGG CAGCGACCAG TCGGTCCACC CGCGCACGGC GATCGGCTTC
AAGGACGGCG GCAGAACGCT GCTGCTGTTC GTCGCTGACG GCCGCCAGAC GCAGGTGCTC
GGCATGACGA CGCAGAAGGT CGCGCAGCTG CTGCGCGACG CGGGCGCCGA GACCGCGATG
AACCTCGACG GCGGCGGCTC GACGACACTC GTCGCCCGCC CGCTCGGCGA CAGCAGAGCG
GCGGTCCGCA ACACGCCCTC CGACGGCGCC GAGCGCCACG ACCCCAACGG CGTCGGCCTG
TTCGTCAGCG CCGGCAACGG CCAGGTCGAG CAGCTCGTGC TCTCGCCCTC CGGCGAGCAG
GCGCGCGTCT TCCCGGGCCT GCACCGCACG CTGCGCGTGA AGGCGGTCGA CGACCACCAG
ACGCCGGTCG CGCTCGCGCG CGGCGATGTC CGCTGGAGCA GCAACGAGGG CAGTGTCGAC
GGCGGCCTGC TGCGGGCGCC CGAGAACGTC TTCGGGCAGA TCCGCGTGAA GGCGACGACC
GACACCGCGC AGGAGGAGGC CGTCGTCAGA GTGCTCGGCG AGCTGCACGC GCTCGAGCTG
TCCTCGCAGC GGCTCTCGTT CTCGGCGACC GGCGCCGAGC AGGCGCGGAC GCTGAAGGTC
ACCGGCCGCG ACGCGCACGG CTTCACCGCC CCGGTCGAGT CGGCCGACCT GGAGCTCGAC
TACGACGACG CGGTCGTCAA GGTCACCCCC TCCGGCGACG CGCTGAAGAT CACGCCGCTG
CAGGCGGGCG GCACGGTGCT GACCGTCTCC GCCGGATCGC AGTCGGTCAA GCTGCCGATC
TCCGTCGGCG TCCAGACGCA CGTCAACGAC TACTTCCTGA CGACCAGCCC GATCGCCAGC
GGCAACTGGA TGTGGACGGG CACGGGGACG ATCAGACGGA CGCTCACCGA CACCGCCGAG
GGCGTCAGAG TCGACTACAC CGCCGGCCGC AACATAGGGA TCACCTCCAG AGGCACGGTC
GGGCAGTTCG CGCTGCCGGG CGCGCCGCTG AGAGTTCGCG TGAGAGTGAA GTCGAGCGCG
AACGTCAGCC TCACCTACGG TGCCTTCGCG CAGGCGGACG GCACCTACAA GAACCTCTAC
GGCGCGGCGC TGAAGCCGGG CTGGAACGAC GTCGAGTTCG CGCTCCCGGC CGACACGAAG
TACCCGATCA AGCTCGACAC CTTCCAGGCG ATCGAGACGA CCGTCGCTGC GCAGAGAGAC
GGCTCGATCG TCGTCGGCGA GGTCTCGGCC GACGTGCCGA GCGAGGTCGA GCTGCCCGAG
CAGGAGCCGC TGCGCTCCGA CCGGCTGCTG TCCGCCGACG GCGACCTGCA GGAGGGCGCG
GACTTCAGCT TCGCGACGCT CTCCGACGTC CAGTTCACGC ACGTCAACCA GGAGATGGTC
CCGGTCGCGG TCCAGGCGCT GCGGCGGATC CGCGCGACGA GACCCGACCT CGTCGTGCTC
AACGGCGACA TCGTCGACCT CGGCGCCGCC GAGGACATGA CGCTCGCGCG CAGAACGCTG
GAGGCAGGCG GCTGCCAGCT CGTCCCGCTC GGCACGCCGG ACGTCCCCAC GCCGACCGCA
GACACGGTTC CGTGCCTCTA CGTCCCCGGC AACCACGAGT CGTACGTCGC CGGCGGCCAG
GGCACGCTCG ACGCGTTCAA GGCCGAGTTC GGCAGACCGT ACGGCTACAC CGACCACAAG
GGCACCCGCT TCATCACGCT CAACAGCTCC TACGGGTCGC TGCGCGGCTC GGACTTCGCG
CAGCTGCCGA TGCTGCAGGA GGCGCTGGAG CAGGCGGCGG GCGAAGACGG CATCGACAAC
GTGATGGTCT TCGCCCACCA CCCGGTCGAC GATCCCGCCG AGACGAAGTC GAGCCAGCTC
GGCGACCGCA CCGAGGTCCA GCTCGTCAAG AAGCTGTTGT CGGACTTCCG CGACGACAGC
GGCAAGGGCG TCGCCATGGT CGGCTCGCAC GCGCAGATCA TGAACGTCCG CCGCGAGGAG
GGCGTCCCGT ACGTCGTGCT CCCGTCGTCG GGCAAGGCGC CGTACGGCAC GCCCGACCGC
GGCGGCATCA CCGGCTGGGT CCGCTGGGGC GTCGACAGCG ACGAGAACGC CGCGGGCGAC
TGGCTCGAAG GCGACGTGCG CCCGTTCGCG CAGACGATCG ACCTGCAGGC ACCGGCGACG
CTCGAGGTCG GCAACAGTGC GCCGCTCGGC GGCTCGCTCG TGCAGCCGAG CGGGGTCAGA
AACGGCAGCC GCACCGTGCC GCTGCGCTAC CCGCTGTCGC TGCGCTGGTC CGGCAGCGAG
CAGCTCGCGA TCGGCAGCGG CGACGACGCG GTCGAGGCGG CCCGCGAGGC CGACAAGACC
GCGATCCTCG ACCCGCAGAC GGGCGCGCTG ACGGCGCTCT CGACCGGCAC GGTCGAGGTC
ACGGTCGCGG CCGACTCGAT GCGCGAGGGA GACGACCTCG CGCCGATCAC GGCGACGAAG
ACGATCGCAG TCGCGCCGTC GACCGCTCCC GGCCCGAAGG CGTGGATCAG CGCGCCGGTC
TTTCCCGACC AGGCTGCCAC GACGATCGGC GCCGGGCAGC CGGTGACGGT CGCCAACACC
GGTGAGGAGC CGCTCCAGGT CGGGCTCGAC AGAGTGCTCG CGCTCGAAGG CCCGCAGGGC
GACTTCGTCG TCGCCGACGA CAGCTGCAGC GCGGCGCCCG TCGCGCCGGG CGCGTCGTGC
ACGGTGCTCG TGCGCTTCGC CCCGTCGCGC GAGAACGCGA GATCGTCGGC GCGGTTCGTC
TTCCGCGACA ACACCGCCGA GCAGCGTCAC ACGGTGACGA TCAACGCGAC CTCGACCGGT
CTGCCGAGAG GCGACAAGGG CGACCAGGGC CAGCCCGGCA TCCCGGGCGA GGACGGTCCC
GCCGGCCCGC AGGGTCCGCA GGGCCCCGCC GGCAGAGACG GGCCGCAGGG TCCGGCCGGC
AGAGACGGGC CGCAAGGCGA GTCCGGCCCG ATCGGCCCGC AGGGTCCGGC CGGCGGCGAC
GGCGCGCAGG GCCCGGCCGG CGCCAAGGGC GACACCGGCG CGCCCGGTGC GACCGGCGCC
AAGGGCGAGA CGGGCGAGCG CGGCGAGAAG GGCGCCAAGG GCGACCGCGG CGCGGCCGGC
CGGGACGCGC TCGTCACGTG CACGGTCAGA GGCGGCGTCT ACCAGAGAGT CACGTGCGTC
GTGACGTACT CGAGCAGAAG CGCGCTGCGC AACGCGAAGG TCAAGGGCAC GTCGAAGGCG
CGCCTGACGC GCGCCGGGCG CACCTACGCC AGCGGCCGCG TCGGCTCGCT CAGAGCGAGC
CGCACGGTCA GCCGCGGGCG CTACACGCTG CTCGTCGGCA GCGGCAAGAA CGCGGCGAAG
GTGGCCGTGA CCGTCCGCTA G
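
The gene sequence above is printed in space-separated blocks of ten bases. A minimal sketch for stripping that layout and computing GC content follows. `clean_sequence` and `gc_content` are illustrative names, and the record does not say how the reported 73% was rounded, so no expected value is asserted here.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record, used only as demo input.
first_line = "TTGACCTCCT CCTCCGCGCC GCGCAAGCGG CGCTGGTCGA GCCTGCCCGC GGCGCTCGCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 2))
```

Running the same two functions over the full sequence text gives the whole-gene figure to compare against the GC content field.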
 
Protein sequence
MTSSSAPRKR RWSSLPAALA AAALIVALPP AAHAGGLDLI DETEQIGPGV SMRHLKTLEA 
GGWFDYQLLT ARLQGGVVTS DLLSGDSVTE AGPISKKADR AGAVAGVNGD FFDINNSNAP
QNLAVRGGEL LKSANFGLTA PATGVTRDGI GQLLSTTLDA KATFGGADLP VAALNAAGAV
PADGYVAFTP KWGRSSRARS LAGVANVAEA LVTDGRVVAV SDGVGAGEIP AGSFYLVGRE
SAADAIRALR AGDEVRLAYG LSGDVAQQLQ FAIGGNEVLV RDGQVVGSDQ SVHPRTAIGF
KDGGRTLLLF VADGRQTQVL GMTTQKVAQL LRDAGAETAM NLDGGGSTTL VARPLGDSRA
AVRNTPSDGA ERHDPNGVGL FVSAGNGQVE QLVLSPSGEQ ARVFPGLHRT LRVKAVDDHQ
TPVALARGDV RWSSNEGSVD GGLLRAPENV FGQIRVKATT DTAQEEAVVR VLGELHALEL
SSQRLSFSAT GAEQARTLKV TGRDAHGFTA PVESADLELD YDDAVVKVTP SGDALKITPL
QAGGTVLTVS AGSQSVKLPI SVGVQTHVND YFLTTSPIAS GNWMWTGTGT IRRTLTDTAE
GVRVDYTAGR NIGITSRGTV GQFALPGAPL RVRVRVKSSA NVSLTYGAFA QADGTYKNLY
GAALKPGWND VEFALPADTK YPIKLDTFQA IETTVAAQRD GSIVVGEVSA DVPSEVELPE
QEPLRSDRLL SADGDLQEGA DFSFATLSDV QFTHVNQEMV PVAVQALRRI RATRPDLVVL
NGDIVDLGAA EDMTLARRTL EAGGCQLVPL GTPDVPTPTA DTVPCLYVPG NHESYVAGGQ
GTLDAFKAEF GRPYGYTDHK GTRFITLNSS YGSLRGSDFA QLPMLQEALE QAAGEDGIDN
VMVFAHHPVD DPAETKSSQL GDRTEVQLVK KLLSDFRDDS GKGVAMVGSH AQIMNVRREE
GVPYVVLPSS GKAPYGTPDR GGITGWVRWG VDSDENAAGD WLEGDVRPFA QTIDLQAPAT
LEVGNSAPLG GSLVQPSGVR NGSRTVPLRY PLSLRWSGSE QLAIGSGDDA VEAAREADKT
AILDPQTGAL TALSTGTVEV TVAADSMREG DDLAPITATK TIAVAPSTAP GPKAWISAPV
FPDQAATTIG AGQPVTVANT GEEPLQVGLD RVLALEGPQG DFVVADDSCS AAPVAPGASC
TVLVRFAPSR ENARSSARFV FRDNTAEQRH TVTINATSTG LPRGDKGDQG QPGIPGEDGP
AGPQGPQGPA GRDGPQGPAG RDGPQGESGP IGPQGPAGGD GAQGPAGAKG DTGAPGATGA
KGETGERGEK GAKGDRGAAG RDALVTCTVR GGVYQRVTCV VTYSSRSALR NAKVKGTSKA
RLTRAGRTYA SGRVGSLRAS RTVSRGRYTL LVGSGKNAAK VAVTVR
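
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below builds the codon table from the usual TCAG-ordered amino-acid string and reads the first codon as methionine when it is one of the table 11 initiators, which is why the TTG at the start of this gene appears as M in the protein. `translate_cds` is an illustrative helper, not the database's own tool, and it assumes the input is already in the coding orientation.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiator codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # an initiator codon is read as Met
    return "".join(residues).split("*")[0]


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGACCTCCT CCTCCGCGCC GCGCAAGCGG"))  # prints: MTSSSAPRKR
```

The output matches the first ten residues of the protein sequence above.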