Gene Cwoe_1067 details


Gene Information

Locus tag: Cwoe_1067
Symbol:
ID: 8731502
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 1125359
End bp: 1128628
Gene Length: 3270 bp
Protein Length: 1089 aa
Translation table: 11
GC content: 74%
IMG OID: 646501684
Product: Collagen triple helix repeat protein
Protein accession: YP_003392874
Protein GI: 284042534
COG category: [S] Function unknown
COG ID: [COG3391] Uncharacterized conserved protein
TIGRFAM ID:
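
The Start bp, End bp, Gene Length, and Protein Length fields above are consistent with one another if the coordinates are read as 1-based and inclusive, and if the gene length includes one terminal stop codon. Both readings are assumptions here, not something the record states. A minimal sketch of that arithmetic, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1125359, 1128628)
print(length)                     # 3270
print(protein_length_aa(length))  # 1089
```

Both results match the Gene Length and Protein Length values listed in the record.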


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.239328
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.652246
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTTT CTGTCCCGCT CGCGGCCGCC GGGCTCGCGG CCGCGCTCCT CGGCGGCGCG 
GGCGCCTCCG GCGCCTCGGC CTCGACCGCC TTCGTCAGCG GCGGCCCCAG CAGCGGACTG
GTCGTCCCGT TCGACCTCGG CACCGCGACC GCGAGACCCG CGATCACAGT CGGCTACGGC
CCGCTCGCGA TCGTCCCAGC ACCTGACGGG AGAACCGTCT ACGCGATCTC ACAGAGCCTG
ATGGAGGGCA CGTCGGTCAC GCCGATCGAC GTCGCGACCG GCACGGCGCT GACGAGAATC
ACGGGCACGG CGTTGAGAGC GGCCGGCGGC GGCGCGATCG CGCCGAACGG CCAGACGCTC
TACGTCACCG CCGGCACCAC GCTGCTGCCG ATCGACCTCA GCGGGCCGAC GCCGGCGATC
GGCACGCCGA TCCCGCTCGG CGACACCGCC GTCAGCAGCC CGGTGATCTC GCCCGACGGG
AGCACCGCCT ACGTGATCGC CGGCAGCACC GGCGTGCTGC CCGTCGACCT CGCGACCGGC
ACGGCCGGCG CGAGACTGGC GATCCCCGGC ACGCTCGGCC GGCTCGCCAT CACCAGAGAC
GGCACGACGC TCTACGCCGC GCAGACCCGC ACCGCGACGG GCACGGGCAA CCTCGGCGTC
GTCCCGTTCG ACACCGCCAC CAGAACCGCC GGCCCGATCG TCGGGATCGG CGCGCTGACC
GCGTTCGGCC CGGAGGGCAT CGCCGTCGGT CCGGACGGCA GAACCCTCTA CGCGACGCGC
AGCAACACCG TCGACCCGAA CCTCATCATC GACGTCGACC TGGTGAGCGG CACGCTGACC
GAGACCGCGC TCGGCAGCCG CACCAACACG CGCGGCCTCG CGCTGACGCC GCGCGGGAGA
ACGGCGGTCG TCGGCAACTT CGGCCTCGGC ACGCTCGGCG TCGTCGACCT GCCGACGCGC
TCGGTCGTGC AGACGGCGAG ACTGGCGCTG CCGGGACAGA CCGTCAGCCC GATCGCGGTC
GGGATCGTCT CGACGAGAAG CCCGACCGGC GCCGCGACCC CGACGATCGC CGCCAGCGTC
CCGACGCAGT CGGGCGTGAT CGGCGACGCG ACGAACCCGG CGATCAGAGC GACGATCGAG
CAGCTCGACG AGTACGGCGA CCCGGCCTCG CCGAGCGAGC TGACGGTCGA GGCGACGTCG
TCCAACCCCG CCGTCGTGCC GACGAGCGGC ATCGCCGTGA GCGGCACCGG CGCGACCCGC
ACCGTCTCCT TCGCACCGAC CGGGCGCGGC CACGCGACCG TGACGCTGAG AGTCACCGGC
CTGGAGGGCA AGAGCGCGAC GACGACGATG ACCTACTCGG CCTCCAGAGC GACGACGCCG
ACGAGCCGCG CCCTGCAGGG CAGCGGCGAC TCGTCCTCGG CGATCTCCGC CGGCGGCGGC
CACCTGCTCG TCGCCGACGA CGAGAGAGAC GACATCCGGC TCTACCGCGA CGACGTCACC
GGCGAACCGG TGAGATCGTT CAACATCGGC CCGGCGGCGA CCGGCGGCGG CGAGATCGAC
TACGAGTCGT CCGCCCGCAA CGGCGACGTC ATCTATTGGC TCGGCTCGCA TGGCAACAAG
AAGAGCGGCA GCCTCGAGAC CTCGCGCCAC ACGCTGATCG CGACGAGAGT CGCCGGCGAG
GGCGCCGACA CGACACTGAC CCGCACCGGT ATCTACGGCA ACCTGCGGAC CGACCTCGTC
GCCTGGGACC AGGCGAACGC GAACCGCCTC GGCTTCGCCG CCGGGACCCA GAGCGGCGTG
CTGCCGGACG CCAGAAACGG CTTCAACATC GAGGGCGCCG ACATGGCGCC GGGCTCGACG
AAAACGCTCT ACCTCGGCTT CCGCTCGCCG CTCGTCACGA CGCCCGACGG CGACCGCGCG
GTGATCGTCC CGGTCACCAA CGTCGCCCTG CTGGCGACGG GCGAGGCGCC GAAGGCGACG
TTCGCCGACC CGATCCTGCT CGACCTCGAG GGGATGACGA TCCGCGAGCT GCGCAAGAAC
GCGGCCGACC AGTTCCTGAT CCTCGCCGCC AAGAGAGGCG CGCTCGGCGT CGAGCAGGCG
CTGTGGAGCT GGTCGGGCCA CCGCGAGGAC AAGCCGGTCA AGCTGACGAC CGCGCTGCCG
CCGAGCGCCG AGTCGTTCTC CGACGGGCAG GGCACGTGGG AGGCGATCGG CACGCTGCCG
GACGTGCTCG CTCCGAACGC CGCGCTGCGG CTCGTGATGG ACCAGGGCTA CGACGAGCTG
TACGACGGGC AGGACAACAA GGACATCAGC GACGTGCGGC TGAAGAAGTC GCGCATCGAC
GTCTTCTCGC TCACCGGCGC GGTCGGCGCT GACGCCGTCG CGGCGGCGCC GGCGTTCCCC
GCTCAGGCGG CCGGGACGAT CGGCCCGGCG CAGGCGGTGA CGGTCAGAAA CGCCGGCGCG
CAGCGGCTGA GAATCGGCTC CGTCGGCGTG GAGGCGGACG CGGCGGTCGC CGACGGCGAC
TTCCTGATCG CCGCCGACGC GTGCGCCGGG AAGGAGCTGG GTCCCGACGC GAGCTGCCGC
GTGCTCGTCC GCTTCGCGCC GGCGCGCGAG AGCGCGACGT CGACGGCGCG GCTGGTGCTG
AAGGCGAACG TCGCGGGCGG CGCGGCGGCG GTCGCGCTGA CGGGCACCAG CACGACGCTG
CCGGCGGGTC CGACGGGTCC GACCGGGCCG GCCGGCCCCG CGGGACCGGG CGGTGAGGAC
GGCGCGAACG GGCCGAAGGG CGACAGAGGC GACGCCGGCG CGAAGGGCGA CGCGGGCGCG
GGCGGGCCGA AGGGCGACGC CGGGGCGGCC GGGCCGAAGG GCGATCCCGG TGCCAAGGGC
GACCGCGGCG ACGGCGGCAC GCCCGGGAGC GCGGGGCCGA AGGGCGACAG AGGCGACAGG
GGCGCGGACG GGTCGATCGT CTTCGCCGCG AGCCGGTCGC AGCTCGCCGC CCGCCGCGGG
CGCACGGTGA GCCTGCCGTT CGAGCTGCGC AACACGACCG GCGGCGCGAT CGCGCGAGCG
ACCGCGACCG TGCGGGTGCC GGGCGGGCTG CGGATCGCGC AGCCGAAGGC GGTCCGGATC
GCGTCGCTGA AGGCGGGCGA GGGCCGCACG CTGCGGCTCC GGCTGCGGAT CGGGCGCGGC
GCCCAGCTCG GGCGCCACCG CGTGCAGGTC CGCCTCGACG TCGGCGGCCG CAACGTCACG
CGCACCGTGA CGGTCGACGT GCGCCGGTAG
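
The gene sequence above is printed in blocks of 10 bases, 60 per line, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The function names are hypothetical. The start codon set is an assumption covering only the common bacterial starts, and the stop codons are those of translation table 11.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_complete_cds(
    seq: str,
    start_codons=("ATG", "GTG", "TTG"),  # assumption: common bacterial starts only
    stop_codons=("TAA", "TAG", "TGA"),   # stop codons of translation table 11
) -> bool:
    """Check length divisible by 3, a start codon first, and a stop codon last."""
    return len(seq) % 3 == 0 and seq[:3] in start_codons and seq[-3:] in stop_codons


# First line of the gene sequence in this record, as printed.
first_line = "ATGCGCGTTT CTGTCCCGCT CGCGGCCGCC GGGCTCGCGG CCGCGCTCCT CGGCGGCGCG"
joined = clean_sequence(first_line)
print(len(joined))          # 60
print(gc_fraction(joined))  # GC fraction of this first line only
```

To check the record as a whole, pass all lines of the gene sequence to `clean_sequence` and then apply `gc_fraction` and `looks_like_complete_cds` to the result.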
 
Protein sequence
MRVSVPLAAA GLAAALLGGA GASGASASTA FVSGGPSSGL VVPFDLGTAT ARPAITVGYG 
PLAIVPAPDG RTVYAISQSL MEGTSVTPID VATGTALTRI TGTALRAAGG GAIAPNGQTL
YVTAGTTLLP IDLSGPTPAI GTPIPLGDTA VSSPVISPDG STAYVIAGST GVLPVDLATG
TAGARLAIPG TLGRLAITRD GTTLYAAQTR TATGTGNLGV VPFDTATRTA GPIVGIGALT
AFGPEGIAVG PDGRTLYATR SNTVDPNLII DVDLVSGTLT ETALGSRTNT RGLALTPRGR
TAVVGNFGLG TLGVVDLPTR SVVQTARLAL PGQTVSPIAV GIVSTRSPTG AATPTIAASV
PTQSGVIGDA TNPAIRATIE QLDEYGDPAS PSELTVEATS SNPAVVPTSG IAVSGTGATR
TVSFAPTGRG HATVTLRVTG LEGKSATTTM TYSASRATTP TSRALQGSGD SSSAISAGGG
HLLVADDERD DIRLYRDDVT GEPVRSFNIG PAATGGGEID YESSARNGDV IYWLGSHGNK
KSGSLETSRH TLIATRVAGE GADTTLTRTG IYGNLRTDLV AWDQANANRL GFAAGTQSGV
LPDARNGFNI EGADMAPGST KTLYLGFRSP LVTTPDGDRA VIVPVTNVAL LATGEAPKAT
FADPILLDLE GMTIRELRKN AADQFLILAA KRGALGVEQA LWSWSGHRED KPVKLTTALP
PSAESFSDGQ GTWEAIGTLP DVLAPNAALR LVMDQGYDEL YDGQDNKDIS DVRLKKSRID
VFSLTGAVGA DAVAAAPAFP AQAAGTIGPA QAVTVRNAGA QRLRIGSVGV EADAAVADGD
FLIAADACAG KELGPDASCR VLVRFAPARE SATSTARLVL KANVAGGAAA VALTGTSTTL
PAGPTGPTGP AGPAGPGGED GANGPKGDRG DAGAKGDAGA GGPKGDAGAA GPKGDPGAKG
DRGDGGTPGS AGPKGDRGDR GADGSIVFAA SRSQLAARRG RTVSLPFELR NTTGGAIARA
TATVRVPGGL RIAQPKAVRI ASLKAGEGRT LRLRLRIGRG AQLGRHRVQV RLDVGGRNVT
RTVTVDVRR
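
The protein sequence can be cross-checked against the gene sequence by translating codons. The sketch below builds a codon table from the standard NCBI base ordering (TCAG). Translation table 11 uses the same amino acid assignments as the standard code and differs mainly in which start codons are permitted. The sketch does not recode alternative start codons as methionine, which is harmless here because this gene starts with ATG. The names are hypothetical, and this is an illustration, not the method used to produce the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling through TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCGCGTTT CTGTCCCGCT CGCGGCCGCC"))  # MRVSVPLAAA
print(translate("GTGCGCCGGTAG"))                      # VRR
```

The two outputs agree with the first ten residues and the last three residues of the protein sequence listed above.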