Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_1067 |
Symbol | |
ID | 8731502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 1125359 |
End bp | 1128628 |
Gene Length | 3270 bp |
Protein Length | 1089 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 646501684 |
Product | Collagen triple helix repeat protein |
Protein accession | YP_003392874 |
Protein GI | 284042534 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.239328 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.652246 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTTT CTGTCCCGCT CGCGGCCGCC GGGCTCGCGG CCGCGCTCCT CGGCGGCGCG GGCGCCTCCG GCGCCTCGGC CTCGACCGCC TTCGTCAGCG GCGGCCCCAG CAGCGGACTG GTCGTCCCGT TCGACCTCGG CACCGCGACC GCGAGACCCG CGATCACAGT CGGCTACGGC CCGCTCGCGA TCGTCCCAGC ACCTGACGGG AGAACCGTCT ACGCGATCTC ACAGAGCCTG ATGGAGGGCA CGTCGGTCAC GCCGATCGAC GTCGCGACCG GCACGGCGCT GACGAGAATC ACGGGCACGG CGTTGAGAGC GGCCGGCGGC GGCGCGATCG CGCCGAACGG CCAGACGCTC TACGTCACCG CCGGCACCAC GCTGCTGCCG ATCGACCTCA GCGGGCCGAC GCCGGCGATC GGCACGCCGA TCCCGCTCGG CGACACCGCC GTCAGCAGCC CGGTGATCTC GCCCGACGGG AGCACCGCCT ACGTGATCGC CGGCAGCACC GGCGTGCTGC CCGTCGACCT CGCGACCGGC ACGGCCGGCG CGAGACTGGC GATCCCCGGC ACGCTCGGCC GGCTCGCCAT CACCAGAGAC GGCACGACGC TCTACGCCGC GCAGACCCGC ACCGCGACGG GCACGGGCAA CCTCGGCGTC GTCCCGTTCG ACACCGCCAC CAGAACCGCC GGCCCGATCG TCGGGATCGG CGCGCTGACC GCGTTCGGCC CGGAGGGCAT CGCCGTCGGT CCGGACGGCA GAACCCTCTA CGCGACGCGC AGCAACACCG TCGACCCGAA CCTCATCATC GACGTCGACC TGGTGAGCGG CACGCTGACC GAGACCGCGC TCGGCAGCCG CACCAACACG CGCGGCCTCG CGCTGACGCC GCGCGGGAGA ACGGCGGTCG TCGGCAACTT CGGCCTCGGC ACGCTCGGCG TCGTCGACCT GCCGACGCGC TCGGTCGTGC AGACGGCGAG ACTGGCGCTG CCGGGACAGA CCGTCAGCCC GATCGCGGTC GGGATCGTCT CGACGAGAAG CCCGACCGGC GCCGCGACCC CGACGATCGC CGCCAGCGTC CCGACGCAGT CGGGCGTGAT CGGCGACGCG ACGAACCCGG CGATCAGAGC GACGATCGAG CAGCTCGACG AGTACGGCGA CCCGGCCTCG CCGAGCGAGC TGACGGTCGA GGCGACGTCG TCCAACCCCG CCGTCGTGCC GACGAGCGGC ATCGCCGTGA GCGGCACCGG CGCGACCCGC ACCGTCTCCT TCGCACCGAC CGGGCGCGGC CACGCGACCG TGACGCTGAG AGTCACCGGC CTGGAGGGCA AGAGCGCGAC GACGACGATG ACCTACTCGG CCTCCAGAGC GACGACGCCG ACGAGCCGCG CCCTGCAGGG CAGCGGCGAC TCGTCCTCGG CGATCTCCGC CGGCGGCGGC CACCTGCTCG TCGCCGACGA CGAGAGAGAC GACATCCGGC TCTACCGCGA CGACGTCACC GGCGAACCGG TGAGATCGTT CAACATCGGC CCGGCGGCGA CCGGCGGCGG CGAGATCGAC TACGAGTCGT CCGCCCGCAA CGGCGACGTC ATCTATTGGC TCGGCTCGCA TGGCAACAAG AAGAGCGGCA GCCTCGAGAC CTCGCGCCAC ACGCTGATCG CGACGAGAGT CGCCGGCGAG GGCGCCGACA CGACACTGAC CCGCACCGGT ATCTACGGCA ACCTGCGGAC CGACCTCGTC GCCTGGGACC AGGCGAACGC GAACCGCCTC GGCTTCGCCG CCGGGACCCA GAGCGGCGTG CTGCCGGACG CCAGAAACGG CTTCAACATC GAGGGCGCCG ACATGGCGCC GGGCTCGACG AAAACGCTCT ACCTCGGCTT CCGCTCGCCG CTCGTCACGA CGCCCGACGG CGACCGCGCG GTGATCGTCC CGGTCACCAA CGTCGCCCTG CTGGCGACGG GCGAGGCGCC GAAGGCGACG TTCGCCGACC CGATCCTGCT CGACCTCGAG GGGATGACGA TCCGCGAGCT GCGCAAGAAC GCGGCCGACC AGTTCCTGAT CCTCGCCGCC AAGAGAGGCG CGCTCGGCGT CGAGCAGGCG CTGTGGAGCT GGTCGGGCCA CCGCGAGGAC AAGCCGGTCA AGCTGACGAC CGCGCTGCCG CCGAGCGCCG AGTCGTTCTC CGACGGGCAG GGCACGTGGG AGGCGATCGG CACGCTGCCG GACGTGCTCG CTCCGAACGC CGCGCTGCGG CTCGTGATGG ACCAGGGCTA CGACGAGCTG TACGACGGGC AGGACAACAA GGACATCAGC GACGTGCGGC TGAAGAAGTC GCGCATCGAC GTCTTCTCGC TCACCGGCGC GGTCGGCGCT GACGCCGTCG CGGCGGCGCC GGCGTTCCCC GCTCAGGCGG CCGGGACGAT CGGCCCGGCG CAGGCGGTGA CGGTCAGAAA CGCCGGCGCG CAGCGGCTGA GAATCGGCTC CGTCGGCGTG GAGGCGGACG CGGCGGTCGC CGACGGCGAC TTCCTGATCG CCGCCGACGC GTGCGCCGGG AAGGAGCTGG GTCCCGACGC GAGCTGCCGC GTGCTCGTCC GCTTCGCGCC GGCGCGCGAG AGCGCGACGT CGACGGCGCG GCTGGTGCTG AAGGCGAACG TCGCGGGCGG CGCGGCGGCG GTCGCGCTGA CGGGCACCAG CACGACGCTG CCGGCGGGTC CGACGGGTCC GACCGGGCCG GCCGGCCCCG CGGGACCGGG CGGTGAGGAC GGCGCGAACG GGCCGAAGGG CGACAGAGGC GACGCCGGCG CGAAGGGCGA CGCGGGCGCG GGCGGGCCGA AGGGCGACGC CGGGGCGGCC GGGCCGAAGG GCGATCCCGG TGCCAAGGGC GACCGCGGCG ACGGCGGCAC GCCCGGGAGC GCGGGGCCGA AGGGCGACAG AGGCGACAGG GGCGCGGACG GGTCGATCGT CTTCGCCGCG AGCCGGTCGC AGCTCGCCGC CCGCCGCGGG CGCACGGTGA GCCTGCCGTT CGAGCTGCGC AACACGACCG GCGGCGCGAT CGCGCGAGCG ACCGCGACCG TGCGGGTGCC GGGCGGGCTG CGGATCGCGC AGCCGAAGGC GGTCCGGATC GCGTCGCTGA AGGCGGGCGA GGGCCGCACG CTGCGGCTCC GGCTGCGGAT CGGGCGCGGC GCCCAGCTCG GGCGCCACCG CGTGCAGGTC CGCCTCGACG TCGGCGGCCG CAACGTCACG CGCACCGTGA CGGTCGACGT GCGCCGGTAG
|
Protein sequence | MRVSVPLAAA GLAAALLGGA GASGASASTA FVSGGPSSGL VVPFDLGTAT ARPAITVGYG PLAIVPAPDG RTVYAISQSL MEGTSVTPID VATGTALTRI TGTALRAAGG GAIAPNGQTL YVTAGTTLLP IDLSGPTPAI GTPIPLGDTA VSSPVISPDG STAYVIAGST GVLPVDLATG TAGARLAIPG TLGRLAITRD GTTLYAAQTR TATGTGNLGV VPFDTATRTA GPIVGIGALT AFGPEGIAVG PDGRTLYATR SNTVDPNLII DVDLVSGTLT ETALGSRTNT RGLALTPRGR TAVVGNFGLG TLGVVDLPTR SVVQTARLAL PGQTVSPIAV GIVSTRSPTG AATPTIAASV PTQSGVIGDA TNPAIRATIE QLDEYGDPAS PSELTVEATS SNPAVVPTSG IAVSGTGATR TVSFAPTGRG HATVTLRVTG LEGKSATTTM TYSASRATTP TSRALQGSGD SSSAISAGGG HLLVADDERD DIRLYRDDVT GEPVRSFNIG PAATGGGEID YESSARNGDV IYWLGSHGNK KSGSLETSRH TLIATRVAGE GADTTLTRTG IYGNLRTDLV AWDQANANRL GFAAGTQSGV LPDARNGFNI EGADMAPGST KTLYLGFRSP LVTTPDGDRA VIVPVTNVAL LATGEAPKAT FADPILLDLE GMTIRELRKN AADQFLILAA KRGALGVEQA LWSWSGHRED KPVKLTTALP PSAESFSDGQ GTWEAIGTLP DVLAPNAALR LVMDQGYDEL YDGQDNKDIS DVRLKKSRID VFSLTGAVGA DAVAAAPAFP AQAAGTIGPA QAVTVRNAGA QRLRIGSVGV EADAAVADGD FLIAADACAG KELGPDASCR VLVRFAPARE SATSTARLVL KANVAGGAAA VALTGTSTTL PAGPTGPTGP AGPAGPGGED GANGPKGDRG DAGAKGDAGA GGPKGDAGAA GPKGDPGAKG DRGDGGTPGS AGPKGDRGDR GADGSIVFAA SRSQLAARRG RTVSLPFELR NTTGGAIARA TATVRVPGGL RIAQPKAVRI ASLKAGEGRT LRLRLRIGRG AQLGRHRVQV RLDVGGRNVT RTVTVDVRR
|
| |