Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_1538 |
Symbol | |
ID | 4184516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | + |
Start bp | 1792019 |
End bp | 1795057 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638071532 |
Product | TPR repeat-containing protein |
Protein accession | YP_678149 |
Protein GI | 110637942 |
COG category | [S] Function unknown |
COG ID | [COG1729] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0101738 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.837321 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTAAAA AAAGTATTTT ACTGTTCTTT GCATGCATCA ATGTTTTTAT TGCATTTGCT CAGAATACAG CTGCACATAA TGCTTCAGAT CGAACGTATT ATGATGGCAT GGAACTCTTT GATAAGCAAA AATATGTTGC TGCCAGAGAA TTGTTCCGCC AGTATATTCA ATCTGCCCCA CAGGAAACCA GAGCCATTGA ATGTGAATAT TATATAGGCC TGTGCGCCTT GAATTTGTTT AACGACGATG CAGAGTATCT GTTAAATAAT TTCGTTGAAA AATATCCCAA TCATATTAAA TCGGGCAGAG CCGGTTTTGA CTTAGGAAAT TTTTTTTATA CCAATAAATC ATACGATAAG GCCATTCTAT ACTATGCTAA AGTAGATGAA TCAAAACTTT CTCACGAAGA ATTAAATAAC TACTATTTTA AGTCGGGTTA TTCAAGCTTT ACCAAAAAAG ATTTTGCTAC AGCACTGGAT AAATTTAATA AATGTAAAGG TGCCAAACAT CAGTACACAC CTGCTGCAAA TTATTATGCC GGCTATATAG AATTTAAAAA TGGTGAATAC GATACGGCAA TAGCTGATTT GCAGAAAGCA GCGGAAAGTA AGGAGTATAA GCCTTTAGTA GCCGTATTGA TTGCTAATAT CTATTACAGA CAGGCGAAGT ATGATGAACT GATCCCGTAT GCGGAAAAAG TAATTGCAGA TAAATCAGCC GGTCCTAATA CCAATGATGT GAAGCTGATC CTGGCAGATG CCTATTTTTT CAAACAGGAA TATGCTAAAG CAACACCTTT ATTTAAGGAT TATTTAACGG CGACAGGTAC AAAGAGCCTT ACACCGGATA TGAAATACCG GATCGGATTC TCCTCATACA AAGCAGCAGA TTATAAACAG GCCGTAGATA TGCTTCAGGC AATTGCTACC GATAAAGATT CTCTCGGACA GTCATCTGCT TATATATTGG GTTTGAGTTA TTTAAAATCA GAAAACAAAA ACGCTGCGCT GATCAGTTTT GAACTGGCAC AGCGTTCTGT ATTCAGTGCC GTAATAAATG AAGAAGCAAT GTTTTTGTAT GCAAAGATAA CATCTGATTT AGGGCGTTTT ACTGAAGCAA CACCACGTTT AAAGAATTTT ATTGAAAAAT ATCCGAAGAG TTACCACATG CAGGAAGCAT ATGAATTGCT GAGTGAATCA TTTTTAGGTT CCCGCAATTA TGATGAAGCC TTAGTATATA TCGAAAATTT AAAACATCAG AACTCCCGCA TTCACACTAC GTACCAGCGT ATTGCTTATT ACAAAGCCGT TGAACTGATT AATAAAAAAC AGTATGCTGC CGCTATTGTT GCACTGGATA AGTCTTCTAC CTATAATTTT GACAAGACGG TATACATCAA TTCGTTTTAC TGGAAAGGCG AATGTCTTTC GTTGGAAAAA AGATACCCGG AAGCCATTGA GTCATATTCA ACGGCTATTG AAAAAGGCAA TACGCTTGAA AATACCAATG TTGCAAAAGC CTATTACGGA CAGGGCTATG CATATTATTA CCAGAAGAAA TGGGACGAAG CCATTCCGCA GTTTGTAAAA TTCATTACCA TACAGGCAGA GAAAAAAGGC AAAGGAGATA TTTATTACCG CGATGCTGAA TTACGTTTGG CTGACCTGTA TTTTGTAACC AGACGTTTTC CTGAAGCAAT AAAATATTAT GACTTTGCAC TTGCTGAACG GAATGAAGAT GAAGATTATA TTCTTTTTCA GAAAGCAACA ATCAATTACT TAACCAATAA AACGTCGCAG GCATATTCCA TGTATGCGCA GCTGACAAAT AAATATCCGA ATTCTGCATA TTACGATCAG GCATTGATGC AACGTTGTCA GCTGGATCTG GAAGCCAGTA AGTACGAAGT AGCTATTGAA GGCTACAATA AGATTATCAA TAAAAAAGAA GATCTGAACG GACTTAAACC AATTGCCTTA CAGAAGCGTG CGATTTCCTA TTTCAATTTA AAAGATTATA ACAAAGCGAT TGCCGATAAT AAGCGCATCG TGTATGAATA CCCTAAAAGC GCTCCGGCTT ACAGTGCGTT ATTAAGTATT CAGGAGATGC TGGGCATTCA GGATAAATCA GAAGAGTTTG CACCAATTCT TGCAAAGTAT AAAGAATTAA ATCCAAGTGA TCAGGATCTG AAAGAAGTAG AATTCAGAAG TGCACAATCG TTGTACAGTT CTCAGAAATA TCCGCAGGCG ATTACAGGTT TTGCTGCCTT TATCCGTCAG TATCCGACAC ACCCGAATGT TAGTGAAGCA CAGTATTACC TGGCAGATTC GTATTTCAGA ACAAAAGATT ACAGCAACGC GAAGGCAACA TACATTGAGA TTCTGGCAGA AAAAAATCAA TATTATAAAA AATCGCTGCA GCGTGTGGCA GATATTGCTT ACCTGCAAAA CGACCTGCCG GGTTCATTAA AATATTATTC CGAATTCCAG TCGCTGGCAT CAAGCAATAA AGAAAAAACC GCAGCCTGGA CAGGTTTAAT GGTTGCGTTT TTTGATTCCA ATCAATTAGA CTCCTGCATG GCATATGCTA CGCGCATTAT AAGCCATGGT TCGGCAGCGC CGGATGCAGA AAATAAAGCA CAGCTTTATA AAGGAAAAGT ATCCTATCAG AAAAAAGAGT ATGATGCTGC CCAGGATTAT TTATTGTCTT GCATAAACGG ACCTAAAGAT ATCAGTGCCG CGGAAGCAAT GTATTACGTA GCCAAAATCC AGTTCGATAA AAAGCAATAT GCTCAATCAA TGGAAACGCT GTATGATTTC AATAATACAT TCAACAGTTA CGAAACATTG CTTGGTAAAT CGTATTTATT GATCGCAGAA AATCTTGTTA TGAAGAATGA GATTTTCCAG GCGAAAGAAA CCTTGAACTC AATCATTTTG AAATTCCCGA ACGCAGATAT TAAAAAGCAG GCTCAGGCAC GTTTGGATCA ATTAAATGCT ACAGATGCTA AAACAAAGGA GGGTGCAGGA AATGAATAA
|
Protein sequence | MVKKSILLFF ACINVFIAFA QNTAAHNASD RTYYDGMELF DKQKYVAARE LFRQYIQSAP QETRAIECEY YIGLCALNLF NDDAEYLLNN FVEKYPNHIK SGRAGFDLGN FFYTNKSYDK AILYYAKVDE SKLSHEELNN YYFKSGYSSF TKKDFATALD KFNKCKGAKH QYTPAANYYA GYIEFKNGEY DTAIADLQKA AESKEYKPLV AVLIANIYYR QAKYDELIPY AEKVIADKSA GPNTNDVKLI LADAYFFKQE YAKATPLFKD YLTATGTKSL TPDMKYRIGF SSYKAADYKQ AVDMLQAIAT DKDSLGQSSA YILGLSYLKS ENKNAALISF ELAQRSVFSA VINEEAMFLY AKITSDLGRF TEATPRLKNF IEKYPKSYHM QEAYELLSES FLGSRNYDEA LVYIENLKHQ NSRIHTTYQR IAYYKAVELI NKKQYAAAIV ALDKSSTYNF DKTVYINSFY WKGECLSLEK RYPEAIESYS TAIEKGNTLE NTNVAKAYYG QGYAYYYQKK WDEAIPQFVK FITIQAEKKG KGDIYYRDAE LRLADLYFVT RRFPEAIKYY DFALAERNED EDYILFQKAT INYLTNKTSQ AYSMYAQLTN KYPNSAYYDQ ALMQRCQLDL EASKYEVAIE GYNKIINKKE DLNGLKPIAL QKRAISYFNL KDYNKAIADN KRIVYEYPKS APAYSALLSI QEMLGIQDKS EEFAPILAKY KELNPSDQDL KEVEFRSAQS LYSSQKYPQA ITGFAAFIRQ YPTHPNVSEA QYYLADSYFR TKDYSNAKAT YIEILAEKNQ YYKKSLQRVA DIAYLQNDLP GSLKYYSEFQ SLASSNKEKT AAWTGLMVAF FDSNQLDSCM AYATRIISHG SAAPDAENKA QLYKGKVSYQ KKEYDAAQDY LLSCINGPKD ISAAEAMYYV AKIQFDKKQY AQSMETLYDF NNTFNSYETL LGKSYLLIAE NLVMKNEIFQ AKETLNSIIL KFPNADIKKQ AQARLDQLNA TDAKTKEGAG NE
|
| |