Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_0033 |
Symbol | |
ID | 4183682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | + |
Start bp | 42373 |
End bp | 44673 |
Gene Length | 2301 bp |
Protein Length | 766 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 638070031 |
Product | hypothetical protein |
Protein accession | YP_676667 |
Protein GI | 110636460 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000843922 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000130062 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACACAAA TAGATTACAA AAAAGCATTT TCAAAGTTAA GTCACATTAA AGCATTATCT CATGATCAAA AATTTGATCA AATTGTTCAA AATTTAATCA CTCTTGCTTT AAATCAAAAG GTAGAGGAAA ATCCAAAAAA TGAAGCACAA GTAGCGGCTA GAATTAATGA CATTTATGGA ATGTCTATAA GGCTTCCAAT AATTTTATCA AATATCGATA AACTACTTTC ATTAGGTGAA ATAACAAAAG ATCCATTATC AAAACAACTA CATGTCACGC CAGTTATTTC AAATAAATTA AAACAACGAC TTGAAGTTGC ATCTCAATTA GAGAATAGAG TAAAAATTGG GTGGTATAAT GAATTGAAAG CTTTCAATCC TGATATCTCA GATGAAACAT TAAATACTCT TTGGGAATGT TTAGAATCAT ATTTATGTAA TGTTTTTGAA CAGCATGGTA TACAAACACT ACATTTACTA AATCCCAATG CGAAAATTGA AGAAGACGAT CAGAAAAGTT TGACAGATAT AATTGAAAAT ATTATAAAAG AAAACAATGA TTCATTATCA AAAGAAATTT TAATTGCTTC AATTAATCAA TTCATAACAA ACGCAGACGA AACTCGAACA AATTTTATTT CCCAACTAGC TGATTCGACC TTCACAAGCT TTGCTTTAAC ATCGGATGCA GAAACTGTAA ATTTTTTAAA TGAGCGATAT AATAATCTAC AACTATTTCT AGACACAAAT TTTATTTTCG GAATTTTAGA TTTGCATAAA AATAGTGAAG ATGCTTCAGC TAGAGAAATT CTAGAGGAGG TTAAAAAAAA TAGGTTACCT TTTAGACTCG CTTACCACCC CGAAACCTTA GCCGAATTTA AAAGAGCTTT TGATGCAAGA GCTCTACACA TTAGGGCATC AAAATGGACT AGAGAAACTA GTAGAGTTGC AATAACTGTT GATGGATTAA GTCCATTAGA AGAACTATTT CACAAACAAA ATATAGATAA TGAAATTGAT ACCTCTGTAT TTTTGGACAA ATATGATCAT GTTGACATTA TTCTTAAAGA CCTAGGCTTA ATAGAATATA CACCACAAGC TTTTACAAGT GATGAGGAAT ATGTAGATAT TGAAAGTGAC ATTGAAAAAT ATCAAATATT TTATGATCCA TTAACTAATA GAAAGCCCAA ATCTTATCTT GGATTCAAAC ATGATGTTGT TGTAATAAGA GAAGTGCGTA GGCTTAATCC TAGAAAAACA AAATTTTTAG AAAGTCATGC ATTTTTTATA AGCTCAGATT ATATACTCGC AAAATTTGAA AAAAAGCATT ATAGAAGAAA TTGGGAAATA AATTATGTTG TTAGTCCGAG TGTCTTTTTA CAATTAATTA GACCATTTAT TGAAAATGAT TACTCTTCAA ATAAACGCTT TATTGATACT TTTTCTATTC CAGAATTGCG TGCTTTTGAA ATTGATTATT CATCAACACG TTCAAAAGCT CTTCAAATTT TAAACGATAA TTATCATGAT ACCTCTTTTG AAACAAAGGT GAAAATTTTA AGAGATCAAG TGATCTTAGA AAAATTAAAA AAGGCAAATG ACGATTTCTC AGTTCAACTT GAAATAATAG AAAATCAAAT TTCAATAGAA AATCAAATTC TCACAAAGCA GAAAGAAGAA GCTTTAAATG ATATTCAAAA AATACAAAAA GAAAAGGATA GAATTGAAGA AGATAAGTAC AAAGTAGAAG TTGAGAAAGA GACTGCTTTG ACAGAAATTG AAATTAAGAA TACTGAACTA AAAGTTAAAT CAGAAGAATT AATTGTATTA AAGAACACTC ATGCTATTTT AGAACTTAAA CAACAATTAG AATATAAACA ACAATTGCTT TCAATAACAG AAAGAAATAT TGAATCTTAT GATAAAAGAC ATCCTCCTAT AGCTGCTATA ATTGAAAAAA AGTTGAAAAA TCATAGATTC TATTGGGTAT TATCACCTGT TTTATTTTTT GCTTTTGTTA TTTTCCTTAT TTATAAATTA TCTTGGGAAA TCATGGAACC TTACACTTAT ATTTTTTCGA TGATAGCTGC ATTGGCGGGG TATTTATATT TTGCTATAAA TGGAGATAGT TTTGACCCGA GAAAATATTT TAAATATTAC AACAAAGGAA TCACTTCTAA AATATATAGT GAATTTGATT TCAATATTGC AGACCTTGAA TTGCAAAAGG AAACAAAAAA ATCTTTAGAA GATGAAATAA TAGGACTTAA AGCTGAAATT CAAAAAAAAC AAATCCCCTA A
|
Protein sequence | MTQIDYKKAF SKLSHIKALS HDQKFDQIVQ NLITLALNQK VEENPKNEAQ VAARINDIYG MSIRLPIILS NIDKLLSLGE ITKDPLSKQL HVTPVISNKL KQRLEVASQL ENRVKIGWYN ELKAFNPDIS DETLNTLWEC LESYLCNVFE QHGIQTLHLL NPNAKIEEDD QKSLTDIIEN IIKENNDSLS KEILIASINQ FITNADETRT NFISQLADST FTSFALTSDA ETVNFLNERY NNLQLFLDTN FIFGILDLHK NSEDASAREI LEEVKKNRLP FRLAYHPETL AEFKRAFDAR ALHIRASKWT RETSRVAITV DGLSPLEELF HKQNIDNEID TSVFLDKYDH VDIILKDLGL IEYTPQAFTS DEEYVDIESD IEKYQIFYDP LTNRKPKSYL GFKHDVVVIR EVRRLNPRKT KFLESHAFFI SSDYILAKFE KKHYRRNWEI NYVVSPSVFL QLIRPFIEND YSSNKRFIDT FSIPELRAFE IDYSSTRSKA LQILNDNYHD TSFETKVKIL RDQVILEKLK KANDDFSVQL EIIENQISIE NQILTKQKEE ALNDIQKIQK EKDRIEEDKY KVEVEKETAL TEIEIKNTEL KVKSEELIVL KNTHAILELK QQLEYKQQLL SITERNIESY DKRHPPIAAI IEKKLKNHRF YWVLSPVLFF AFVIFLIYKL SWEIMEPYTY IFSMIAALAG YLYFAINGDS FDPRKYFKYY NKGITSKIYS EFDFNIADLE LQKETKKSLE DEIIGLKAEI QKKQIP
|
| |