Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_1854 |
Symbol | |
ID | 4185924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | + |
Start bp | 2171808 |
End bp | 2174987 |
Gene Length | 3180 bp |
Protein Length | 1059 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638071852 |
Product | hypothetical protein |
Protein accession | YP_678462 |
Protein GI | 110638253 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00800733 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAATCCC TGTTTGTATC CATTACCTTT TTTTGTTTTA GTATACTGAA TCTTTCCGCG CAAACCATTA CTTTTCCGGA TGCTAATTTT CAAAACGCAT TGCTTACACA TTGGCCGGCT ATTGATATAA ACAACGACGG ACTTATTCAG ATAAGTGAAA TAGAAAATCT AACAAGTTTA TATGTTGCAG GTAAATCAAT TTCAGATTTA AGCGGCCTTG AGGCATTTCC CAAATTAGTA AGTTTAAACT GCAGCAATAA TAGTTTATCA CACATAGATC TGTCTCACAA TCCGGAGTTG AAATTTTTAG AACTCGGCTG GAATAGCATT TCGAATATTG ATCTCAGCAA AAGTACGAAG CTGCAGGTCT TAGGGCTTCA GGATAATGGC TTGACCTCTA TTGACGTAAC CAGTAATAAA GATCTGCGGG AATTAAAAGT AGAATATAAT GCATTAACGC AGCTGGATGT ATCAGAAAAC AGATTCATCT GGTATCTGAA TTTTTCTGAC AATCAAATAA GTACGATCAA TTTAAATCCT ATCCGTTCAT TAAGTGTACT TTCTGCCGGA AATAATTTGC TTACTTCACT CAACTTAAGC TGTCACACCA ATTTGTCTTA CGTTACAATT GACAATAACT CATTATTAAA TTCAGTATGC CTGAATAGCT CGGATTATGC AGAACGCATG TCAACTACAG ACCCAAACAG TGCGTATTGG ACGAAAAGTT CAAACACTGC CTGGGCAAAT TGTCCTTCGC TTCCGCTTGT TACACCTATA CCGGCAACAA TTGTACCTAT TAACCTGTGT GAAACAGGAG ATATGACATT CAATGCCGTT TCAGAACAGT TTATCTCTAA CCCAACCTAT CGCTGGCAAA AAGCAAACTC TGCTGCAGGG CCATGGGTAA GTATTCCCGG CGCAATCAAT CAATCTCTCG TGCTTACAAA CCTGCAGATG AGCGATAATG GTTCGATATA TAAAGTGCTG ATCAGCTCAA CGGATTATTG CGGAACAGGA CATATGGAAG AGGCAGCCGG ACAAATCATA TTACATCCGG CGGTTTATCC AAAGGTTACA GCAAGTGCTT CTGTAAGCGG CGCCATATGC GATAATGCAT TGCCAATCAC ATATACAGCA ACGCCTGTAG CCGGACAAGG CTCTGCTCCT GCGTATCAAT GGTACTATTA TAACAATCCC GGTTTCACAG CTATAGCAGG TGCTACCGAT ATTCAGTATA CACCTGCAGT GCTTCCGCCA GATGGCACAC AATTATTTGC AGGTATGTTT ACAGATGAAA TGTGTGCGGT TGAATCCGGA GTTGCATCCA ATATCCTTAC TGTTGCTGTT GTACCAGCTC CCGAACCAGT GATAACCACA AGCGACATCG CTCTATGCAA TCCTTCCGGA TATGTCATAC AGACAAGCCA CACTGCGGCA ACAGGCACAC AGTTTCAATG GTACAGAAAC GGTATTGCCC TTGCTGGAAG TAATGTACGG AATCTTACCA TTCCTGGTGC CGGCACCTAT TATATGACTG AAGATAATGG CACCTGCACT ACAGCTTCCG GTACTGTAAC AATTTCTACA TCATCTACTA CGCCAGATCC GCAGGTGAAC GTCAGTTCTT CCCTTACGGG CGCAGCCTGC GACACGCTTC AAAAAATTAC ATATACTGCC ACAGCAGTTG CCGGAACGGT TACAGCCCCA ACCTATCAAT GGTACAATGC TGTTACAGAT ACACCTATCA GCGGCGCAAC CGATGCAGTA TACACACCTG CAGCCAAACC TATGAATGGA GACCAGGTCT ATGTACAGAT ACACACAAAC GAAGCCTGTG TAGCAAATCC GGATGTAAAA TCCATTACGT TAACTACTCA AATACTTACT ACACCTGCCC CGCTTATTAC ATCCGGCAAT ACAGCATTGT GCAATCCATC CGGATATGTC ATACAGACAA GCCAGACCGC AGCAACAGGT ACGCAGTTTC AATGGTTTAG AAATGGCAGT GCATTAACAG GAAGTAATAC ACCTACGCTT ACCATTGCAT CTGAAGGCAC ATATTATCTG GTTGAAGATA ATGGTACCTG TAAAATATCC TCAGGCAGTG TAACAATTTC TGCTATAGCC GCTATGCCGG ATCCACAGGT AAACATAAGT TCTTCACTTA CCGGTACTGC ATGTGATACA CTTCAAAAAA TAACATATAC AGCCACACCA GTTGCCGGCA CGGTTACAAC TCCAACCTAT CAGTGGTATA ATGCTGCAAC AGATACCCCG ATCAGTGGAG CAACCGATGC GGTATATACG CCTGCTGCGA AACCTGTAAA CGGTGATAAG GTATATGTAC AGATGCATTC AGCTGCAACA TGTGTAGCAC ATCCGGATGT AACGTCTGCG ACCCTAACCG CTCACATCCT TACAACGCCT GCACCTGTTA TGCTGTCAAG GGACACAGCT ATATGCATGC CGCAGGAATA TAAACTTTCA GCTAAACTTG CTTCGGGAAC TCAGATACAA TGGTACAAAA ACGGCACACC AATAACGGGT AGCAGCCATG CTGCTTATAC GATTCATGCA GATGAATTAT CGGGCGGTTC GTATCATGTA TCTGAAAGCA ATGGCGCCTG TACCATTCAT GCAGATGCTG TAAACATTGA ACTGATGCAT ACACCATTAT TATATACAGA AAATGAGTTA TATGTTTCCA AAGGAGAACG TGTTACATTA AATGTACAGG CAGAGCACGC AGCATATCAT CATTGGAGTC CGGCTATTGG ATTGAGTAAT CCGGATCAGC TGATAACAGA ACATCTTGCA ACCAACTCTG TTACTTACAC ACTTCATGCA ACAAATCAGT TAAATAAATG TCCGGTAAGT ACTGAAGTAA CGATACGTGT GAAAGCAGAG GTAATTATAC CCAATGTAAT TACAGTGAAT GGCGATGGTG TAAACGATAC CTGGGAAATT GAAAACATAG AAAACTTTCC GCAGGCCTCC ATTGAAATAT TCAACCGCTG GGGAAATATG GTCTGGAAAA GTATCGGTTA CAGCAATCAA TGGAACGGCA GCAGCAGAAA TGGTGAGCCA CTACCCGCAG GCACATATTA TTATATCGTT AATCTCAACA GCAAACAACA AACGGATGCG TATACAGGCT ATATTCAACT AGTAGAATAA
|
Protein sequence | MKSLFVSITF FCFSILNLSA QTITFPDANF QNALLTHWPA IDINNDGLIQ ISEIENLTSL YVAGKSISDL SGLEAFPKLV SLNCSNNSLS HIDLSHNPEL KFLELGWNSI SNIDLSKSTK LQVLGLQDNG LTSIDVTSNK DLRELKVEYN ALTQLDVSEN RFIWYLNFSD NQISTINLNP IRSLSVLSAG NNLLTSLNLS CHTNLSYVTI DNNSLLNSVC LNSSDYAERM STTDPNSAYW TKSSNTAWAN CPSLPLVTPI PATIVPINLC ETGDMTFNAV SEQFISNPTY RWQKANSAAG PWVSIPGAIN QSLVLTNLQM SDNGSIYKVL ISSTDYCGTG HMEEAAGQII LHPAVYPKVT ASASVSGAIC DNALPITYTA TPVAGQGSAP AYQWYYYNNP GFTAIAGATD IQYTPAVLPP DGTQLFAGMF TDEMCAVESG VASNILTVAV VPAPEPVITT SDIALCNPSG YVIQTSHTAA TGTQFQWYRN GIALAGSNVR NLTIPGAGTY YMTEDNGTCT TASGTVTIST SSTTPDPQVN VSSSLTGAAC DTLQKITYTA TAVAGTVTAP TYQWYNAVTD TPISGATDAV YTPAAKPMNG DQVYVQIHTN EACVANPDVK SITLTTQILT TPAPLITSGN TALCNPSGYV IQTSQTAATG TQFQWFRNGS ALTGSNTPTL TIASEGTYYL VEDNGTCKIS SGSVTISAIA AMPDPQVNIS SSLTGTACDT LQKITYTATP VAGTVTTPTY QWYNAATDTP ISGATDAVYT PAAKPVNGDK VYVQMHSAAT CVAHPDVTSA TLTAHILTTP APVMLSRDTA ICMPQEYKLS AKLASGTQIQ WYKNGTPITG SSHAAYTIHA DELSGGSYHV SESNGACTIH ADAVNIELMH TPLLYTENEL YVSKGERVTL NVQAEHAAYH HWSPAIGLSN PDQLITEHLA TNSVTYTLHA TNQLNKCPVS TEVTIRVKAE VIIPNVITVN GDGVNDTWEI ENIENFPQAS IEIFNRWGNM VWKSIGYSNQ WNGSSRNGEP LPAGTYYYIV NLNSKQQTDA YTGYIQLVE
|
| |