Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_0838 |
Symbol | |
ID | 4185742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | + |
Start bp | 951540 |
End bp | 954680 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638070842 |
Product | hypothetical protein |
Protein accession | YP_677463 |
Protein GI | 110637256 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACCT TCTTTCATCT TCAACCAGTA AATGTATTCC TTAAAAGAGT TGTTTTTACA ATCTTCTTTT TAAATACATG TTTTTTGGCT GTGGCAGAGT TAAATTCAAC TTCCAGCCAG ACAATATCGA CCTCACAAGC ATATACAGGT GCCAGCAAAA TCGCAAACGG CAGTACGCTA ACTATTTCGG GGGGGACAAC AACGTTTAAT GGTACTTTAA CCGTAGGTAA CGGTGCTGCA GGTACATTAA TTGTAAAAAG CGGAGCTATC GTGATTGTAA ACGGTACCTT ATCTATTGGT ACATCCAGCA ATGGTGCTGT AGTGATTGAA AACGGCGGTC AATTGATTGT TAATCCGGGC TCAGCAAGCG GGACAGCCGC AATCAATATC AATAGTTCAG CAGCAACGGC TCTGGATATT CAGACAGGTG GCTCCGTGAT TGTAAAAGCA GCCGCATCAG GCAACAACAT TTTAGGCATG TATCAGACGG CGGCATCGAA TGTAACTGTT GCCGGTGCTC TTCAGATCAG AGGAGGCGGG TTTAAAATGG ATGGTACTTC AAAATTAACG TATTCAGGTT CGGGTAACGA TACCATTATA GGTACACCTA ATACTTCGGG GGCATATTTC TCAAATTCAA CAAAGTTAAC GGTTGGCTCA AATGTTAACT TATATATAAG CGGTGATGTT AAAAATGATT CGAATGCAGA GTTTGCCATT AATGGGAATG TGTCTATTAA CGGTAATTAT CAGTCAGGAA ACAATACAGC AAATGTTACC GGTACGGGTA CATTAACAAC AACGGGTTCA TTAAATTCCG ATAGTTACCA GGGTTCTGTT TTTGGAGAAA GATACAGCTG TCCGACAGGT CCTTGTGGGG GCAGTAACGT CATTACTTCA AGTAATATGA CCACCTGTAA CGGAGGCAGT GTTATAATAA CCGGGCCAGC TATTGCCGGA GCTGCTTATA AGTGGCTGGT AAGTGTTACC AGTGCTTCAA GCGGCTTTAC AAATGCCCCG GGTGCAAATA CCGGACAGAA TTATTCAACT ACCGGTTCAC CGGCTGCTCA ATATACCTAC TACAAAAGAC AATACACATT AAATGGTGTT ACAATAAACA GCAATACATT GACCATAACA TCCTCATTCT GGGTAACAGC TTCAACAACT CCGGCTATTG GAGGTGCAAG CACGGTATGT ACAGGTTCTG CTATAACACT TACAAATTCA TTAAGCGGCG GCCGATGGAC AAGTGCTTCT CCCAATATTG CAACAGTATC TTCTTCCGGT GTGGTAACAG GTGTTTCTGC CGGGACTTCT GTAATAACCT ATTCAGGCAA CGGATGTTAT ACACCTTCCA ATAAGACAAT AACGGTATCA TCCTGTGTGA TCACCTGGAC GGGGGCAACA AACAGTGCCT GGACAACCAT GTCAAACTGG AGTTCCGGTT CTGTTCCTAC AGCATCAGAT AGTATTTTAA TTCCGGCAGG TGCACCTAAT TACCCTCAAA TTTCAACAAG TGTATCCAGT TATAAAACAA CGGTTAATTC CGGAGCTTCA TTAACAATTA CTTCATCTGG TGTATTAAAT GCATATGGGA ATATTATTAA TAATGGTACA GTTACAACGA ATGCCGGTTC AACGGTAGCG TTTAAAGGAA GCAGTGCACA GACAATAAGC GGTGTTCCGG TATTGTACAA TGTACAGGTT GCAAATACTT CTGGTGGTGT AGCCTTATCT TCAGCGGTTA CATTAAAAGG AGCATTAACG TTAACAAGCG GTGTATTAAC AACCAACGCT AATCTTACAG TGAATTTTGA TAATGGTGGT AACATAGCCT ACAATTCAAC AGATGCGGGA AGTATCAGCG GTAATGTAAC CGGCCGCCGT GATGTGGTAG CACGTACACA TTATATCAGT GCGCCTTTTA ACGGTGTCAC ATCTGCACAG GTTGGAGCAA CCACACCATT GTACTATAAC AACTACTGGA AAATGTATGC AAAGAATTTT GCTACACAAG GGTGGACAGC TGTAACAGAT GTGACTACAG CAATGCCTCT GGGTACTGGT TTCTCTATTG CTTTACCGAA TGCCGCTCCA TTGATCTTTA CCGGAACGTA TAATCATAGC TATACATTAC CGGCAACAAC TTACTCCAAT GCAGCAGCAG GTAAATATAT CTTAATCGGA AATCCATATC CATCAGCGCT GGATTGGACA AGTGCGGCAG GCTGGACAAA AACAAATGTG GCCAACGCTA TTTATTACTG GGATGCAGCA AGCAGTAAAG TTTCTTCATA CGTTGCAGGT GTAAGCACAA ATGGCGGTAC ACAGTATATA CCGGCAATGC AATCATTCAT GGTGTCTACA ACAGGTACCG GAGGTAATTC AAGTGTTGCG ATCAATAATG CAGCGCGTGT TAGTTTACAG AATCCTTCGT TCTTACGTAA CGGTTCGGAT GAAACTGTCC GAATCAAAAT AACAGCAGCA AATGCAGAAC AGTGGGATGA TGCTGTTGTA CGCTTTAATG AGATGGCAAC AAATTCATTT GATGATGACT GGGATGCGTA TAAGTTATTA AGCCGTGGTC CGAGCCCTTC TGTTTATACG GTACTTGGTG AAGATATTTA TTCTGTTAAC TCTGTAGCGC ATCCTTCAGC ATTGCCAATC ATTGATCTGG CTGTATATAT TCCGGCAGAT GGTAACTACA GCTTAACGAT TACTAACAGT GACCCTGCTA CAGACTATGT ATTAATAGAT AAAAAATTAG GAACAGAAAA TCTTTTATCC GGATCCGATT ATATGTTTAG CGGACTGCGT ACGGATGATG GAAACCGTTT CCAGCTTCAG TTGCGTATTT CAGAGAATAA CATTACTACA GGTACTACAG CTGCACAAAA TAACAAAGGG CTTCAGATTC ATTCAACAGA TAAAGGCTTT GTGGTTCAGA CAGATGTTTT TGGAGGCAGC AAAGCTACGA TTGAAGTTCT GGATATGAGC GGAAAATCTG TAGCTGCTAT GGAAAACACA CTCGCAGCAG GTGCAACATT TATTTCTTCA GATCTTTCAG CAGGTGCTTA TCTGGTTAAA GTAACGGTTG ATGGAAATAC GTTTGCGGGA ATGATCAGTT TACTAAAATA A
|
Protein sequence | MNTFFHLQPV NVFLKRVVFT IFFLNTCFLA VAELNSTSSQ TISTSQAYTG ASKIANGSTL TISGGTTTFN GTLTVGNGAA GTLIVKSGAI VIVNGTLSIG TSSNGAVVIE NGGQLIVNPG SASGTAAINI NSSAATALDI QTGGSVIVKA AASGNNILGM YQTAASNVTV AGALQIRGGG FKMDGTSKLT YSGSGNDTII GTPNTSGAYF SNSTKLTVGS NVNLYISGDV KNDSNAEFAI NGNVSINGNY QSGNNTANVT GTGTLTTTGS LNSDSYQGSV FGERYSCPTG PCGGSNVITS SNMTTCNGGS VIITGPAIAG AAYKWLVSVT SASSGFTNAP GANTGQNYST TGSPAAQYTY YKRQYTLNGV TINSNTLTIT SSFWVTASTT PAIGGASTVC TGSAITLTNS LSGGRWTSAS PNIATVSSSG VVTGVSAGTS VITYSGNGCY TPSNKTITVS SCVITWTGAT NSAWTTMSNW SSGSVPTASD SILIPAGAPN YPQISTSVSS YKTTVNSGAS LTITSSGVLN AYGNIINNGT VTTNAGSTVA FKGSSAQTIS GVPVLYNVQV ANTSGGVALS SAVTLKGALT LTSGVLTTNA NLTVNFDNGG NIAYNSTDAG SISGNVTGRR DVVARTHYIS APFNGVTSAQ VGATTPLYYN NYWKMYAKNF ATQGWTAVTD VTTAMPLGTG FSIALPNAAP LIFTGTYNHS YTLPATTYSN AAAGKYILIG NPYPSALDWT SAAGWTKTNV ANAIYYWDAA SSKVSSYVAG VSTNGGTQYI PAMQSFMVST TGTGGNSSVA INNAARVSLQ NPSFLRNGSD ETVRIKITAA NAEQWDDAVV RFNEMATNSF DDDWDAYKLL SRGPSPSVYT VLGEDIYSVN SVAHPSALPI IDLAVYIPAD GNYSLTITNS DPATDYVLID KKLGTENLLS GSDYMFSGLR TDDGNRFQLQ LRISENNITT GTTAAQNNKG LQIHSTDKGF VVQTDVFGGS KATIEVLDMS GKSVAAMENT LAAGATFISS DLSAGAYLVK VTVDGNTFAG MISLLK
|
| |