Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_0873 |
Symbol | |
ID | 4185227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | - |
Start bp | 997088 |
End bp | 1000249 |
Gene Length | 3162 bp |
Protein Length | 1053 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638070875 |
Product | CHU large protein |
Protein accession | YP_677496 |
Protein GI | 110637289 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.539179 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATATT TGTACTCAGC AAAAAAACTA TTGTTATTAT TTTTACTTAC CTCCATCTGC TCATTAGGTT ATGGACAGAC AATAGGTACG TACAATTTTA ATTCAGGAAC ATGTTCTACA CAAACAGGCA CCGCGTCAGT TTCAAATGTA ACATTGAATG CTACTACTAC AGGTGCAGGT CTTACCTGCA CAATTGGATC AGGAGCAATT ACATTAACGT CCACCTCTAA CTGGCCAGCG TCGTTAACCT ATCCTGCAAA CAGTAATGCT TACTTAGAAT TTTCAGTAAC ACCTGCCGTT GGATATGAAG TAAATATCAG TCAGGTCATT GTAAAAGCAG CCAGGGGAAA TGGTGGCGCA AAAAATCTTA CTGTAGCATA TGACAACGGA TCCGGATATT CTACTGCGAC AAGTGCGTCT ATAGCTCCTG CAACAGTAAC TACTTCATCA CTTGCTTTTA CGCTTGATAT TCCGGATGTC TCGTCTACCT CCACTGTTAC ATTCCGTTTG TATGGATATA CAGGTGCAGT AACTTCTCCG AAATCCTTAA TCACAGATTA CATTCAAATC GACGGAAATG TTGCCATTGC CAGTCCAACG ATGCAATCTT CTATAGCTGT TACTTCAGCT ACAATCAATT CTGCAACACT TAGCTTTACG GGCGGAAATG GTTCTCAACG TTTAGTACTT GCTCAGGAAG CAAGCCCGAT CAGCGCTGCA CCGACTGATC TTACCTCCTA CAATGCCTCA AGTGTTTTCG GAGCATCCTC AACTCCTATT GGAGGTGCAG CTTTCCCTGT TTACATCGGT TCTGGAAATG CGGTAACAGT TACCGGCTTA AATCCATCTA CAACGTATTA TTTCTCCGTG TTTGAATTCA ACAAACTGGC TGGTACAAAT ACTGAAAACT ATTTACTGCC TGGCGGTTCA ACATCCGTCA AAACCAATAA AGGTATTTAT ACCTGGTCAG CAGGAAGTTC CGGATCATGG GCAAATCCTG CAAGCTGGAC ACCTAACAGA AATTCACCGG CCTTTGACGG TTCGGATTCG TTGATCTTTA ACAGCGGCGG TACTATTACG GTGACAAATG TTACTGATCA GGATGTATTT AGCGGACTTT CTATTTTTAA TAATACACAT TTAATATTAA GTGCATCTGC ACCAACAACT TTATATGCTA TAGATTTTTC CGGATATGAA ATCGCTATTG AATCCGGCTC TACTCTTGAG ATCAACAGCA CAAACGATTT CTTCCTATAT CTGGATTATA ATTCCAGCCT TACTTCAGAA GGCACATTAA TTCTTGCTAA AACAAACAAT ACCATCATCG GTACAGGAGA TATAACAATC AATGGAACAA TTTCGCTCGT ACATCCTGAT GGACTTTACG GTACATCAGG ATCCAATGCC ATTGATGCAG GAGCAACATC CCTTACGCTG GGCGCTGCAA GTACTGTAAA CTATGCCGGT GCAGCACAGA CAATCACTAC TGCCAATCCT ACCCCATATG CAAACCTTAC ATTAAGCGGT TCTGGCACTA AAACACCTGA TGGCAACCTG GATGTACACA ACCTTACCCT GAGCGGTACT TCGGTATTAG CACTTGCTGC TTATGAGTTA AGTGTAGCCG GTAACTGGAC CAGCTACTCT GCTGCAGCAT TTACGGAAAC AGGCAAAACC GTAACATTCA ACGGTACCGC AGCACAGACT CTTTCAACTA CAGGAGGTGA AATATTTGAA GGGCTAAATC TCAACAACTC CTCTCTTACA TTATCTTCTC CGCTGCGCGT AAACGGACTG GTAACACTTA CCGCCGGTAC ACTTACCGCC GGCGGAAATC TTACCCTGAA CCTGGCAACA GCTAAAATCG ACGCTGCTGG TTCTGGTTCG ATCAGCGGAA ACATGAAAGT GATCCGTAAC TTTGTACCGG CCAAAACGCA TTATATTGCC TCTCCCTTAG CTGGTGTTAC CGCTGCTGAC ATCGCCGATG AATATCCTGT AGTATCAGGA AGCCAATCCC GTTTACTTGA TCTGGACTGC TCTACAAACA AATTCTTTGG TATTTATGAC ATGAGTACTC CGCTGGCACA GGGGGATGCG CTTTCTATGT TCTTCCCTGC TTCAGGAACG CCTGCAAACG GAGTGAACAC CGTTACATTC ACGGGTACTT ATAATCATGC TGCTGCAAGC TATTCTTCTT CTTGCCCGGT AAATACTTCT GTAAAAGACT TCTTTGCAGG TAACCCTTAT CCGTCTAACT TATTCTTAGG TGGAATTACC GGATCAGGTA CTTCAGGTAA TTACTATGTG TTTAACAACA ATACATTCAA TGTATATTCT TCTGTAACAG GAATCGGTAC TGGTGTAATT GTGAATGGAT ATGTGGCACC CATGCAGGGC TTCCAGGTGG AAACTGACGG AACGGGCGGA ACCGCTTCGG TAACAATCCC TTCAAGTGCC AGAGATGTTG CACAGACTAC TTCCTATGCC CGTACCGCTG TGGCTGATAA CATCATCCGC TTACAGGCGT CAAACGGTAC ATTTACCGAT GAAACCGTTA TCCTGTTTAC TGATGCTGCT ACAAATGGTT TTGATCTGGG ATATGATGCC GGTAAAATGC GCAATACTGG AACGAACACT CCTAACCTGT ATACATTAGC CGATACAACA AAACTGATCA TTAACAGCCT GGCGCCCCTT ACAAATACCG TTGATATTCC CTTGAACATC ACTACTACCA CTACTTCTAA CTATACCCTG TCTTTCACAA ACAAAGACGC TTTTGATACT TTCAAATCGG TATACCTGAT CTTCCCGGAT GGAGCACTGC ACAACTTAAC ACAGAACCCT GCCGTAACCG TGGCAACAAA CCCGGCAACA CCATACATAC TACGTGTAGG CACTGAAAAC ATTACTACTT CAGCTTCTAA AGCAAAAACA AATAATGCTT TAAAAGCTTA CATGAATGAA GATATGCTGG TTGTTACAAT GAACAGCAAC ATTCAGCAGG TGGAAGTGTA TGACCTAACA GGAAACAGAA TCGCGGAAGG AACAACGGAT GCGGGTTCCT TCACTACTAA ATTATCTGCT AAAGCGGGTA TTTATGTGGT AAAGGTACTT TCTGAAAATA ACCTGTACAC AACTAAAATA GCAGTTAAAT AA
|
Protein sequence | MKYLYSAKKL LLLFLLTSIC SLGYGQTIGT YNFNSGTCST QTGTASVSNV TLNATTTGAG LTCTIGSGAI TLTSTSNWPA SLTYPANSNA YLEFSVTPAV GYEVNISQVI VKAARGNGGA KNLTVAYDNG SGYSTATSAS IAPATVTTSS LAFTLDIPDV SSTSTVTFRL YGYTGAVTSP KSLITDYIQI DGNVAIASPT MQSSIAVTSA TINSATLSFT GGNGSQRLVL AQEASPISAA PTDLTSYNAS SVFGASSTPI GGAAFPVYIG SGNAVTVTGL NPSTTYYFSV FEFNKLAGTN TENYLLPGGS TSVKTNKGIY TWSAGSSGSW ANPASWTPNR NSPAFDGSDS LIFNSGGTIT VTNVTDQDVF SGLSIFNNTH LILSASAPTT LYAIDFSGYE IAIESGSTLE INSTNDFFLY LDYNSSLTSE GTLILAKTNN TIIGTGDITI NGTISLVHPD GLYGTSGSNA IDAGATSLTL GAASTVNYAG AAQTITTANP TPYANLTLSG SGTKTPDGNL DVHNLTLSGT SVLALAAYEL SVAGNWTSYS AAAFTETGKT VTFNGTAAQT LSTTGGEIFE GLNLNNSSLT LSSPLRVNGL VTLTAGTLTA GGNLTLNLAT AKIDAAGSGS ISGNMKVIRN FVPAKTHYIA SPLAGVTAAD IADEYPVVSG SQSRLLDLDC STNKFFGIYD MSTPLAQGDA LSMFFPASGT PANGVNTVTF TGTYNHAAAS YSSSCPVNTS VKDFFAGNPY PSNLFLGGIT GSGTSGNYYV FNNNTFNVYS SVTGIGTGVI VNGYVAPMQG FQVETDGTGG TASVTIPSSA RDVAQTTSYA RTAVADNIIR LQASNGTFTD ETVILFTDAA TNGFDLGYDA GKMRNTGTNT PNLYTLADTT KLIINSLAPL TNTVDIPLNI TTTTTSNYTL SFTNKDAFDT FKSVYLIFPD GALHNLTQNP AVTVATNPAT PYILRVGTEN ITTSASKAKT NNALKAYMNE DMLVVTMNSN IQQVEVYDLT GNRIAEGTTD AGSFTTKLSA KAGIYVVKVL SENNLYTTKI AVK
|
| |