Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_2139 |
Symbol | |
ID | 4187063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | + |
Start bp | 2483868 |
End bp | 2485934 |
Gene Length | 2067 bp |
Protein Length | 688 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 638072139 |
Product | CHU large protein, SAP or adhesin AidA-related |
Protein accession | YP_678744 |
Protein GI | 110638535 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.534536 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGAT TCTTAGGCTT ATCTGTTCTT ATATATATGC TGCTTTCGGG TATGATGGCA GCCGCACAAT GTCCGTCAGA TCCTACTGCA TCCTGCACGA GCACGATAAG CGCATCTTCT GTTACAAACG TAACGGTTGC TGCGGGGCAG GTTGTGTGTG TTACCGGCGG CATTCATACC GGTACATTTA ACGTTCAGAA CGGCGGTATC CTATTGGTGT CCGGTGGTAC AATCGGCCCA CAGGTTGCTG TACAAAATGG CGGTAACCTT GTTGTTTCAG GAGGTATATT AACAGGGAGC GCCAGTTTGC CTGCGGGGGC AAGTATGTTC ATAAAAAATA ATCCGGCATT AAACGGTACA CTTGCTATGT CTGGCGGTAC CTTAAACGTA TTAAGCGGCG GTACACTTGC TAAAGACATT AGTGCTACGG CTGCAAGTAC CATAAATAAC TGCGGTACAC TTTCCGGCAC ACGTAACTTC AATACAAATA TCACACTAAA CAATTATTCA GCCAGCACTA TAACATTAAG TACCGGCGGC AATACATTGA ATAATTATGC CGATAACGTG ACTATTGCTT ATAACAACGT AAATACAACA AATACATTCA ACAACTACGG TACAGGCGTG AACTTCAGTA TAACAGGCGC ATGGAACAGC GGGATGACCT TTAACAATGC TGCCGGTGCA GCCTTAACGG TTACTTCCGT ACCAGGTGGA GCGATGCCTT CTGCTACGGT GTTCAATAAT GCAGGAACAC TTACCTATTC GCCGGTTTTA AATACAAATG GCGCAACCTT CAACAATGCT GCAACAGGTA CATTTAATCT CTCAAGTTCC GCATCTGCAA ACAGGCCTAA TATTACCAAT GATGGTACAA TGAATGTTTC AGGTACCTTT TATTTAGCCG GCAATACAAC AACCAACAAT GGCACCATGA CCTTTTCTGA TGAGCTGCGT CTGGACGGCG GCACATTGAA TCTTGGTGCT AATTCAACTA CCACAACAAA AACGTTATAT AAAAACAACG GCAGCATTAA CATGTATGAC CATAGTGTTT TAAATATTGT ACAAAATGTT ACTACATGGA ACGGCACTGC TATTCACCTG GTATCGGGTT GTGCTTCGGT ATTGGGAAGT ACTACGCCAA GTACAACAAA TATAAACGCT ACTTTCCTGG ATAATGCAAA TATCAATTTC TGCGGTGCTC CTCCTGCGCA GTCGCCATCA TATATTGCAA TCACGTCGGT AACCAATAGT GCATCCAGTC CGGGCAGATA CAGAATTGCC TTATCCTCCG GGCCTGCAAC AAACGGTTAT GTTCAGATCT CCGGTGTAAC AGGTGTTTCA GATTTGAATG GTTACTGGCA GGTGATCAAT AACGGAAACG GTACATGGGA CCTTATCGGC AGCACATACA CAGCAGGCGC AGTGATCAGC GGCAGCCAGG TAATCGTGGA TCAAACCAAA TTGAAATTAG GTCCAGGCTA TTACTTAGGC TATTCCGGCT GCAGCAATCC ATGTGCTCCA CTGCCGATTA CGTTACTCTC ATTTACCGCA GAGAAAGAAG ACGCGCATGT GGTTATTGAA TGGGTAACAC TGCAGGAAAA AAATAATCAG TCGTATACAG TAGAAAGATC TGCAGACGGC ATTCATTTTG AATCTGTTGT TTCGCTTGCA GGAAATAAAA ACAGTTCACA AAAAATGACC TACACGCAGT ATGATTTCAG CCCGTTACCA GGCGTAACGT ACTACAGATT AAAACAGACA GACATGGATC AAACACATTC GTATTCAAGT GTTGTTGCAG TAGATTCCAA TTCAGAAATA GATTGGACCA TCTACCCGAA CCCAAGTACA ACGGGCGATT TCACGATCCT TTCTGCCTTT GCCGATAATG AAATTGTTGC TGTTACCGTT ACAGACATGA CTGGCAATAC GGTTAGATCA TATGATGAAA GCTCGTACGA ACAGCAAATG GAGATCAGCA ACCTGGGTAT GGGTTTGTAT GTGGTGAGTA TTCAAACAGT AACAGGCCAG AAATCTAAAA AAGTAATTGT TCAGTAA
|
Protein sequence | MKRFLGLSVL IYMLLSGMMA AAQCPSDPTA SCTSTISASS VTNVTVAAGQ VVCVTGGIHT GTFNVQNGGI LLVSGGTIGP QVAVQNGGNL VVSGGILTGS ASLPAGASMF IKNNPALNGT LAMSGGTLNV LSGGTLAKDI SATAASTINN CGTLSGTRNF NTNITLNNYS ASTITLSTGG NTLNNYADNV TIAYNNVNTT NTFNNYGTGV NFSITGAWNS GMTFNNAAGA ALTVTSVPGG AMPSATVFNN AGTLTYSPVL NTNGATFNNA ATGTFNLSSS ASANRPNITN DGTMNVSGTF YLAGNTTTNN GTMTFSDELR LDGGTLNLGA NSTTTTKTLY KNNGSINMYD HSVLNIVQNV TTWNGTAIHL VSGCASVLGS TTPSTTNINA TFLDNANINF CGAPPAQSPS YIAITSVTNS ASSPGRYRIA LSSGPATNGY VQISGVTGVS DLNGYWQVIN NGNGTWDLIG STYTAGAVIS GSQVIVDQTK LKLGPGYYLG YSGCSNPCAP LPITLLSFTA EKEDAHVVIE WVTLQEKNNQ SYTVERSADG IHFESVVSLA GNKNSSQKMT YTQYDFSPLP GVTYYRLKQT DMDQTHSYSS VVAVDSNSEI DWTIYPNPST TGDFTILSAF ADNEIVAVTV TDMTGNTVRS YDESSYEQQM EISNLGMGLY VVSIQTVTGQ KSKKVIVQ
|
| |