Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3525 |
Symbol | |
ID | 4072784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4170405 |
End bp | 4173305 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637985548 |
Product | Cna B-type protein |
Protein accession | YP_592600 |
Protein GI | 94970552 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCGCTGG CTTCAGCGAT GGCGCAGACC ACAACAGGTT CACTGCGCGG AAGAGTTACG GACCCATCGG GTGCGGTGGT GCAGAACGCG ACAGTTACGG CGTCCACGGC TGATGGCAAA CAATCCTCTG CAAAGACGAA TGCGCAGGGC GCCTACGAAG TGCATGGACT CGCCCCCGGA TCGTACACCG TAACTATCAC CGCAAAGGGT TTCGCCGACG ACACCGAGTC TGCCGTGAAC GTCACGGCCG GCTTGCCGCA GCAACTCGAT GTTGCGATGC AGATCGAGGT CGAGAAGCAA CAGGTGCAGG TCCAGGAAGA CACCAACGCC GTTGACACGT CCTCGACGAA CAACGCTTCC GCGCTCGTTC TCAAGGGGAA AGATCTCGAG GCACTCTCCG ACGATCCGGA CGAACTCCAG AGTGAGCTGG AAGCTCTCGC CGGGCCCGCA GCCGGACCGA ACGGCGGGCA GATGTACATT GACGGTTTCA CTGGCGGCCA GTTGCCTCCC AAGGCTTCCA TCCGCGAAAT CCGCATCAAC CAGAACCCGT TCTCGGCGGA GTACGACAAG CTCGGTTACG GTCGCATTGA GATCTTCACC AAGCCCGGCC TCGACAAGTT CCACGGCCAA GTCATGGTGA TGGGCAATGA CGACGCCTTT AACGCGACGA ACCCATACGC AACGCAGAAT GGCACGCCGA TCCCGCCTTA CCACAGCGAG CAGTACTCGT TTAATATCGG CGGCCCAGTC ATTTCGAAGA AGGCATCGTT CTCGTTTAAC TTCGAGCGGC GCAACATCAC CGACCAGTCA ATCGTATTTG CGCAGACGCT CGATCCCACG ACCTACGATC CCCTGAACGT GAACGCAGCG GTCAATACGC CCCGCACCCG CACCAATATC AGCCCGCGCT TCGACTACCA GGTTTCTCAG AACAACACCC TTACTCTGCG CTACCAGTAC TGGCTCGAAG AGGACACAAA CAGCGGCATC GGCGGGTATT CATTGCCGAC GGTTGCCTAC AATTTGCGCA GCCCCGAGCA CACCTTCCAG ATCAGCGATA CGCAGGTGTT GAACTCGCAC GTCATCAACG AGACGCGGTT CCAATACGTG CGCGATCTCA CTGAACAAAC ACCGCTCAAC ACCATTCCCC TGATCAATGT GCAGGGTGCA TTCACTGATG GCGGCAATTC CAGCGGCTAT TACAACGATC ACCAGGACCG CTACGAGTTT CAGAACTACA CCTCGTGGGT CCACGGCAAG CACATGTTCA AGTTCGGAGG ACGCCTTCGC GCTACGCGTG AAGCCAGTTC GGTGAATTCG AACTTCAACG GGACGTACAC GTTCTCGTCG CTGCAGTCAT ACGCGATCAC GCAATATGGA ATCGATCATG CCGGGCCCAA CGGGCCTGAC TGGGCGGCGA TCCAGGCAAT GTGCACACCG CCTCCGGGAA GCTCGATCAC GCCGCAGCCC GATCAGTGCG GCGGCGCCAG CCAGTACTCG CAGACGTTCG GCACACCCGC CGTGGTTTCG ACTTACTTTG ATACGGGCCT GTACTTCCAG GATGAATACA AGCTGAAGCC GAACTTTACG CTGAGCTACG GCCTGCGCTG GGAGACCCAG AACGCGATCC ACGATCACAG CGATTGGGCG CCGCGCCTCG GAATTGCCTG GGGTTTTGGA AAGAGCGGTA AGACCGTGCT GCGCGCCGGC TATGGCATGT TCTACGACCG CGTCGGGATT GGCTCGATTC TCAATACCGA TCGCCTGAAT GGCGTGCTGC AACAGCAGTA CATTGTGCAG AGCCCGCAGT TCTTCGATTC CACGGCGCCG ATCTGCGATG CGCAAGGCAA TTGCAGTGTT CCAGGCGGGA CTACGGCGCA TGGCACGACC TACGAATTCG CTTCGCATTT GCGTGCGCCC TACACCATGC AGGCGGCGGC GAGCCTTGAG CGGCAACTTT GGAAGAATGC CACCGGTGCG CTGACTTACA TCAACACCCG CGGCGTGCAC CAGATGGTGC AGATCAACGC CAACGCGCCG TACTTCGACG ACTACAACCC AGCCCTCGGA AATATCTATC AATATTTCAG TCAGGGCATC TTCAAACAGA ACCAACTGAT GGCGAACATC AATTGGCGCG CTGGCAGCCG GTACACAATC TTCGGAAACT ACACTTACAG CCAAGCTCAC GGCGACGTGA ACAGCGGTGG CTTTGTTACC GATTCGGCGG ACATCTCCGC GGACTATGGA CGCTCCTCCT TCGATGTCCG GCACCGGATG ATGTTCGGCG GAAGTATGGG CTTGCCGTGG CTCTTCCGCT TCAGTCCTTT CGTGATATGG AATTCCGGCG GACCTTACAA CGTGGTGCTA GGCCAGGATT TCAACCTCGA TTCGATCTAC AACGATCGGC CGGCATTCGC GGGCGCCGTC AACAGCGGCA ACTTGAAGAT GACGCCATTT GGCTTGCTCG ACCTCACGCC GTTACCAAGC GAAACGCTGG TGCCCATCAA CTACGGGCAG GCCCCGGAGC AGTTCACCTT CAACATGCGC TTTGGCAAGA CCTTCGGAGT CGGGCCGAAG CTTGAGAAGA AAGCTGCGAA CGACGCCAAT GCCGGCGGAG GGCAAGGTGG TCCCCCTGGC GGCCACATGC ATGGACCGGG CGGCAATCCG TTCGGTGCTG GTGGCGGCGG CCGCGGTGGC GGCGGCGATG TCTCCGACCG GCGTTACAAC CTGACCTTCA ATGTGCTGGT ACGGAACCTG TTCAACAATG TGAATCGCGC AGCTCCGGTT GGCAATGTGA ACTCGCCGTT CTTCGGACAA TCCACCGCGC TCGCCGGCGG ACCATTCGGC AGCGGCGCGT ATAACCGGCG TATCGATTTC CAGGTGCAGT TCGCGTTCTA A
|
Protein sequence | MSLASAMAQT TTGSLRGRVT DPSGAVVQNA TVTASTADGK QSSAKTNAQG AYEVHGLAPG SYTVTITAKG FADDTESAVN VTAGLPQQLD VAMQIEVEKQ QVQVQEDTNA VDTSSTNNAS ALVLKGKDLE ALSDDPDELQ SELEALAGPA AGPNGGQMYI DGFTGGQLPP KASIREIRIN QNPFSAEYDK LGYGRIEIFT KPGLDKFHGQ VMVMGNDDAF NATNPYATQN GTPIPPYHSE QYSFNIGGPV ISKKASFSFN FERRNITDQS IVFAQTLDPT TYDPLNVNAA VNTPRTRTNI SPRFDYQVSQ NNTLTLRYQY WLEEDTNSGI GGYSLPTVAY NLRSPEHTFQ ISDTQVLNSH VINETRFQYV RDLTEQTPLN TIPLINVQGA FTDGGNSSGY YNDHQDRYEF QNYTSWVHGK HMFKFGGRLR ATREASSVNS NFNGTYTFSS LQSYAITQYG IDHAGPNGPD WAAIQAMCTP PPGSSITPQP DQCGGASQYS QTFGTPAVVS TYFDTGLYFQ DEYKLKPNFT LSYGLRWETQ NAIHDHSDWA PRLGIAWGFG KSGKTVLRAG YGMFYDRVGI GSILNTDRLN GVLQQQYIVQ SPQFFDSTAP ICDAQGNCSV PGGTTAHGTT YEFASHLRAP YTMQAAASLE RQLWKNATGA LTYINTRGVH QMVQINANAP YFDDYNPALG NIYQYFSQGI FKQNQLMANI NWRAGSRYTI FGNYTYSQAH GDVNSGGFVT DSADISADYG RSSFDVRHRM MFGGSMGLPW LFRFSPFVIW NSGGPYNVVL GQDFNLDSIY NDRPAFAGAV NSGNLKMTPF GLLDLTPLPS ETLVPINYGQ APEQFTFNMR FGKTFGVGPK LEKKAANDAN AGGGQGGPPG GHMHGPGGNP FGAGGGGRGG GGDVSDRRYN LTFNVLVRNL FNNVNRAAPV GNVNSPFFGQ STALAGGPFG SGAYNRRIDF QVQFAF
|
| |