Gene Information |
Locus tag | Acid345_0865 |
Symbol | |
ID | 4068959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1076820 |
End bp | 1080416 |
Gene Length | 3597 bp |
Protein Length | 1198 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637982874 |
Product | hypothetical protein |
Protein accession | YP_589944 |
Protein GI | 94967896 |
COG category | |
COG ID | |
TIGRFAM ID | |
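The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes one terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(1076820, 1080416)
print(length_bp)                  # 3597
print(protein_length(length_bp))  # 1198
```

Under these assumptions both results agree with the Gene Length (3597 bp) and Protein Length (1198 aa) fields of the record.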
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.291273 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTCG GCTTGCGAAC ATCTGCACGT CTGGCAATTA CGAAACCGTC CCCGGCCGGA AAGGCCGCTA CGCCCGCGAA AAATGCGACG AACTACTTCT CGGCCGACCA GTCCGTGCAG GAAGCCTCCC AGCCCGAGTA TTCATTCTCA ATCTCGAAGC TGAACCTCAC CGCTCACGGG GACGATCCGA ACCCACGTCC ATCTGCGTTT GCTGGGCATA ACCGTATCTC GAGCGACTCC CTCGAACGGG AGGCGGACGA CATGGCTGCG GCGGTACTAC GCCGACCGCT CAGCTCGTCG CTGCAAACAG CTCGTTCAGA GCCGGTTGTC CAGCGGAAAT GCGAATGCGG CGGCACGTGC GACAAATGTA AGCAAGAAAG GGCCGCCCAG GGATCGATTC AGAAGAAGAG CGACGACAGT CTACGAGAAC AGTCCTTAGA ACAGTCCGGG GTTGACAGGA TTGTAAACAC GCCTGGCGAA CCGTTGGAGC GCGAAACTCG CTTAATGATG GAGGAACGAT TCGGCTGGGG ATTCGAAACC GTTCGGATAC ACGCCGACGC CGCAGCGGCC CGCTCCGCGA GGGCTCTTGC GGCGAAAGCT TACACGCTGG GCCAGCACAT CGTGTTTTCA GCAGGTCGCT ATTCACCTGG CACAGCCGAG GGCCGATCTC TTCTCGCCCA CGAACTCGTC CATACCCTCC AACAACGCAA TATCAATTCG TTCCCGTCCC AACTGCCTAT GAGCTCACGG GGTTCACCGG CCGAAGTGAA TGCCCGGCGT ACCGCCGAAG AAACAGCGCG CGGTGACCAG AGAGGGCGAA GCCATGCTTC GGTGATGGTG CCTCGTACCT CCGCCCAGAT CGCACGCGAT GAAGATATGC CTGAGTTGAC CGGCGACGAG ACCGTCGTCT TGATGAAACT CAGATTTGCG AAGGACCTCT TGCGCCAGAC GTACAGTCGC GAAGTTCTCT CAGGAATGAT CGACAAAAAA ATGTTGATTG ACGCGAAGAC TCCGTCCAGC AACCGTCTGT ATGTCGTCTT CCACCCCTGG CGCGACGACG CTCCTGACAA ACGCGACGAA AGTATCTTCG AACTCAATGA CCTCAAGATC AAGCCGACGA CAGCAAGAAC CATGATAAAG GTCGACGGCA GGCAGGAGAT CGGTGACATT GCTCTAGAGC TTTTTGCCAA TTACAAGAAA ACCACGGGCG GGAGCGATAC AGACGTCTCT GACAAGTACA TCGGCCAGGA AGACAAGTCC AAGAGCCGCG AAGAGCACGT CAAGCAAATG GTGGATTCGG TTTTGTGGAA GCAATACGAA CTCATCAATA AGAAGCAGAG CTTCGAACAG ATCGGTTACA AAGTCACCAC CTCCGTCCCG CACGAAAAGG AATACGACGA TGCTTTTCAC AACGTCTTTA TGGCGGAGAT CGCGAAGGGC AAACTTCCGG CGGATGCTAT AGGCCCTGCA AAAGACGCTG CGATAAGTGC TTTGAATGCG GCGATTGCGA AGGAAGGTTA CGCGAAGCAG TTCGGCGAGG CTTGGGACAA AGCTCAGACC GAATGGGAGC GCAAGAACCC GAAAGAGGCT GAACAGGTTG ACACAAAATT CAAGACACAC CGCGCGATGG ATGCGCTGGT TGTGCTTGTG AAGGCCGTCG ATCCCAATGG ACCTGCGGCC GACAGCCTCA TCGATCTCAT CGACGCGTCG ATGGAACTTT TCGATACGGT GGACATGTTT ACGTTCATCG GCCGTTGCGG AGAGTCGCTA GAGAATCTGA TCGGTCCATT CGGATTCCGC 
GCTTTGCAAG GCACTGGAGG TCTCGCCAGC CTGCAGTGGG ACCGCGTTGA CTATGCCTTA ATGAATTTCC GCCGTCTCTC ACCCAAAGCA CGCTCGGCAT TGCGTCGCGA TCCAATGGCG GGCGATCAGT ACGGCATCAA ATACGACCTT ACCGAAGGGG AGCACGAAGC GGAAGATCGC ATTGTGGCCA TTCGCACGCT CCCGGACGGT TCCACCCACC AAGGCACCCT TGGGGAATTC AAAGTTGCTC TCGAGAACAA CCGCATCAAC CACATCCTCG CGCAACTAGA CCAGATCGCA AATGCGGGTC CCGGCTCGTT GGTTGGGCGC ATCATCGGCG GCGAGAAGGG TGCCGAAATC GGATCGATGG TGGATGCCGG CCTGATGGTG GCGGCACCCG TCAAGGCTCG CATGGACCAG CGTCGGCAGA TGGCGCAAAT GACCGATGAG CGTCTTTCTC CTCGTCTCGA ACCGGAAGCT CTTCGGCAAC GCGCGACGAA GCAAGATCCG ATGCCTAAGC CGGATCCGCC AAAACTACAG CCGCCGAAAA TCGAGCAGAC CAAACCGCCG CAGCCCCCGA AAATTGACCA GACTACAAAG CCTCCGCAGG CCCCAAAGGT TGATCAAACT GCGAAAGCTT CGCAGACGCC CAAAGTCGAG CCAAAAAAAG CCAACCCGCC ACAACCTCCG CCAACTGCAG ACACCAAAAA GGCAGAAAAA AAGACCACCG CTCCGAAGCA GCGCCTGGTT GAAGACGAAA CGAAATACAA GGATGTCAAG AAGGTTCGTG CTGACGAAAA GGTCGAAAAA ACGGATACCG CAGACCAGAA ACGCCGCAGC GATTCTGGAA GCGGCAAGGC AAAAGGTAAA GCGAGCGCCC GCACCACCGC GACCAAAAAG GCGGCGGAGC CGGACGTTGA CGATTCCAGC AAGTCTCAAT CGAAAGGAAA GGCAGCCAAG CAGACCAAGG TTGCGAAGAA GACGAACGCG TCCGATGATT CGACGACGAA GGCAGACGAG ACAACAGCCC CAACGAAGGA CCGAGTAAAG AAAGCACCCG CCAAGGCGAA GAGCGCTCCG AAAATCGATC CCGATGCGGA AGCGAAGAAG GCGGAGGCAG CCAAGCGAGC GGAGAACAAG GCCAAGATCA CAACCGAGCG GAAATCGCAG AGCGATATGA CCGAGTTGCA AATCAAAAGG CTGCGTGGGC TGCTGTTGCA GAACCAGATT GAGATTGGCG AAACGGAACG CAGTTCGGAA CCTAATCAGG AAAAGCGGGA AAAACTTGCC GATCTGCGCG GTGAAGCTGC GAGACTTGGG TCGGAGATCG GGAAGAAGCA GGCTGCCATC GCTCAACTCG CGGCGGATCC CCTTGCCCCC ATTCGAGCCT ATTCGTACTC GGCATCGGCG GAGCGCGCCG TGCTTGTTCG CGCAAAAGGC ATAGACGAGT ACTCAGCCTT CAGCGGTCAG CCCGGCAAAA AGATTCAAGA TCCCAGCATT GACCACCTCA ACTCCATTGA CGAAATGTCC AACTGGGACG GTTTCTGGGA GCTCACGGAA GATTCGCAGA AGAAACTTGT GAGCTATGAA AAAAACTTGT TCTTGATGGA GGAACCACTC AACTCTTCAA AGAGCAACCG CACCCGACTG ACGGACTGGA AGGCAGGCGT TCGCGCGTAC GGGCAGGATG GCGTCAAGGC TATGACACAA CGCAAAGGGG AAGTCGCGAT CGAGATGAAG AAATACATGA CGACACTTCC GACAAAGACT GGCGGCAAAT TACCCCTTCC GAAGTAG
|
Protein sequence | MSVGLRTSAR LAITKPSPAG KAATPAKNAT NYFSADQSVQ EASQPEYSFS ISKLNLTAHG DDPNPRPSAF AGHNRISSDS LEREADDMAA AVLRRPLSSS LQTARSEPVV QRKCECGGTC DKCKQERAAQ GSIQKKSDDS LREQSLEQSG VDRIVNTPGE PLERETRLMM EERFGWGFET VRIHADAAAA RSARALAAKA YTLGQHIVFS AGRYSPGTAE GRSLLAHELV HTLQQRNINS FPSQLPMSSR GSPAEVNARR TAEETARGDQ RGRSHASVMV PRTSAQIARD EDMPELTGDE TVVLMKLRFA KDLLRQTYSR EVLSGMIDKK MLIDAKTPSS NRLYVVFHPW RDDAPDKRDE SIFELNDLKI KPTTARTMIK VDGRQEIGDI ALELFANYKK TTGGSDTDVS DKYIGQEDKS KSREEHVKQM VDSVLWKQYE LINKKQSFEQ IGYKVTTSVP HEKEYDDAFH NVFMAEIAKG KLPADAIGPA KDAAISALNA AIAKEGYAKQ FGEAWDKAQT EWERKNPKEA EQVDTKFKTH RAMDALVVLV KAVDPNGPAA DSLIDLIDAS MELFDTVDMF TFIGRCGESL ENLIGPFGFR ALQGTGGLAS LQWDRVDYAL MNFRRLSPKA RSALRRDPMA GDQYGIKYDL TEGEHEAEDR IVAIRTLPDG STHQGTLGEF KVALENNRIN HILAQLDQIA NAGPGSLVGR IIGGEKGAEI GSMVDAGLMV AAPVKARMDQ RRQMAQMTDE RLSPRLEPEA LRQRATKQDP MPKPDPPKLQ PPKIEQTKPP QPPKIDQTTK PPQAPKVDQT AKASQTPKVE PKKANPPQPP PTADTKKAEK KTTAPKQRLV EDETKYKDVK KVRADEKVEK TDTADQKRRS DSGSGKAKGK ASARTTATKK AAEPDVDDSS KSQSKGKAAK QTKVAKKTNA SDDSTTKADE TTAPTKDRVK KAPAKAKSAP KIDPDAEAKK AEAAKRAENK AKITTERKSQ SDMTELQIKR LRGLLLQNQI EIGETERSSE PNQEKREKLA DLRGEAARLG SEIGKKQAAI AQLAADPLAP IRAYSYSASA ERAVLVRAKG IDEYSAFSGQ PGKKIQDPSI DHLNSIDEMS NWDGFWELTE DSQKKLVSYE KNLFLMEEPL NSSKSNRTRL TDWKAGVRAY GQDGVKAMTQ RKGEVAIEMK KYMTTLPTKT GGKLPLPK
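The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch for turning such a display back into a contiguous string and running basic sanity checks follows. The function names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11; the short input in the usage lines is a made-up example, not the record's sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalize_sequence(display_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(display_text.split()).upper()


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) >= 3 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Hypothetical short input, formatted like the display above.
example = normalize_sequence("ATGAGCGTCG GC\nTAG")
print(example)                           # ATGAGCGTCGGCTAG
print(looks_like_complete_cds(example))  # True
```

Applied to the full gene sequence, the same check would be expected to pass, since the displayed sequence ends in TAG and the stated gene length (3597 bp) is divisible by 3.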