Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0332 |
Symbol | |
ID | 4070094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 360199 |
End bp | 363399 |
Gene Length | 3201 bp |
Protein Length | 1066 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637982335 |
Product | hypothetical protein |
Protein accession | YP_589411 |
Protein GI | 94967363 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.635539 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.188502 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGCT CGAAAGTTGA GCTTCAACTT TGTCTCGCGA TCGTCCTGTG CTTGCTTGCA TGCACTCCTC TGTTCGCCGC TGCGCCAACG TTGAACTCCA TCTCTCCAAA GTCCGCACCG CTGAATACCG CGGTCACGCT CCAGTTGGTC GGAGCGAATT TCGCCTCGAA CTCGCAGGTC TATTTCAACG GCAACGCCGT TCCGACGACC TTTAGCAGCA CTACGGTGCT GCAAGCTTCC GTCCCCGCTG CGAGTGTGGC CACTCCGGGG AATTTTGCAG TGACGGTGAC GACGCCCTCC ATGGGCACCA GCGCGGCGTT GATGTTCACC TCTTATGTCG CGCTGCCCAA CAACAGCATG GCGTATAGCG CCGCGACAGG TCAGTTGTAC GTGTCGGTAC CGAGCACCGC GGGCATGCCC TACGGCAATT CGGTGGTGGC GATCGACCCT GTGACCGGCG CGATCACGAA GTCGATCTTC GTCGGCAGCG AGCCTAACAA GATGGCAGTG AGCGCCGATG GCACCGTGCT CTGGGTAGGA CTCGACGGCA GCTCCGCTGT GCGCCAGGTT AGTCTGACCG CCGGCACCGC CGGGGCAAAG ATTACGCTGG GCTCAAACAC CGGTACGAAT GCACCGCCGG TTGCGCTGTC TCTAGCGGCA CTGCCGGGAT CGCCGAATTC GTTCGTGGTT TCCATGACCG CCCCGTTGGG CGGAACGGTT GTCGCGATTT ACGACAACGC GACGCGCCGC GCGAATACGT GGAGTGCCTC GACTTACTCT GGAAATGCAT TGCAGACGAA TGCAACGACC TCTGAGGTGT ATGTCGGCGG TCCGACCTAC TACCAACCTC TTTCTTATAG CGCGACCGGA TTGAGCGTTC CGAGGCTCGG TTCCTCGGGC AACTTCACCG GCAGCACAGA CGACCTGCAG GTTGTGAATG GCGAAGTCTA TACCGATCTT GGCGCGCTCT ACGACGCGGA GACCGGTGCG CGGAATGGTT CGCTTCTGAA TGGTTCGAAC CTCGCCGCGG GTCCCACTTT CACCGACACG CCGCTCGGCA AGACGTTTGT GTTCGACAGC CCGACCGCGA ACAAGTACAC GCAGGTGCAG GTTTTCACCA CGAGCACTTC GGCGTTGGCT GCAACCTTCC CGTTGAACCT CGCTTCGAAC ACCACCGGCA CGCCGTCGCA CTTGTTACGC TGGGGCACAA ATGGCCTGGC AGTGCGCGAC AATGTGGCGA TCTATGCGTT CCGCTCTGCG CAGGTCACGA ACCTCGCGGG GATCAATGCA GACCTCAGCG TCACCCTCGC GCAGAGTGGC ACGCCGACCA CCGGTAATTC CATCACCTAC ACCGCGACGG TGAAGAACGC CGGACCGGCC ACCTCTACGA ATGTCGCCTT CACGGCCCAG GCTCCTGCGA CAGCGAGCAT CGTTTCCATT ACGCCGACCG TCGGCTCGTG TTCCAAGCTG AACGGCCTGA GCTGCAATCT CGGCAGTCTC GCGACCGGGG CAACCACGAC AGTTACGGTT GTGGCAAAAC AGATGCTCGC CGGGTCCGTG GTGCTCAACG CGCAAGTCTT TGGTTCGGAG AACGACCCCA ACCTGGCAAA CAACCAGGCT TCGACGTCAG TTCTCACCAT CACCGGTAAC CCTTATAACG GGGTTCCGAC GATCACCTCG ATTTCCCCCG CTGCCATTCA AGCGGGCTCG GGCACTACCA TGGTCACGGT TACCGGAACG GGCTTCTCGA CGGCAGCGTC GATCTTGATT GACGGCACTG CACTCGCGAC GACCGTCCTC AGCAGCACGC AAGCCACAGC TATGGTTCCT TCGACCAAGT TGGCGAGCCT GGGTTGGAGC AAGATCAACG TTTCCAATCC AGCGCCGGGC GGCGGAGTTT CCTACGTGCT GCCACTTTCA GTATTCAAAG TGCTGAGCGC AGGTGCGAAT CACATCGTCT ATGAGCCGTT CAGCCGCAAG CTGATTGCCA GCATTGGCGC CGGCGGCAGC GGATTCACTG CGAATTCGGT GACAACGATC ATCCCCGACA CGGCGACGGT TGGCACGACC ATGTTGCTCG GCGCCGCGCC GACCAGCCTG GCGGTCACAT CCGATGGACA GGCTCTGTAC GCGACCTTGC CGAGTGTGCC GAGCGTGGCG CGCTTCAACC TGCTCGCACA GAAGCTCGAC TTCACGTACA CGGTGCCGAA GGGTTCATCC TTCACTGGCA CGATTAACCT CCGCGGCGTT TCTACGCAAC CGGGGAATGT GAACACCGTC GCTCTGGATC TTGGCGCGAG CAACGGCATC GGCATTTACG ACTTCAACTC CACGACGAAG ACGGCTGCGT TGCGTGGAAG CAATACCGGC AACTACACCG GTTCCTGCGT TCGTTATTCC GATTCGACGA ACCTGATGGC GTTTGACTCG GACAGCAACC TGACGTTCAA CCACTTCGCA GTGCCCGCGG CCGGGTTTGC CTATAGCAAT CCGACGCAGT ACAGCACCTG GTCGCTGGCC AGCTTCAATT GCTTCCAGAT GAACGGCGGC TATGCCTTCG CGAACAAGGG CGGAGCGGCC ATCCCGGTAT CCGCGGCGAC GACGGAGGTC GGCGTCTTCA AGCCGATCCC GAACGTCACG ACCTCGACCA TGCAGGTTGT GGCGCCGGAC GTTTCGCTGC ACGTGGTCTT CTATCTGGCG CAGACACATT CACTTTCCAG CACGAGCGCG GTAGACGGCT TGGTGACCTA CAACCAGACG ACGTACATGC CGAACACGAC GATCCCGATG GGCCTGGACC TGATCGAAAA CACGACGTCG TTCGGTGGCG TGGATTTAGT GCGCTGGGGG CAGGATGGCT TGGCCGCGTT GACTAGCACC GGTAAGATCT ACTTGCTGCG CGGCGGCGCC GTGGTTCCGC AACTGCTTTC GACGCGCACG GCGGCAACGC TGACGTCGGC ATCGGTTACG TCGGTGACAC ATGGTTCGGG CAACCTGTTG ATCAGTGTGG TGGGCACGAA CTTCCAGAGC GGTATGGTGC TGACCTGGAA TGGCAACTAT CGCACGACAA ACGTGACGGA CGCGACGCAT GCAACGGTTG CGATTCCGGC ATCGGATTTT GCGAGCATCG GAGCCGGAAC GATTACGGCA GTGAACGCGG GGGCTCCGGC TTCCTCGGGC CTGTCCATCA CCATCAACTA G
|
Protein sequence | MKSSKVELQL CLAIVLCLLA CTPLFAAAPT LNSISPKSAP LNTAVTLQLV GANFASNSQV YFNGNAVPTT FSSTTVLQAS VPAASVATPG NFAVTVTTPS MGTSAALMFT SYVALPNNSM AYSAATGQLY VSVPSTAGMP YGNSVVAIDP VTGAITKSIF VGSEPNKMAV SADGTVLWVG LDGSSAVRQV SLTAGTAGAK ITLGSNTGTN APPVALSLAA LPGSPNSFVV SMTAPLGGTV VAIYDNATRR ANTWSASTYS GNALQTNATT SEVYVGGPTY YQPLSYSATG LSVPRLGSSG NFTGSTDDLQ VVNGEVYTDL GALYDAETGA RNGSLLNGSN LAAGPTFTDT PLGKTFVFDS PTANKYTQVQ VFTTSTSALA ATFPLNLASN TTGTPSHLLR WGTNGLAVRD NVAIYAFRSA QVTNLAGINA DLSVTLAQSG TPTTGNSITY TATVKNAGPA TSTNVAFTAQ APATASIVSI TPTVGSCSKL NGLSCNLGSL ATGATTTVTV VAKQMLAGSV VLNAQVFGSE NDPNLANNQA STSVLTITGN PYNGVPTITS ISPAAIQAGS GTTMVTVTGT GFSTAASILI DGTALATTVL SSTQATAMVP STKLASLGWS KINVSNPAPG GGVSYVLPLS VFKVLSAGAN HIVYEPFSRK LIASIGAGGS GFTANSVTTI IPDTATVGTT MLLGAAPTSL AVTSDGQALY ATLPSVPSVA RFNLLAQKLD FTYTVPKGSS FTGTINLRGV STQPGNVNTV ALDLGASNGI GIYDFNSTTK TAALRGSNTG NYTGSCVRYS DSTNLMAFDS DSNLTFNHFA VPAAGFAYSN PTQYSTWSLA SFNCFQMNGG YAFANKGGAA IPVSAATTEV GVFKPIPNVT TSTMQVVAPD VSLHVVFYLA QTHSLSSTSA VDGLVTYNQT TYMPNTTIPM GLDLIENTTS FGGVDLVRWG QDGLAALTST GKIYLLRGGA VVPQLLSTRT AATLTSASVT SVTHGSGNLL ISVVGTNFQS GMVLTWNGNY RTTNVTDATH ATVAIPASDF ASIGAGTITA VNAGAPASSG LSITIN
|
| |