Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0352 |
Symbol | |
ID | 4069594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 385617 |
End bp | 388730 |
Gene Length | 3114 bp |
Protein Length | 1037 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982355 |
Product | hypothetical protein |
Protein accession | YP_589431 |
Protein GI | 94967383 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.197071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.320234 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTCC GCTCGTTGTG TCTCTGCTTT CTCCTGGGAG TTTCCGCTGC AATCGCGCAG CCCACACCAG CTTCCACAGC GACCAGCCAA GCGGCCATCG TGCCTCACCT CATCCGCTTT ACTGGACAAG TGAAAGACGC CAACGGAACG GTCGGCATCA CGTTCACACT CCACAAATCA CAGAGTGACA ACGCCGCGCT CTTCACGGAA ACACAAAACG TGAAGCTTGA CGGCGAGGGA AGGTACACCG TTCTCCTCGG GGCAACCAAG GGCGACGGCA TTCCGATGGA ACTCTTCACG TCCGGCGAAG CCCAGTGGCT CGCGATTCGC GTCGAAGGTC AAACGGAACA GCGTGTGCTG CTCGTCAGCG TTCCCTACGC GATGAAGGCG GCGGAGGCCG AGACTCTAGC GGGACACGCC GCGACAGACT TTGTGACCGC CGACCGGCTC ACCAGCACGG TTCAGCAACA AATGCGCCAG CAAGCCTCGA CCACAACCAC AGCGAAAGAC GCACCGACAG GGAAGCGCGG TAATGTGGTG ACCAACACCG CCACCAACTT CGCCGACGCA ACCAGTACCC AGGTTGTGCT CGTGACCCAG AGCGGCGCTG GTTCAGGACT CGTCGCCAGC GCGGTATCCG GGAATGGCGT CGCCGGATCC ACCACCACCG CGGCTGGCTT CGGGGTGTCG GGCGCCAACT CGGCTACGAC AGGCGTAGCG ATCGGCGTAC GCGGTACCAC CGTTGCCGAT AGCGGTATCT CGGTCTTCGG AACCGCCAGC GGAACGGCCG GCAGCGCGAC CGGCGTAAAG GGCATCACGG GAGCTCCGAA CGGATTTGGC GTCTTCGGCC AGAACACGGC GACCACTGGG CCAGCCGTCG GTTTCCGTGG CACGACCGCA TCAACCAGCG GCATCGGAAT TTTCGGCACC GCCACCGCCG CCACCGGCAC GGCAATTGGT CTGCGAACCT CGGTAGCCAG TCCGGGCGGG ACCGCCGCTG TTCTTCAGAA CACCGCCAGC GGAAAGTTGA TCAGCGGGCA ATCGGGTGCC ACAAATACCG AGGTTTTCTC GGTGGACGGT GCAGGGAACA CCGTGAGCGC CGGTGGTGTC CAGGCGGCGA CGATGAATGT GGTGAACACC ACAGTGCGTC AGCCGTTTCA ACTGAATGGT ACCGGCATCC TCGGCATCGG TGATCCTACC GAGTTGAACG TGTTTGTTGG GCGCGATGCC GGTAAGGTCA ACGTCGCCGA CTTTCCCACT GGGGCCGGAA TTGGAAACAC CTTTGTGGGA AACGGCGCCG GCGAGCACAA CATCGATGGA AGCAATAACA CATACGTCGG CCTTTTCACC GGTGGCGCGA TCCACTCTTC AGACAACACG GCGCTCGGTG ACAGTGCCGG AGCGGGAGAC GGTGCGAGAA ACACTGCCAT TGGCAAAGCC GCAGGCGCAG GCGTCCACGA TGACAACACC ACTCTCGGGT ACGAAGCTGG GTTTGGGAGT AGCGGTGCAC GCAACGTGGT AATCGGCGCA AGTGCAGCTT CAGATTTCTT TTCCGGTAAC GAAAATGTTG TTGTCGGAAT GCAGTCGGCG CTCCATCTTT CCACTGGTTC GCACAACACG TTCCTAGGCG CGGGGGCCGG AGCTTTGACC TCCACTGGAT CTTTGAACGT GATGATCGGC CAGAACGCCG GCACTGCGTC TAGCGCAGGT AGTGGAAATG TCTATATCGC AAGCAACGGC TGTAACCCAT CGCCGTGCAA TGAGAACAAT ACGATACGCA TTGGCGGGGA CTCCGGCCTT GGAACCGGTC ATACCGCGGC CTTCTTTGCG GGCATCAATG GCCATGCAAT TAGCACGGGT TCGCCCGTGT TTATCGACTC GAACAATCAG CTCGGCACCG GGCCTGCGAC GCTGCCACCG TCGGCCGGCT CCTCCTTTTA CATCCAGAAC AACACCGGCA GCCCGCAAAC CTCAGCCAGC TTCAACATTG ACGGCAACGG TTTCGCGGGC GGTCTCCTTC AAGGTGGGTT CGTAAACGCG ACCAGCACTG CGGCGAATAA ACCATACCGC GAGAATGGTG TCCCTTTCCT CGGTATCGGA GTCGAAGGTC AGAACAATGT CTTCCTCGGA GAATTGGCGG GCCAAAGCAA TGTGAGCGGG AGCGGACTCA ACAACACCTT CGTTGGTGCG TCAGCCGGTA ATTCAAATAC CGGAGGAGAT AGCAACACCT TTCTCGGCAG CTCCGCGGGA CAATCAAATG TGAGCGGCGG CTTCAATACT TTCGTTGGCG TTGATGCTGG TTTAAGGAAT ACGACCGCTT CCGGAAACAC CTTCATCGGT CAAACTGCCG GTATCGAAAA TTCAACCGGC GCCTCGAGTG TTTTTGTTGG TCATAGTGCC GGTGCCAACA ACACAACGGG TGGCCATAAC GTTTATGTTG GAACGACTGC TGGCCTCGAC AATTCGACGG GAGGCTTGAA CACTTTTGTC GGCGATGGAG CCGGCTTAAC CGACACAGGG AACGCCAATA TCTTTGTTGG CGCCAACGCC GGTGGCAATA ACACCTCGGG CGACAACAAC CTCTACATCG GCAATGTCGG GTGCACGTCG CCCTGCACCG AGAGCGCTAC GATTCGCATC GGCAACACCC AGACCTCGGC CTTCATGACG GGTATTGCCG GGAAGACCTC ATCGAGTGGC ATTACGGTCC TGATCAACTC GACAGGGAAA CTCGGTACCA CCACGTCGTC CCGCCGCTTC AAACAGAACA TCGCGAACAT TCCCGACAGC AGCAAGCTCT TCCAGTTGAG GCCGGTCACC TTCTTCTATC GCCCCGAATA CGATGACGGC ACCCACGTGC GGCAGTATGG CTTGATCGCC GAAGAGGTCG CGAAGATCTA TCCGGACCTC GTCGTCTTCG ACAACCAGGG CAAGCCGTAC ACGGTGCGAT ACCAGTTCCT CGCCCCGCTC CTTCTCGACG CCATGCAGAA GGAACACGCC GTGGTCGCCG CGCAGCAGAG CGTTATCGCT TCACAACAGA AACGCATCGA CGAACTCTCG CAGCGTCTCG CACGCCTGGA GGAAACCGTA AACCGTATTT CCGCGGCGCA CTGA
|
Protein sequence | MKLRSLCLCF LLGVSAAIAQ PTPASTATSQ AAIVPHLIRF TGQVKDANGT VGITFTLHKS QSDNAALFTE TQNVKLDGEG RYTVLLGATK GDGIPMELFT SGEAQWLAIR VEGQTEQRVL LVSVPYAMKA AEAETLAGHA ATDFVTADRL TSTVQQQMRQ QASTTTTAKD APTGKRGNVV TNTATNFADA TSTQVVLVTQ SGAGSGLVAS AVSGNGVAGS TTTAAGFGVS GANSATTGVA IGVRGTTVAD SGISVFGTAS GTAGSATGVK GITGAPNGFG VFGQNTATTG PAVGFRGTTA STSGIGIFGT ATAATGTAIG LRTSVASPGG TAAVLQNTAS GKLISGQSGA TNTEVFSVDG AGNTVSAGGV QAATMNVVNT TVRQPFQLNG TGILGIGDPT ELNVFVGRDA GKVNVADFPT GAGIGNTFVG NGAGEHNIDG SNNTYVGLFT GGAIHSSDNT ALGDSAGAGD GARNTAIGKA AGAGVHDDNT TLGYEAGFGS SGARNVVIGA SAASDFFSGN ENVVVGMQSA LHLSTGSHNT FLGAGAGALT STGSLNVMIG QNAGTASSAG SGNVYIASNG CNPSPCNENN TIRIGGDSGL GTGHTAAFFA GINGHAISTG SPVFIDSNNQ LGTGPATLPP SAGSSFYIQN NTGSPQTSAS FNIDGNGFAG GLLQGGFVNA TSTAANKPYR ENGVPFLGIG VEGQNNVFLG ELAGQSNVSG SGLNNTFVGA SAGNSNTGGD SNTFLGSSAG QSNVSGGFNT FVGVDAGLRN TTASGNTFIG QTAGIENSTG ASSVFVGHSA GANNTTGGHN VYVGTTAGLD NSTGGLNTFV GDGAGLTDTG NANIFVGANA GGNNTSGDNN LYIGNVGCTS PCTESATIRI GNTQTSAFMT GIAGKTSSSG ITVLINSTGK LGTTTSSRRF KQNIANIPDS SKLFQLRPVT FFYRPEYDDG THVRQYGLIA EEVAKIYPDL VVFDNQGKPY TVRYQFLAPL LLDAMQKEHA VVAAQQSVIA SQQKRIDELS QRLARLEETV NRISAAH
|
| |