Gene Information |
Locus tag | Acid345_0638 |
Symbol | |
ID | 4069576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 783336 |
End bp | 786545 |
Gene Length | 3210 bp |
Protein Length | 1069 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637982644 |
Product | hypothetical protein |
Protein accession | YP_589717 |
Protein GI | 94967669 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
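The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 783336
end_bp = 786545
gene_length_bp = 3210
protein_length_aa = 1069


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 3210
print(expected_protein_length(gene_length_bp))  # 1069
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the record being an unspliced, non-pseudo CDS.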
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000046909 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0437117 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACAA ACGAGCGCGT GCAGCTATCA GAGCGCATCC CAGGCACCGT GCGTCAGGTC CTACTTTTGG TTCTTTTCTG TTTGTTAGCT GTGCCGTTAT TTGCGCAGTT TGATACCGGC ACGATTACCG GAACCGTGAC TGATTCCTCC GGCGCCGCAA TCCCCAGCGT AAAAGTCACT GTGACCAACA CAGGCACCAA CGTTCAGAAG ACGGTGACGA CGAACGCGAC CGGATATTAC GTCGCGTCGG AATTACCGGT CGGAAATTAC GTAGTGAGTG CGAATTCGAC CGGCTTTGCC GAGACAAAAA GCCAGAGTGT TGTGTTGAAC GTAGGCGCGG TCGTACACGC GAACCTGGCA ATGGCAGTCG CTGGGAGCGA GCAAAAAGTT GAAGTGACTG GCACCACGAC GTCCGTTGAC ACCGAGACGG CCCAAAGCGG CACCACGTTG AACGCGACGC AGGTGGCGAA CCTTCCGATC AACGGGCGCG ACGTGAGCAA CTTCCTTGAG ATCGCTCCGG GGTCGGTGGC TTCAACCACA TTCTTCCAAG GGAGCGTCAA CGGGCTTGAG AACATTTTCA CCGGTCTGAA CATTACGGTT GACGGCCAGA ATGCCTCACG CGGAGACATT AACGGATTCC TCGATACGGA AGGCCAGGAA CTGGCGCGGG TCACGCGCGC GAGCGTCGAC AGCATCCAGG AAATTGACTT TACTAACAGC GGTTTCAGCG CTGAAGCGGG ACGGTCACTG GGCCCGCAGA TGAACATCAT CACCAAGTCG GGCACGAACG ACTTCCACGG GACAGCGTTT GAGTTCCTGC GGAACGACGC ACTGGATGCC AAAGACTACT TCAATAACGG CAAGGCTGCC CCACTGAGGA TGAACCAGTT CGGCGGCAAC ATCGCCGGTC CAATCATCCG CAACAAGCTC TTCTTCTTCG CGAATTACGA GGGCGATCGC ACGCACATTA CGAACTTCAA TGCGCTGTAC GAGACGCCAA GTGCGTATAT GCGTTCGAGG CCACATGACC CTCGGATGGA TCCTGTCTTC GCGCAACTGG CGCCGGTTCC GGTGGGATGT ACCAGTATTC CTGCGCCGGC TTCCTGCGCA GTGCCAGGAA CCACGGATAC CACCGACGCG ACAAGTACCG CGGGCGGCGC GCAGCTCGTG TATGAAACCG CGTCGTTGCC CGATATTCTT CGCGAAGACA CCGGGTCTGT GAAAGTTGAT TGGAACATTT CGGACAAAGA TCGGATTTTC TTCCGCTACA ACATCAATGA TTCGTTAACG GATTACACGT ATGGATTGAA CCAAGGACAA GTTTCGCCGC AATCCATGCG GACGCAGTTG TTGAAGGTGG ATGAGACGCA CACCTTCAGC CCGACATTGC TGAACCAGCT GAGCTTGGCG TTGAACCGGT TCTACTCAAA CACGGACTCG AATACGCCAG AACCATTTGT GGGGTTCAGC GGCTTCTTCA CTAACCTCGG CAATCTGCCG GGGCCGAACA CGTTCAACCA GATCACGCCG TTCAATGTGT TTGAAGTGTT TGACAACGTC ACGAAGACGT CGGGGCGTCA TACGCTGCAT TTTGGCGCAC AGATTCGCGC CAACCAGTTG AACGAGTGGT TGCGTCCTCA GCAGGTGTAC CAATTCGGCG GAGCAAACAT CTTTCAGCCG GATATGTTCC ACGATTTCCG GACCGATAAC CCGTTCGTTT TGCAAAAGAT CGGGTTCCCG GGATTTGTGG GCGTGACCAA CTCAAATTGG GGTCTCTATC TCCAGGACGA TTGGAAGGTA 
ACGCGCACCC TGACGTTGAA TATCGGCGTT CGTTACGAGT ACAACACCGC GTGGAGCGAG CGTCACAACA TCGAGCAGAA TTTCGATTTC GCGACGCAGG CCTTCCTGCC GCAAAACCAG GACATCTACA ACGCGCCGAA GGGTGATGTT GCGCCGCGCC TCGGCTTCTC GTGGGACCCG TTCGGCACTG GAAAGACGGT GGTACATGGA TACTACGGTC TGTTCTATAT GCCGATGCAG TACGGCTTCG GGATGATCGG GAACATTCCC GATTACCAGA GCTATAGCGT GAATGTGTTC CAGGCGATTT TCGGAAATCC TCCGTTCTCG ATCGCCTATC CGTCACCCAA CCCGCCGTTG CAGCCAGGAA CGCAAAACGT AAATATCTTC CCGAGTAATC CTCAGGATCC TTTCTCCGAG AACTGGATGT TCGGGATCGA GCAGGAGATT GCCCCAAACA CGGTTCTGGC CTTGAACTAC ATTGGCAACC ACGCTATGCA CATGCAGGCG GGCGTTTCGT TCGCGAATGT GAACCTGAAT CCCGCTAACC CATTCACGCA GGCGCGTCCG CTATCGGGAT ACGCGAGCGA GAACTATCTG TGCGATTGCC TGTTCTCGAA ATACAACTCA CTGCAAGCGC AAGTGCGGCA CAACATCGGG AAGTTGAATT TCGAAGCGAA CTATGTCTGG TCGCACGAGA TTGACGACCA GATGAACTTC CTGAGTCCCG GGTATACCAA CCCGGCCGAC CCGAAGGCTG ACATTGCCAG CGGCGATTGG GACGTGCGTC AGAACCTGAC AGGCAGCGTG GTGTATGCGT TCGGAAACCT GAAGGGAGAA GCGGCGTGGA AACGGGCCAT CCTGGGCGGA TGGCAGGCTT CAACCATTCT CCAGGCGCGG TCTGGCTTGC CGGTGAACAT CACGCTGGTG AGCGGATTGT TTGGGAACCC GACGCGTCCG AATCGCGTGG CGGGACAACA AGGCTACCTG TCGGACATAA ATTGGCCGAA CAGCAGCTTC AACTCCGCCG CTTATGAGAT CAATCCTAAC TACACCGGCG ACTGGGGCCC AGTGTGGGGC AACACGGGAC GGAACGACCT GCGTGGTCCT GCCTTCGTGC AGTGGGACAT GTCGGGGATG AAGAACATTC CGATCACGGA AAGGGTAAAC CTGCAATTCC GTGCGGACAT CTTCAACATC CTGAACCACC CGAACTTCGC AACGCCCGAT GGTGGAATCT GTAGCTCGAT CACTGGGGCG TTCACGGATG CCTCTGGATT CCATCCGGCA ACCTGCGCGC CGAATCCGAA CTTCGGCCGC ACCGGACAGA CAGTTGCTGA CTCGAACACA AGCCAGATCG GACCGGGAAC CAACCGGCAG ATCCAGCTCT CGCTGAAGCT GGTTTTCTAG
|
Protein sequence | MTTNERVQLS ERIPGTVRQV LLLVLFCLLA VPLFAQFDTG TITGTVTDSS GAAIPSVKVT VTNTGTNVQK TVTTNATGYY VASELPVGNY VVSANSTGFA ETKSQSVVLN VGAVVHANLA MAVAGSEQKV EVTGTTTSVD TETAQSGTTL NATQVANLPI NGRDVSNFLE IAPGSVASTT FFQGSVNGLE NIFTGLNITV DGQNASRGDI NGFLDTEGQE LARVTRASVD SIQEIDFTNS GFSAEAGRSL GPQMNIITKS GTNDFHGTAF EFLRNDALDA KDYFNNGKAA PLRMNQFGGN IAGPIIRNKL FFFANYEGDR THITNFNALY ETPSAYMRSR PHDPRMDPVF AQLAPVPVGC TSIPAPASCA VPGTTDTTDA TSTAGGAQLV YETASLPDIL REDTGSVKVD WNISDKDRIF FRYNINDSLT DYTYGLNQGQ VSPQSMRTQL LKVDETHTFS PTLLNQLSLA LNRFYSNTDS NTPEPFVGFS GFFTNLGNLP GPNTFNQITP FNVFEVFDNV TKTSGRHTLH FGAQIRANQL NEWLRPQQVY QFGGANIFQP DMFHDFRTDN PFVLQKIGFP GFVGVTNSNW GLYLQDDWKV TRTLTLNIGV RYEYNTAWSE RHNIEQNFDF ATQAFLPQNQ DIYNAPKGDV APRLGFSWDP FGTGKTVVHG YYGLFYMPMQ YGFGMIGNIP DYQSYSVNVF QAIFGNPPFS IAYPSPNPPL QPGTQNVNIF PSNPQDPFSE NWMFGIEQEI APNTVLALNY IGNHAMHMQA GVSFANVNLN PANPFTQARP LSGYASENYL CDCLFSKYNS LQAQVRHNIG KLNFEANYVW SHEIDDQMNF LSPGYTNPAD PKADIASGDW DVRQNLTGSV VYAFGNLKGE AAWKRAILGG WQASTILQAR SGLPVNITLV SGLFGNPTRP NRVAGQQGYL SDINWPNSSF NSAAYEINPN YTGDWGPVWG NTGRNDLRGP AFVQWDMSGM KNIPITERVN LQFRADIFNI LNHPNFATPD GGICSSITGA FTDASGFHPA TCAPNPNFGR TGQTVADSNT SQIGPGTNRQ IQLSLKLVF
|
| |
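The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted string might be cleaned up and its GC fraction computed is shown below. The helper names are hypothetical, and only the first three blocks of the gene sequence are used, so the printed value is not expected to match the 56% reported for the full gene.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence above (illustrative subset only).
first_blocks = "ATGACAACAA ACGAGCGCGT GCAGCTATCA"
seq = clean_sequence(first_blocks)

print(len(seq))         # 30
print(gc_percent(seq))  # 50.0
```

Pasting the full gene sequence into `clean_sequence` would give a string whose length can be compared against the Gene Length field before any further analysis.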