Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0857 |
Symbol | |
ID | 4068951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1066208 |
End bp | 1069390 |
Gene Length | 3183 bp |
Protein Length | 1060 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637982866 |
Product | hypothetical protein |
Protein accession | YP_589936 |
Protein GI | 94967888 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCA CCGCCTTGTC ACACTCCAAC TCGCAGCAGG AGAACACTTC CTCGAATTCG TTGCGCGTCG GCGACCCCAA TAGCTCATTC GAACGGGAGG CGGACCGCTT CGCGGATCAA GTCATGACTG GGGAGTCTCA CAAGAGTCCA TGGTCGCTAT CTCGCGTCAG CCTCGGACAA ACCCTACAGC GCAAATGTGG TTGTAAAGGG TCGAGCGATT CGAAGGACGA ATGCGATGAC TGCAAGAAAA AGAAGCTTTT GCAGCGCAAG CCGGCAGCAC TCGGAGCCCC ACATGTCGCT CCGCCCATTG TGCATAGCGT TCTGAATTCG AGCGGCCGAT CTCTCGATCA TGCAACAAGA TCGTTCTTCG AGCCGAGATT AGGCGTCGAC CTCGGAAGCG TGCAAGTACA CGACGATGGT CGTGCAGCGA GGTCCGCAGA GGCGGTCAAT GCCCACGCCT ACACCGTCGG CAATTCAATT GTGTTTGGCG AAGGCCGCTA CCAGCCTGAA AGCGCTGAGG GCCGTCGCCT GCTGGCCCAC GAGTTGACCC ACGTCGTTCA CCAACAGGGT CCAAGCGAAC GTCTGGTCCA GCGGGAAGAG GATTCGGATG TTGAAGTCGC TGACGCGCCT CGAACCGGTT CGCTGATCGA AAAGGCTTTT GATGCCGCCG ACGCAGCGCA CTGGGAACAA GCTGCCGAGT TGGCAAATGG ACTAAGTGCC GGCGATCTAA AAGCGTTCAT CGGCTCCTTG GGCGCGGGCT GGAAGACCGA GCAACTCCAC ATCGGCGCCA TCAGCAACGC TCGCGTTGGG CCTGGCTCTG CGGTCGCGAA GATGACGCAT TGGGCATTCC TCAACGCCAA ATTTTCGGAA CAGATGAAGG GCGGGTTCTA CCAGGCAGCG TCGGAATATC TCAACGGATT TAGTCAGGGC GAGATCCGCT CCCGCATCAG CAAAATGAAA ACTGAGGTTG TCGCCGGCCT GCACAGCGGC GCCGTCGCTC AGCCCGGGAT TGGCGCAGAT TCGAACGCCG CCAAGATCAC TGGCGAGGAA CTCGATAAGC GTAAAGAAAA AGGCGATGCC GCCGCCGATG CCGCAACCAA GGCAGCCATT CCCGAGAATC CTCAACAGAA AAAGAAGCGT TGCCAGGACA CCGCCGGCCA AGGCTTCAAG ATCTTCCCTC TCCGCTTGCC CAAAGGTATG TGGCAGCTTT CAAACGCGCC CATCGGCGCG GAACGCAAAG GCGACGAGAT CCTCGTCAAG CAGCCTCTGA ACGACGTCAA AGGCGATCCG ATGTTCCGCC GCGAAACGAA AACTCTGCCG CTCGAAACCT TCCTCGGGGG CATTCGCTTG AAGAAGGACG AAGTCGTTGG CGTGCGCCTC TATGACGACA AAGAACGCCT CGTCTGCGTA ACCGGCGAAG ACATGCTCAA GTTCAACGAC GCGACCGAAA TGGCGCTCTG GTTCAGCGTC GGCCGCACCG CTTTGGATGC CGCAACCATC TTCGCGCCGG GTGCGAGCGC CGGTGCAAGC AAGGTAACCG GCTTCGCGGT TGGCAACATC GTCGCGGGTG AACTCCTCGA TGTCGGCCGT CAGGAAATGG AAGTCAAGTA CGGACTGCGC GAAGAAGTCG ACTGGGGTGG CATCGCCTTT GACACTGTCT TCCAACTCGC GACCCTGGGC TTCTCAAAAT ATCTGAATAA TGCTGCCACG AAAGCTGTGC TCGGAAAGGC TCCGGAACTC GGGCAGAAAC CAGCGCAACT TGCCGTCCAT GCCGCTCTCG CCGGCGCCAC CAACTTGGTG CAGACCGCCG CGAGAACCGC GTTCGACATG CTGCGAAAAG AGAAAAAGAA ATTCGTGATG GAAGATTTCC TGATCGAACT CGCGCAGGCC TTCGCTACAG GAACACTTTT TGCATTCGTA CATGGCGCTG CCGTTCACGA AGAAGGATTG CCGCAAGAGA AGCAAGCTGC CCCATCCGAG CATCAGCAGG GCGCACCACC TGTACACGAT CAAGCGGCTC CACCACTACA CGATCAAGCG GCCACGCCAG TGCACGATCA AACCGCGGCT CCAGTCGCTA AGCCGCAGAA GAAAACGCCC CCCGTTCATA CGGACGAGCA CGTGACGACA GCGCCTCCCG ACAAGGCACC TGGCGAGCGC AAAGGCGCTG CGCCTCTTCA CGAAGACACA CCACCGGCCA CCACGCAGAA ACGCGCAATC GGTGCGCCTC CAGAAGAGCA CGCCGGAACG CCTAGCGCGA AGAAGACGGA CGGACCGGAA GGCCAAACTG CCGCGGCCGT ACAAGAGAAG GACGCAACCG CCAAGAAGAA GACTGCGGAT GGCAAACACG ATGTCGTCGT TACCGAGCAG GGTGTCGGGA AGTGTAGTCC TCCGCCTTGC CCTGTGATTC ACGTGGAATA CAAGAAGGAA CTTGATGCCC ATCCTGAGTT CAAGGAGTGG AACGAATCGG TCCAGAACAT GCGTAAGGCC GATCCCGAGT TTGCCGCCGA ACAGGGCAAG AAGCTCATCG CCGCGTTGGA AGACGTACGT GCAAATGGAG GGAAGCTCTC TGGCGAAAAA CTAGTTCAGC ATCGCGAAGC CGCTTTGCAA GCGCGCCTCG CAGAAGCCGA AGGAGATCTG CACAAGGCAC GTTGGGACAC CATCGACTAT CAAGCCGAGC GAGCCGCGAC TGGCAAGAGC AGGAAGGGCG GGCCGATCAA GGGACTCTGG AATGTCAAAG AACGGATATG GGCAATCAAG AGGCAGATGG CCTATCCGAA TCGCACGATT CTCGAACAAG CGCACATTGT CGGTGTGCGT GCTCCTGACG GAACAATCAA GCCCACAAAC GAAATCGGCA AAGGTGGACG CATCCCGGAT TACGTAGAGG TACGCGGCCA GAAGATTGTG GCGGGCGACC TCAAGTCCGG AGAGGAATTC AAGAAAAGCA TCGCCGGCGG ACTGGCGAAA CCAGGCGAAA TCGAAGCGGA GTTCCGCAAG AGCGCGAAGA TCGCGCAGCA ACAAGGCGTA GAAGACAAGG TCCTGAATGC AGCAAAAGGA AATGGCGGAA AGATTGTGAT CGAGGGCTTT GACGTAACAA CTGGCGAGAA AGTCGTGAAA GAAGTGGATC CGGCCGATTA CGGGTCGGAA GTAATTACCT ACGACGACGT TCGCACCAAC TAG
|
Protein sequence | MSRTALSHSN SQQENTSSNS LRVGDPNSSF EREADRFADQ VMTGESHKSP WSLSRVSLGQ TLQRKCGCKG SSDSKDECDD CKKKKLLQRK PAALGAPHVA PPIVHSVLNS SGRSLDHATR SFFEPRLGVD LGSVQVHDDG RAARSAEAVN AHAYTVGNSI VFGEGRYQPE SAEGRRLLAH ELTHVVHQQG PSERLVQREE DSDVEVADAP RTGSLIEKAF DAADAAHWEQ AAELANGLSA GDLKAFIGSL GAGWKTEQLH IGAISNARVG PGSAVAKMTH WAFLNAKFSE QMKGGFYQAA SEYLNGFSQG EIRSRISKMK TEVVAGLHSG AVAQPGIGAD SNAAKITGEE LDKRKEKGDA AADAATKAAI PENPQQKKKR CQDTAGQGFK IFPLRLPKGM WQLSNAPIGA ERKGDEILVK QPLNDVKGDP MFRRETKTLP LETFLGGIRL KKDEVVGVRL YDDKERLVCV TGEDMLKFND ATEMALWFSV GRTALDAATI FAPGASAGAS KVTGFAVGNI VAGELLDVGR QEMEVKYGLR EEVDWGGIAF DTVFQLATLG FSKYLNNAAT KAVLGKAPEL GQKPAQLAVH AALAGATNLV QTAARTAFDM LRKEKKKFVM EDFLIELAQA FATGTLFAFV HGAAVHEEGL PQEKQAAPSE HQQGAPPVHD QAAPPLHDQA ATPVHDQTAA PVAKPQKKTP PVHTDEHVTT APPDKAPGER KGAAPLHEDT PPATTQKRAI GAPPEEHAGT PSAKKTDGPE GQTAAAVQEK DATAKKKTAD GKHDVVVTEQ GVGKCSPPPC PVIHVEYKKE LDAHPEFKEW NESVQNMRKA DPEFAAEQGK KLIAALEDVR ANGGKLSGEK LVQHREAALQ ARLAEAEGDL HKARWDTIDY QAERAATGKS RKGGPIKGLW NVKERIWAIK RQMAYPNRTI LEQAHIVGVR APDGTIKPTN EIGKGGRIPD YVEVRGQKIV AGDLKSGEEF KKSIAGGLAK PGEIEAEFRK SAKIAQQQGV EDKVLNAAKG NGGKIVIEGF DVTTGEKVVK EVDPADYGSE VITYDDVRTN
|
| |