Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3334 |
Symbol | |
ID | 4070296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3952847 |
End bp | 3955639 |
Gene Length | 2793 bp |
Protein Length | 930 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637985356 |
Product | hypothetical protein |
Protein accession | YP_592409 |
Protein GI | 94970361 |
COG category | [S] Function unknown |
COG ID | [COG2120] Uncharacterized proteins, LmbE homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.896794 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACCT GTACAACTGC AAAGTTCATG AAATTGTCCC TGGATCGCAC ACGCCCGTTC GTCGCTGGTT TGCTGGCCCT CTGCCTGGCC TCCCCTTCGG CCCTACTCTC CCAGTCTGCT GCTCCGCCCG CGCCGGAAGC CAAAGTTGTT GATGAGCTTG CGCCTTTGCC CCAGGACGAT GGCCGTCTCG GCCTCGAACT GCTACTCAAG CGCCTGAAGA CCACCGCGCG CTTGATGCAC ACCACCGCTC ATCCCGACGA CGAAGATGGC GGTATGCTCA CCCTCGAATC GCGCGGCAAG GGTTACGACG TTACCCTGAT GACCCTCACC CACGGCGAGG GCGGCCAGAA CAAGACCGGC AGCAATCTCT TCGACGAACT TGGCGTGCTC CGCGTCCTCG AACTGCTGGA GAGCGATAAA TATTACGGCG TCCACCAGCG CTTCAGCATG GCCGCTGATT TTGGCTTCAG TAAGAGCCCG CAGGAGTCGC TGGAGAAGTG GAAGGACGGC GGCAAGCCCG GCTACATCCC GCTGCGCGAC ATGGTGCGCG TCATCCGAAC TTTCCGTCCT GACGTCATCG CCCAGCGCTT CCAGGGCTCC GAGCGCGACG GCCACGGCCA TCACCAGGCG TCGGGCATCA TCACCAAGGA AGCGTTCCGC GCCGCCGCCG ATCCCAAGCA ATTTCCGGAA CTCAACCTTC CGCCGTGGCA GGCTAAAAAG CTCTACATGG ACAACGTCCG CGATGGCGAG GAGTACACGG TCGCCTTCGA TACCGGCGCG GAAGACCCCG CCCTCGGCAT GAGCTACGTG CAGTTCGCGA TGAAGGGCCT GAAGCACCAG CTCTCGCAAG GTGCCGGCGC ATGGAACGTC GATCCCGGCC CGCACATCAC GCGGTACAAG CTGATTGACT CAGAGTTGCC GCAACCCAAA GCCGGCGAGC ACGAAAAAGA TTTCTTCGAC GGCATCGACA CCTCACTGCC CGCGCTCGCC GATCGTCTCG GCGCGGATGA GAACAGAGTC CCATGGCTGA AGCCCGGTTT GGAAGAAGTT GCGAAACTGA TTGACCAAGC CAGTGAAGCC GAGAAGAAGG ACATGGAGAG CGCGGCGGCG CCGCTGAGCG CAGCCGCTCA AAGCCTGAGT GCGCTGCGGA CAAGGATCGA AAATAGTCCG CTCGACGATG CATCGCGTCG TGACTTACTT TCGCGCGTGG ATGAAAAGCG GCAGCAGGTG CGGAAAGCGG AGGCGCTCGC ACTGGGGTTG AAAGTCAAAG TAACCACACT TGCGGAAATG GCCGGAAAAC CTTCCGTCAC CCGCGGAACT CGCGTTCCGG TGGATGTCGC GATTGAGAAT GGCGGGTCGC AAGCGTTGCA GTTTCAGCTA GCCGATGTCG AGGATCCTGC GGCCGCCAAG CGGGCGGTGA AGTCGCTTGC CATCGCGGCC CATGGGCATT CAAGCGTGGT TGCCGAGAAC GTCGCGCGCG AGCCGCTCAC CAAGCCCGGG TTCCACCGTG ATAATCCGGA GGTTGACTCG TTCTACGAAA GCTCGGACGC TGATCCCACG ATCCCGTTCT CGGCCGGACC GCCGACCGTC TGGGTGTCTT ATCGCGCGGA GGGAAGCAAA GCCGCGATCG ATGGCTCCCA GGAAGTTCCA GTAGAGACCG CGATCCATCA GCCTGACGGC ACCACCCGCC TCCGCCCCCT GGCAATCGCT CCGAAGTTTT CGGTGCTAAT CGAGCCCAGC ACCCAGACGG TTCCCACCAC CGCCAAAGAC GCCCGCAAGA TCGACGTCCA CGTGCGCAGT ATCGCCGGCG GCCCTGCCAA AGGCACGCTG AAACTCCTCG TCCCCTACGC GTGGCGTGCG ACGCCGCTGC AACAGGCCAT CGACTTCAGC AAGTACGGCG AAGAGAAGAC TGTCTCGTTC CAGGTCACGC CCGGCGAACT CTCCGAGCAC AAGGCGAAGA TCGAAGCCGT GTTCGAGAGT GAAGGGAAGA GGTACACCGA GGGCTACACC ACGGTGGAGC GCGAGGACCT CGGCACCTAC TACTACTACC AGCCTTCGGT GCAGAAGATC AGCGAGGTCA AGGTCGCGAC GCAGCCGGGC CTGAAGGTCG GCTACATCAT GGGCGCCGGT GACGACATCC CGACCGCGCT GCAGCAGATC GGCGTGGACG TGGCAACGAT CACGCCTGAA GAACTCGCGA GCGGCGACCT CTCGAAGTAT GGGACCATCG TCGTCGGCAT TCGCGCCTAC GACACGCGTG ACGATGTGAA GCAGCACAAC GATCGCCTGC TCGAATTCGT GAAGAATGGC GGCACGCTCA TCGTGCAATA CAACGCCGGC GTCGCGGACT TCAACGCCGT CAAGCTGCCG CCGCAGACCG GACCGGTGCC ACCCAAGATG GACCCCAATT CGCCGATCCC CGGAAAGTAC GTGCCGTATC CGGCGGAGCT GAGCCGCGAC CGAGTGAGCG TGGAAGACGC GAAGATCACG GTGCTCGAGC CGCAGAACGC GATCTTCCAT TCGCCGAACG AGATCAAGCC CACCGACTTC GACGGCTGGG TGCAGGAGCG CGGCCTCTAC TTCATGGACC GCTGGTCGCC GGAGTATCAC GCGCTGCTCG AGTGCCACGA CCCGAACGAG CCCGAGCGCA AGGGCGGCCT GCTCGAAGCG AAGTACGGCA AGGGGACGTA CATCTACACC GGCTACGCGT TCTTCCGGCA ATTGCCGGTG GGCGTGCCCG GCGCGGTGCG GCTGTTCGTC AACCTGCTGA GCGCAGGGCA TGAGCAACAC TGA
|
Protein sequence | MQTCTTAKFM KLSLDRTRPF VAGLLALCLA SPSALLSQSA APPAPEAKVV DELAPLPQDD GRLGLELLLK RLKTTARLMH TTAHPDDEDG GMLTLESRGK GYDVTLMTLT HGEGGQNKTG SNLFDELGVL RVLELLESDK YYGVHQRFSM AADFGFSKSP QESLEKWKDG GKPGYIPLRD MVRVIRTFRP DVIAQRFQGS ERDGHGHHQA SGIITKEAFR AAADPKQFPE LNLPPWQAKK LYMDNVRDGE EYTVAFDTGA EDPALGMSYV QFAMKGLKHQ LSQGAGAWNV DPGPHITRYK LIDSELPQPK AGEHEKDFFD GIDTSLPALA DRLGADENRV PWLKPGLEEV AKLIDQASEA EKKDMESAAA PLSAAAQSLS ALRTRIENSP LDDASRRDLL SRVDEKRQQV RKAEALALGL KVKVTTLAEM AGKPSVTRGT RVPVDVAIEN GGSQALQFQL ADVEDPAAAK RAVKSLAIAA HGHSSVVAEN VAREPLTKPG FHRDNPEVDS FYESSDADPT IPFSAGPPTV WVSYRAEGSK AAIDGSQEVP VETAIHQPDG TTRLRPLAIA PKFSVLIEPS TQTVPTTAKD ARKIDVHVRS IAGGPAKGTL KLLVPYAWRA TPLQQAIDFS KYGEEKTVSF QVTPGELSEH KAKIEAVFES EGKRYTEGYT TVEREDLGTY YYYQPSVQKI SEVKVATQPG LKVGYIMGAG DDIPTALQQI GVDVATITPE ELASGDLSKY GTIVVGIRAY DTRDDVKQHN DRLLEFVKNG GTLIVQYNAG VADFNAVKLP PQTGPVPPKM DPNSPIPGKY VPYPAELSRD RVSVEDAKIT VLEPQNAIFH SPNEIKPTDF DGWVQERGLY FMDRWSPEYH ALLECHDPNE PERKGGLLEA KYGKGTYIYT GYAFFRQLPV GVPGAVRLFV NLLSAGHEQH
|
| |