Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2261 |
Symbol | |
ID | 4073255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2681596 |
End bp | 2683011 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637984277 |
Product | hypothetical protein |
Protein accession | YP_591336 |
Protein GI | 94969288 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000157771 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCAAT TTTTCCCCTC GCGTAGAATG AGTCGTCACG AAAGTGATAT CGCGCCAAGC GCGCCGGGGC GAAATCCCGT ACTGAGTACT GAGAGTCTGA AGGAGCTATT CGAAATGAAG AAACCGCTAG TGACGATGCT GCTAATGATG GCGACCGTCG CCATCCAGCC GCTTGCACTA CCCAGCCTTG CCGCCGGCCA AGCTGCCGCG CCCCAACAGA AGAAAGAAAT CAAAGACCCC GCGGAATACA ACGCATATGT AAACGCGGTG CAGCAAGCGG ACCCGAAGGC GAAGGCAACC GCGCTGCAGT CGTTCCTTCA GACTTATCCC AATAGCGTGA TGAAGACTGA CGCGATGGAA CTGTTGATGG CCGCTTACCA GCAGGCCGGC GACCAGCAGA ACATGCTGCA GACCGCCCAG CAGATCATCC AGGTCGAGCC CAACAACGTT CGCGCCCTCG CGCTGCTCGC ATACACCTAC CGCATGATGG CGCTGCAGAC GGGAAACAAG GACAACGCGG CCCAGGCTGC CCAGTATGGT CAGAAGGGCC TCACGGCGCT ACAAGTTATC CAGAAGCCCG CTGAAGTCAG CGATGCTGAC TTCGAGAAGC TGAAGAAGGA AACGCAGATC ATCTTCGACG GCGCTGCCGG CTTTGGCGCG TTGAACACGA AGGACTACGC AACCGCGCAG AAGGACTTTG AAGACGGCGT GAACCTCGCC GGCGCCAACG CCTCCTTCCT CGATGTCTAC CAGCTCGCGC TCGCCGATCT CGAGGCGAAC CCGGTGAACC CGAAGGGTCT CTGGTACATT GCGCACGCTG CCGCCACCGC CCCGAACGAC CAGGCGAAGA AACAGCTCGG CGACTACGGC CGCAAAAAGT ACAACAAGTT TCACGGTTCT GAGCAGGGCT GGCCGGAGTT GTTGACCGCC GCCGCTGCCT CGCCGACGCC GCCGCAAGGC TTCACGGTTG CCCCCGCGCC GCCGCCGCCG AGCCCCGCCG AGCAGGCTGC CGACCTCGTG AAGAGCAAGG AAGTGAAGGA CATGAGCTTC GCCGAGTGGC AGCTCGTCCT TTCGTCCGGC AACCAGGATG CGGCTGACAA GGTGTGGAAC ACGATCCACG ACAAGCAGAT CCAGCTCGTC GCGTTCGTAA TCAGCGCCTC GCGCACCAAG CTCGAACTCG CCGGCAGCAC CGACGACAAC GACGCGCACA AGGCTGACAT CACCGTCACC ATGGCAACCC CGATTCCGGC AGCGAAGGTG CCCAAGGAAG GCGCGACCGT GCAGTTCCAG GCCGCTCCCG ACACCTACAC GCCGAATCCG TTCATGATGA ACATGAAGGA CGGCGAACTG CCCGGCGTAG CCGCTGCACC GGCCCACAAG CCCGCAGGCG CCCACAAGAA GCCCGCCGCG CAGTAA
|
Protein sequence | MSQFFPSRRM SRHESDIAPS APGRNPVLST ESLKELFEMK KPLVTMLLMM ATVAIQPLAL PSLAAGQAAA PQQKKEIKDP AEYNAYVNAV QQADPKAKAT ALQSFLQTYP NSVMKTDAME LLMAAYQQAG DQQNMLQTAQ QIIQVEPNNV RALALLAYTY RMMALQTGNK DNAAQAAQYG QKGLTALQVI QKPAEVSDAD FEKLKKETQI IFDGAAGFGA LNTKDYATAQ KDFEDGVNLA GANASFLDVY QLALADLEAN PVNPKGLWYI AHAAATAPND QAKKQLGDYG RKKYNKFHGS EQGWPELLTA AAASPTPPQG FTVAPAPPPP SPAEQAADLV KSKEVKDMSF AEWQLVLSSG NQDAADKVWN TIHDKQIQLV AFVISASRTK LELAGSTDDN DAHKADITVT MATPIPAAKV PKEGATVQFQ AAPDTYTPNP FMMNMKDGEL PGVAAAPAHK PAGAHKKPAA Q
|
| |