Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4387 |
Symbol | |
ID | 4073293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5200767 |
End bp | 5202275 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637986420 |
Product | hypothetical protein |
Protein accession | YP_593461 |
Protein GI | 94971413 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTT CAATCCTCTT CGCGTTCCTT GCGTGCGTCT GCGCAGCATA TGGCCAGCCG AACACATCTT TCTACGTTGC GACGACCGGG AAGGATTCAA ATGCCGGCAC GCAAGCAGCG CCGTGGCGCA CCATTCAACA TGCCGCAGAC ACGGCGCGCG CGGGCAGTAC CGTCAACGTG CGCGGCGGAA CCTACGAAGA GCTGGTGAGC CTCCACGCAT CCGGCAACGC CAGCGATGGT TTCATCACGT TTCGAAGTTA TCCCGGCGAG GCGGCGATCC TCGAGGCCGA GCACATCACG CCGCCCGGAA GGACCGGCGT ACTGACGATT CACGACCAGA GTTATGTGCG GGTCGAAGGC TTTGAGATTC GCAACTTCCG CACCGCCGAG CATGACCTCG CTCCGCTAGG CATCGACGTA ATGGGCGCGG GCTCTCACAT CGAGCTGCTG AAGAACAACG TCCACCACAT TCAGCAGACA TTTGAAGGGC GCGATGGTCC GGGGCACGGC GCCAACGCGT TTGGCATTGC GGTGTACGGA ACCAGCGCCA AAACTCCGAT CACGGATTTG GTCATCGATG GCAACGAGGT GCATCACCTC AAGACCGGTT CGAGCGAGTC GGTGGTGGTG AACGGGAACG TCACCAACTT CCGCATCACG CACAACGTTG TGCACGACAA CAACAACATT GGCATCGACG TCATCGGTTT CGAGCATACC GCGCCCGACC CGGCGGTGGA CCAGGCGCGC GACGGGCTCG TCAGTGGCAA CTTGGTTTAC AACATCACCT CGAAGGGCAA TCCCGCTTAC CGTAACGATG AATCTTCCGA CGGCATTTAC GTGGACGGCG GCACCCGGAT CCTTATCGAA CACAACGTAG TTCACGATGT GGACTTCGGC ATCGAGCTGG CGAGTGAGCA CAAGGACCGC GCCACCAGCT ACGTCATCGC GCGCAACAAT CTCGTCTATC ACAACCACAC CGCCGGTGTT TCCATCGGCG GCTACGATCC GCAGCGCGGA CACACCGAGC ACTGCACGGT GATCAACAAC ACGCTCTACG ACGACGACAC CTCGGCCACC GGCTCCGGTG AGTTCCAGAT GCAATGGAAC ATGGCAGACA ATATTTTCGC GAATAACATC GTGTACGCCG GGCCGCAGTG CCTGATGACG ATTCTCAAAA CTGAAGTCAA GCCCGGCCAA CCGCCCGCGA ATATCGATCA CAACCTCTAT TACTGCGCTT CCGGTGCCAA GGCGAGCACG TGGAAAAACA CTGCCGCCAC TGTGACGGGA TTTGAAGAGT ACTCGCAGGC CAGCGGCAAT GACCGCAATT CGCATTTTCA GGATCCCCAT TTTGTCGACG CTGCCGCGAA GGACTTCCAC CTGCAGCCAG ACTCTAAGGC CATCGCCGCA GGAGCCATTG ACGGAATGCC GGTGGGAGCA CTGGATCTTG ACGGCTCGCC GCGGACGAAA TCTGGCAACA TCGACATCGG CTGCTACCAA CGAAAATAG
|
Protein sequence | MKISILFAFL ACVCAAYGQP NTSFYVATTG KDSNAGTQAA PWRTIQHAAD TARAGSTVNV RGGTYEELVS LHASGNASDG FITFRSYPGE AAILEAEHIT PPGRTGVLTI HDQSYVRVEG FEIRNFRTAE HDLAPLGIDV MGAGSHIELL KNNVHHIQQT FEGRDGPGHG ANAFGIAVYG TSAKTPITDL VIDGNEVHHL KTGSSESVVV NGNVTNFRIT HNVVHDNNNI GIDVIGFEHT APDPAVDQAR DGLVSGNLVY NITSKGNPAY RNDESSDGIY VDGGTRILIE HNVVHDVDFG IELASEHKDR ATSYVIARNN LVYHNHTAGV SIGGYDPQRG HTEHCTVINN TLYDDDTSAT GSGEFQMQWN MADNIFANNI VYAGPQCLMT ILKTEVKPGQ PPANIDHNLY YCASGAKAST WKNTAATVTG FEEYSQASGN DRNSHFQDPH FVDAAAKDFH LQPDSKAIAA GAIDGMPVGA LDLDGSPRTK SGNIDIGCYQ RK
|
| |