Gene Information |
Locus tag | Acid345_4220 |
Symbol | |
ID | 4073146 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4998539 |
End bp | 4999627 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637986251 |
Product | hypothetical protein |
Protein accession | YP_593294 |
Protein GI | 94971246 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
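The coordinate and length fields above are consistent with one another, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is an untranslated stop codon; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4998539
end_bp = 4999627

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1089 362
```

Both values match the reported Gene Length (1089 bp) and Protein Length (362 aa).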
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00877371 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0608426 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACGC GCGATACTAA TCGCATTGTC CGGCCCTTTG AATGGGGGCA GGAATGGACG CGTGATTTTC CCGGCGCGGA CCGCCTCCAG TGGGGCGAGA CCGCGGCTGA ACACTTCGAT TACTTCACGG AGTTGAATCG CCACATCGTC GAACACAGCG ACGAGTTTTT TTCTTACAAG ACGCCGAGCG ACTATCGCCT CGAGAAGCGT CGGGTGCAGG TGTTCTTCAC CGGATCGGGA GAGCCGCCCA AGGATCCGGA TGAGACTGGC ACCTATCTGC GCTTCACTTC GCCGCATCCG TCGCCGTATC TGGAGAACAA CGTTTTCAAT GCGCGCTGGT TCCCGGCGAG AGGGAAGCGG GCGATTATCG TGCTGCCGCA GTGGAACGCC GATGGCATCA GCCACAACGG CTTCGCACGC ATCTTCAACC CGATGGGCAT TGCGGTTCTG CGTATGAGCA AGCCGTATCA CGATATTCGG CGGCCGGCGG AGTTGCACCG CGCCGACTAT GCGGTGTCGT CGAACGTCGG GCGCACGATT CATGCGGCGC GGCAGGGTAT CACCGATATT CGCGCCGCTC TCGATTGGCT CAACTCTGAA GGCTATACGC AGCTGGGAAT CCTAGGCACG AGTCTTGGTT CCTGCTATGC GTTCATCGCG AGCGCGCATG ACGAGCGGCT GCGGGTGAAC GTCTTTAATC ACGCTTCGAC ATACTTTGGC GATGTGGTTT GGACGGGGCA GTCGACCCGT CACGTGCGCG CGGGGATTGA AGAGGTCGGC CTCGATATGG ATGCGTTGCG GAAGATATGG CTGGCCGTCA GCCCGATGGC GTTCTTCGAT AAGTTCGAGC GCTGGCAAAA GAAGTCGCTG ATGATCTACG GCAAGTACGA CCTCACGTTC CTGCCGGAGT TCTCGCAGCA GATCGCCGCC GAGTTCAAGC GCCGTGGATT AGACACGCTA GTGAAAGCGC TGCCGTGCGG ACACTACTCG CTGGGCGAGA CGCCCTACAA ATACATGGAC GCGTGGCATA TTTCGCGGTT CCTGCGGCGA GCGTTTGGGG CGCATATGCA GACACAGCAC GCGGTGTAG
|
Protein sequence | MTTRDTNRIV RPFEWGQEWT RDFPGADRLQ WGETAAEHFD YFTELNRHIV EHSDEFFSYK TPSDYRLEKR RVQVFFTGSG EPPKDPDETG TYLRFTSPHP SPYLENNVFN ARWFPARGKR AIIVLPQWNA DGISHNGFAR IFNPMGIAVL RMSKPYHDIR RPAELHRADY AVSSNVGRTI HAARQGITDI RAALDWLNSE GYTQLGILGT SLGSCYAFIA SAHDERLRVN VFNHASTYFG DVVWTGQSTR HVRAGIEEVG LDMDALRKIW LAVSPMAFFD KFERWQKKSL MIYGKYDLTF LPEFSQQIAA EFKRRGLDTL VKALPCGHYS LGETPYKYMD AWHISRFLRR AFGAHMQTQH AV
|
| |
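The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the database's own method. It relies on two observations: the gene sequence shown begins with ATG and ends with TAG, so it appears to be given in coding orientation already even though the gene lies on the minus strand; and translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which start codons are permitted). The function name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace between blocks."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    # Drop a single terminal stop codon, if present.
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


# First three 10-base blocks and the final nine bases of the gene sequence above.
print(translate("ATGACCACGC GCGATACTAA TCGCATTGTC"))  # MTTRDTNRIV
print(translate("GCGGTGTAG"))                         # AV
```

The two fragments translate to the first ten and last two residues of the protein sequence listed above. Passing the full gene sequence to `translate` should yield the full protein under the same assumptions.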