Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0854 |
Symbol | |
ID | 4070987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1064014 |
End bp | 1065156 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637982863 |
Product | hypothetical protein |
Protein accession | YP_589933 |
Protein GI | 94967885 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGATA GCGTTCAACT CAACCTGATG ATGGGACCGT TTTTCCCTAT GACGCCGCCG CGCCCAGTCA TGGACGCCCT CGACAGTGTT GAGGTGACGG TGAATGACAC CGGCACAAGC GGATTCCAGC TTACGTTCTT GATCGACAAA CAATCGCCGC TCAACATCAT GTTCTTGCTC ACGGGTGGAC TCCCACTTCT ATTCATGCGG GTTGTTATCG TCGCCATCGT GAATGGAGTC TCGAATGTCC TGATCGACGG AGTCATCACC AACAACCACA TCTCTCCCGG AGACAAGGGC TCGAACTCGA CACTGACCCT GACCGGCGAA GATCTCACCG CGCTCATGAA CCAGTCCGAT TGGAGCGGTT TCCCCTTCCC GGCCTGCCCT GCGGAAGCGC GCGTCGCTCT CATCTGCGCG AAGTATGCGA TTTTCGGCGT CATTCCCTTG ATCATCCCCA GCGTGTTAAT CGACGTGCCG TTGCCGATCG ACATGATCCC CAGCCAACAG GGCACCGATC TCGCCTACGT TCGCGCGCTC GCCGATCGCG TCGGATACGT CTTCTATATC GATCCCGGAC CGGCTCCAGG CATCAGCAAA GCCTACTGGG GACCACAAAT CAAGTTCGGC GCAATTCAGC CCGCCCTCAA CATCGACATG GATGCATACA CCAACGTCGA AAATCTCACC TTCAATTTCG ATCAGCAGCA GAACCGGATT CCGATCGTCT ACATTTACAA CCAGCAAACC GGCGTTTCTA TTCCGATTCC AATTCCGCCG ATCACGCCCC TAAATCCGCC ACTCGGACTG ATTCCGCCAC TGCCGTCGAA CATCCCACCC GATCTCACGC CGATCCGCGA CGACCTTTCG AAGCGCCCAA TCCCCCAGAC CATCATGATC GGCCTAGCTG CAGCGTCGCA ATGGGCAGAT GCAGTTACCG GTGAAGGCAC CCTCGATGTT GTGCGTTACG GCGGAGTTCT CAAAGCCCGC GAGCTCGTAG GCGTGCGAGG CGCGGGACCT GCCTTCGACG GTCTTTATTA CGTAAAGAGT GTCACCCACA AAATCAAGCG TGGCGAATAC AAGCAGAGTT TCAAGCTGAG CCGTAACGGC TTGGTATCCA CAGTTTCCAC GGTGCCCTCA TGA
|
Protein sequence | MLDSVQLNLM MGPFFPMTPP RPVMDALDSV EVTVNDTGTS GFQLTFLIDK QSPLNIMFLL TGGLPLLFMR VVIVAIVNGV SNVLIDGVIT NNHISPGDKG SNSTLTLTGE DLTALMNQSD WSGFPFPACP AEARVALICA KYAIFGVIPL IIPSVLIDVP LPIDMIPSQQ GTDLAYVRAL ADRVGYVFYI DPGPAPGISK AYWGPQIKFG AIQPALNIDM DAYTNVENLT FNFDQQQNRI PIVYIYNQQT GVSIPIPIPP ITPLNPPLGL IPPLPSNIPP DLTPIRDDLS KRPIPQTIMI GLAAASQWAD AVTGEGTLDV VRYGGVLKAR ELVGVRGAGP AFDGLYYVKS VTHKIKRGEY KQSFKLSRNG LVSTVSTVPS
|
| |