Gene Information |
Locus tag | Acid345_1567 |
Symbol | |
ID | 4068676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1915802 |
End bp | 1916842 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637983576 |
Product | hypothetical protein |
Protein accession | YP_590643 |
Protein GI | 94968595 |
COG category | [R] General function prediction only |
COG ID | [COG0820] Predicted Fe-S-cluster redox enzyme |
TIGRFAM ID | [TIGR00048] radical SAM enzyme, Cfr family |
|
|
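The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1915802
end_bp = 1916842
reported_gene_length = 1041      # bp
reported_protein_length = 346    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a single stop codon that is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Because the gene is not spliced and not a pseudogene, the gene length is expected to be a multiple of three, which the same values confirm.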
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00430195 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00703244 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGCGTC TTGGCCAGCC CGCCTACCGC TCCCGGCAGC TTTGGCAAGG CCTTTACCGC GACCGAATCG CTTCACTCGA CCAGTTCACC ACCCTCCCCA TCCCCCTCCG CGAGGAGCTC AAATCCTCAG GTTGGGCCAT CGCTTTTCCC TTCGTCCAGA AGCGTTTCAC CTCCACCGAC GGCACCGTAC GTTACTTATT GCAGTTCTCC GACGGCCAAT CCGTCGAGAC CGTCTGGATG CCCGAGGGCG ACGGTGGCGA GCAAGGCGAC GGCTCCGAAG ACGGCCCCTC CTACGACCGA GCCACCATCT GCGTCTCCAG CCAGGTCGGC TGCGCCGTTG ATTGCCAGTT CTGCATGACC GCCTTGCTCG GCCTTCTCCG TAATCTTTCC GCCGGAGAAA TCGTTGGCCA AATCCTCGCC GTGCTCAAAG ATGAGAACGT GGATGTCGAG AAAAGCCGCA TCAATCTCGT CTTCATGGGC CAGGGCGAGC CCTTCCTGAA CTTCGACAAC TTCGTGAAGG CTGTCACGCT TCTTGCTGAA GCCGTTGGGA TTCCCGAATC CCGCATGACC GTCTCGACCT CCGGTATCGT CCCGCGCATC GTCGATTTCG GTCAGCTCGC GATCCGTCCC AAACTAGCAA TCTCGCTCAA CGCCTCCAAC GACGAATCCC GCCGCGAACT CATGCCGATC ACCAAGAAGT GGACGCTCGA AAAGCTGATG TCCGCGGCGC GCGAGTTCCC TCTCCGCAAC CGCGAGCGCA TGACCTTCGA GTACGTTCTC CTGGGTGGCG TCAACGACAG CGAGCAGAAT GCCCGCGAAG TGGTTCAACT GCTGCGCGGC CTCCGCGCCA AGGTAAATCT CATCGCCTGG AACCCCGGCC CCGAGATCCC CTTCTCCACG CCCGATCCCC AGCACGTGGA AGCCTTTCAA CAGATCCTCA TCGACGCCGG CATCCCCACA TTCATCCGCA AGCCGCGTGG ACGAGACATC TTCGCCGCCT GCGGACAGTT GAAGCGCACG GAACTCGTCA CTCTCAGCTA A
|
Protein sequence | MERLGQPAYR SRQLWQGLYR DRIASLDQFT TLPIPLREEL KSSGWAIAFP FVQKRFTSTD GTVRYLLQFS DGQSVETVWM PEGDGGEQGD GSEDGPSYDR ATICVSSQVG CAVDCQFCMT ALLGLLRNLS AGEIVGQILA VLKDENVDVE KSRINLVFMG QGEPFLNFDN FVKAVTLLAE AVGIPESRMT VSTSGIVPRI VDFGQLAIRP KLAISLNASN DESRRELMPI TKKWTLEKLM SAAREFPLRN RERMTFEYVL LGGVNDSEQN AREVVQLLRG LRAKVNLIAW NPGPEIPFST PDPQHVEAFQ QILIDAGIPT FIRKPRGRDI FAACGQLKRT ELVTLS
|
| |
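The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. The following is a minimal sketch, not the database's own method: it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons, and this gene starts with ATG). The function names are hypothetical, and only the first and last rows of the gene sequence are used as input.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: standard codon assignments, as used by translation table 11
# for every codon after the start codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence listed above.
gene_start = "ATGGAGCGTC TTGGCCAGCC CGCCTACCGC"
gene_end = "GAACTCGTCA CTCTCAGCTA A"

print(translate(gene_start))  # compare with the start of the protein sequence
print(translate(gene_end))    # compare with the end of the protein sequence
```

Although the gene lies on the minus strand, the sequence in this record begins with ATG and ends with TAA, so it appears to be given in coding orientation and no reverse complement is applied here. Passing the full gene sequence to `gc_fraction` would allow a comparison with the reported GC content.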