Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0434 |
Symbol | |
ID | 4069660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 512966 |
End bp | 514345 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637982438 |
Product | hypothetical protein |
Protein accession | YP_589513 |
Protein GI | 94967465 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.342382 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0528043 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGATAG ATGTGATCGG GCGTTTTCAG CGCCTGCGCT CGGCAATCGC TTCCTACCTC GCCATGGGGA TGCTGCTCGG CATCTTCCTT GCGGCAAGCA GCCAATCCAG TTTTGCGATC GCGGCTTTCG CACGCAAGTA CGGATTGCCG TGTTCCGCGT GTCACGATGC ATGGCCCAAA CTCAATAATT TCGGCCAGAC GTTCAAGGAC AACGGATACC AGTTGATGAA CGACCGCGAC GCTCCGATCT GGCAGAATCC GAGCTACTGG CCGGTGGCGT TCCGTATCAC ACCGAATTGG GACCTCGAGA ACACTGGCAG GGTGGCAACC GACCAGGCGC CCAACGGACA GTCGGTCACA ACTCACGGTT TCAATCTAAG CGGCCTCGAC ATTCTCACCG CGGGCACGCT CAACAAAGAC ATCTCCTTCC TGCTAGTCCC TTCTGCCGAT GAAACGGGCG CCTTCCATTT CGAATCAGCT TGGGTACGGT TCGACAACCT GTTCCACAGC TCGTGGGCGA ATTTGAAGGT CGGCAAGTTC GAGCTCGACA ACCTGATCTC GGAAAAACGA GGTCTCACCC TCTCAGCGAA CGGCGGCTCT TACCAGCTTT ACCACTTCCT GCCGTTTGGC GACTCGAACC CCTACCAGTT CGGTATCGGC GACAACCAGA TGGGAATCGA GTTCGCCGGC CACTCGAAGA ACGACTACAC GCGTTTCTCG GCCTCGCTGC TCAGCAGTAA TGACGGCAAC GTGGACGTTC CTTACGGCAG CACGTATGAC GGGAGCTTCA CCTTCAGCAC CGCCTTCAAC GCGGGCAGTT TAGGTCTACA GCGAATTGGG GCGCAGGCTT ACATCGGGCA GGCCCCGACG TACTACCTCA CATCCGGTGG CGGCGATATC CCAGGCACCG GCAAGGGCAA TCACGGCTTC AGCCGTGAAG CGGTTTTTGC CCTGCTCTAC TTCGGCAAAT TCGACGTTAC GCCGCTGTTC GGGCACGGTA GCGAGAGCGC ATACCTGGCG AATTACGTTT CGACCGGCGG CATGCCCCCG GTATTGCCAG CGGGCTCGCG CGATCCATCG TGGAACAACT TCATGGTGGA ATCTCACTAC CTGTTCAACC CGCAGTTCAT CATGACGTAT CGCTACGACG CCATCTACAT GACGCAGCAG GCGAGCACCG CGTATCCGGA TGATGCCGGC AACACCACGG CGAATACGAT TGCTGCCCGC TATTACCCGT TCATGCATAG CCGCGCGGGT TTCGCGCTGC ATGGCGAGTT CTCGCACCTG AACCAGAAAC ACGTTTTCTC AACCACAACC GGAACGTTGC AGGACGTTAG CTACTACAGC GTCTTTGGCG GCATGGACTT CATTTTCTAA
|
Protein sequence | MRIDVIGRFQ RLRSAIASYL AMGMLLGIFL AASSQSSFAI AAFARKYGLP CSACHDAWPK LNNFGQTFKD NGYQLMNDRD APIWQNPSYW PVAFRITPNW DLENTGRVAT DQAPNGQSVT THGFNLSGLD ILTAGTLNKD ISFLLVPSAD ETGAFHFESA WVRFDNLFHS SWANLKVGKF ELDNLISEKR GLTLSANGGS YQLYHFLPFG DSNPYQFGIG DNQMGIEFAG HSKNDYTRFS ASLLSSNDGN VDVPYGSTYD GSFTFSTAFN AGSLGLQRIG AQAYIGQAPT YYLTSGGGDI PGTGKGNHGF SREAVFALLY FGKFDVTPLF GHGSESAYLA NYVSTGGMPP VLPAGSRDPS WNNFMVESHY LFNPQFIMTY RYDAIYMTQQ ASTAYPDDAG NTTANTIAAR YYPFMHSRAG FALHGEFSHL NQKHVFSTTT GTLQDVSYYS VFGGMDFIF
|
| |