Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3470 |
Symbol | |
ID | 4069046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4092990 |
End bp | 4094318 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637985492 |
Product | putative thiol-disulfide isomerase or thioredoxin |
Protein accession | YP_592545 |
Protein GI | 94970497 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0652543 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0361223 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGGCG CTAGCGCGGT GGCGCAACAG GGGCCTACTT TTTACAAGGA CGTCTTGCCG CTGCTGCAGG CGCATTGCCA GTCGTGCCAT CGGCGTGGCG AGATCGCGCC TATGGCGTTC ACCACTTATG AAGAGGTGAA GCCCTACGCG GACGCGATGA AGGCTGCGGT GGCGTCGAAG CGCATGCCGC CCTGGTTCGC CGACCCGCGC TGCGGGCGGT TCTCCAACGA TCCCTCGCTC AGCGATAGAC AGATCCAGAC GATTGTGAAG TGGGCCGAAG CTCACGCCCC GCGCGGAAAT CCGAAGGACG CCCCACCCGC GCCACGCTAC GCAGAAGGCT GGCTCATCCC GCAGCCCGAT GCAGTGTTCA CCATGCCGGT TCCGGTCGAT CTGCCCGCGC ACGGCGACGT GGAGTACACG TACGAGATCG TCCCGACGCA CTTTAGCGAG GGGCGGTGGG TGCAGATGTC GGAGATACGG CCGAGCTCGC GCGAGCATGT GCACCATGCG GTGGTGTACA TCCGGCCGCC GGCGTCGACC TGGCTACGCG ATGCGCCGCT GGGGAAGCCC TTTACCGCTG GCGACATGGT GGACCCAAAG GCGCATGCCG AGGCGTTGGC GACTACTTCG GACATGCTAC TTGTCTATGC GCCGGGGAGT GAGCCGGATA GGTGGCCGGA TGGGATGGCG AAGTATTTGC CTGCGGGAAG CGATCTCATT TTCCAGATGC ACTATACGAC AAATGGGCAC GGGACTACCG ATCAGACTTC GATTGGCGTA GTATTTTCCA AAGGAAAGCC GATGCAGAGA GTGCTCACAT TGCAGCTCAC GAATCATAGT TTCGTCATTC CTCCGCGAAC AGATAACTAT GAAGTCTATG AGCGCGGTAC TCTTCCGAAT GATGCGACTC TATTAAGTTT CTTTCCGCAC ATGCACTTAA GAGGGAAGAA GTTTGAATAT GACATCGTTA AACCGAATGG CGAAATAGAA CCGCTGCTGA AGGTGAACTA TCATTTCCAC TGGCAGTTAA GTTACAAGTT GGCGCAGCCG ATTCCGCTAA AGGCTGGAAC GGTGCTGCAG GCTGTCGCGA CCTTCGACAA CAGCGACGGC AACATGCACA ATCCCGATCC GTCGCAGTAC GTGAAGTGGG GCGGGCAGAC GTACGAAGAG ATGATGGTGG GGTTCTTCGA CGTCGCGGTG CCGGCTACGA CCGACAAAGA AGAGTTTTTT GAGAGGAAGA AGGGAAGTCC CCACCCTAGT GCTTCGCAAT TGAGTGGGGA AACCCTGGTT ATTTCCTGGA TTGTTCGAGC TTGGTCAGGC GCGCGTTAA
|
Protein sequence | MLGASAVAQQ GPTFYKDVLP LLQAHCQSCH RRGEIAPMAF TTYEEVKPYA DAMKAAVASK RMPPWFADPR CGRFSNDPSL SDRQIQTIVK WAEAHAPRGN PKDAPPAPRY AEGWLIPQPD AVFTMPVPVD LPAHGDVEYT YEIVPTHFSE GRWVQMSEIR PSSREHVHHA VVYIRPPAST WLRDAPLGKP FTAGDMVDPK AHAEALATTS DMLLVYAPGS EPDRWPDGMA KYLPAGSDLI FQMHYTTNGH GTTDQTSIGV VFSKGKPMQR VLTLQLTNHS FVIPPRTDNY EVYERGTLPN DATLLSFFPH MHLRGKKFEY DIVKPNGEIE PLLKVNYHFH WQLSYKLAQP IPLKAGTVLQ AVATFDNSDG NMHNPDPSQY VKWGGQTYEE MMVGFFDVAV PATTDKEEFF ERKKGSPHPS ASQLSGETLV ISWIVRAWSG AR
|
| |