Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2813 |
Symbol | |
ID | 4071816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3337543 |
End bp | 3338379 |
Gene Length | 837 bp |
Protein Length | 278 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984831 |
Product | Cof protein |
Protein accession | YP_591888 |
Protein GI | 94969840 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATG CGCCGATCAA GCTGCTGGTG TCGGATGTGG ACGGCACATT GCTAGACCCG AAGAAGCAGC TTACCGAACC GGTGCGGCAG GCAGTGTTGC GGGTCAAGCA GGCCGGGCTG AAGTTCACCA TTGTGTCGGC TCGTCCGCCG CTCGGCACGA CTTTCCTCAT TGATGCGCTC GACATCACCG AGCCGATTGC GTGCTTCAAC GGCGCGCTGA CCTGCACACC ACAGTACGAA ATCCTGCATC AGATTCTGCT GGCCGCAGAC GTCGCGACGC AGGTTGCACA AACGATCCTC GAACACGGAC TCGACCTGTG GATGTTCCGT GGCGCGGAAT GGTGGGTGAG CAAGCTGAAC GGGCCGCATA CCGAGGGACA CATCAAGTTG ATGCGGCACG AGCCGCGCTA CCTCGGCGAG GATGTCACAT TGTGCGCGCG GGCGAACAAA CTGGTTGGCG TGAGCGACGA CCACGAGGCG GTAAAGCGCT GTGAAAAAGA TGTGATCGCG AAGTGCGGCG ACCGCGTATC GGCAACGAGA TCCTCCGACT ACTACCTCGA TGTCACGGAC CATGATGCGA ACAAGGGAAA CGCTGTGGTG CAACTCGCGA AGCTGATGGA TATTCCTCTG GAGAACGTGG CGACGATCGG CGATATGCCA ACCGACATGT TCATGTTCGC CAAGAGCGGT ATGAGCATCG CGATGGGGAA TGCGAGCGAT GAGGTCAAAG CGGCAGCCAC ATTCACGACG ACAAGCAATG CGGAAGGTGG CTACGTGAAG GCGATGGATG AGATCGTGCT GCCACGCGTC GCGCAACGTG CAGGAGCGCA AGGATGA
|
Protein sequence | MSDAPIKLLV SDVDGTLLDP KKQLTEPVRQ AVLRVKQAGL KFTIVSARPP LGTTFLIDAL DITEPIACFN GALTCTPQYE ILHQILLAAD VATQVAQTIL EHGLDLWMFR GAEWWVSKLN GPHTEGHIKL MRHEPRYLGE DVTLCARANK LVGVSDDHEA VKRCEKDVIA KCGDRVSATR SSDYYLDVTD HDANKGNAVV QLAKLMDIPL ENVATIGDMP TDMFMFAKSG MSIAMGNASD EVKAAATFTT TSNAEGGYVK AMDEIVLPRV AQRAGAQG
|
| |