Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0855 |
Symbol | |
ID | 5710545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 867282 |
End bp | 868169 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641266765 |
Product | transglutaminase domain protein |
Protein accession | YP_001532201 |
Protein GI | 159043407 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.21199 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.166966 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCTATG ACATCCGGCT GAAGCTCGAT CACACCTATG CGGGCCGGGC GGGCGGCGCG CGGCACCTGA ACCGGGTGAT GCCCGCGGCC CTGCCCGGAC GACAGATCGT GCAGACCAGC CTGCTGACCT GCGACCCCGT GGCGGACACG CGGAACGCTT TCGTGGATTT CTTCGGCAAC CGGGCGACGG TGTTCTACCA CGACGCGCCC CATGACAAGA TGCGGATCAC CATGCAGGCG CGTGTGCGCT GCCTGACGCG CGAGCCGCAG CTCGACATCT CCCCCGACCG GAACCGGCTC GCACAGGAAA TCGCCGAGGT TCGCCATCTC GACGGGCAGG CCCCGCATCA CTTCCTCGGC CCCTCGCCGC GGGTGCCGGC CCGGTCCGAG ATCGCGGCCT TTGCGGCCGA CATCACCGCG CCGGTCACGG GGGGGGTGCT GGCGCAGATG CAGGCGCTGG GCCACGCGCT CCATGCCGAG ATGACTTTCG ATGCGACAGC GACCACGGTG GACACGCCGA TGATCGACGC CTTCGTCAAC CGCCACGGCG TCTGCCAGGA TTACAGTCAT ATCTTCATCG CGGCCCTGCG CAGCCTGGCG ATCCCCGCAC GCTATGTCAG CGGCTTCCTG CGGACCCTGC CCCCGCCGGG TCAGCCCCGG CTGGAAGGCG CGGACGCGAT GCATGCCTGG GTCAGCGCCT GGTGCGGCTC GGAACTGGGC TGGGTCGAGT ACGACCCGAC AAACGATATC ACCGTGCAGA CGGATCATAT CGTGGTGGCC TATGGCCGCG ATTATTCGGA TGTCTCGCCG ATCAAGGGGG TCTTGCGCAC CGCCGCCAAG GGCACGAGTG CCCAATCGGT GGATGTGGCA CCGGTCGGCG CGGTCTGA
|
Protein sequence | MLYDIRLKLD HTYAGRAGGA RHLNRVMPAA LPGRQIVQTS LLTCDPVADT RNAFVDFFGN RATVFYHDAP HDKMRITMQA RVRCLTREPQ LDISPDRNRL AQEIAEVRHL DGQAPHHFLG PSPRVPARSE IAAFAADITA PVTGGVLAQM QALGHALHAE MTFDATATTV DTPMIDAFVN RHGVCQDYSH IFIAALRSLA IPARYVSGFL RTLPPPGQPR LEGADAMHAW VSAWCGSELG WVEYDPTNDI TVQTDHIVVA YGRDYSDVSP IKGVLRTAAK GTSAQSVDVA PVGAV
|
| |