Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0857 |
Symbol | |
ID | 5710547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 870602 |
End bp | 873940 |
Gene Length | 3339 bp |
Protein Length | 1112 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641266767 |
Product | transglutaminase domain protein |
Protein accession | YP_001532203 |
Protein GI | 159043409 |
COG category | [E] Amino acid transport and metabolism [S] Function unknown |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.250645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.314962 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCATCA CGGCCAGCAT CTATCACCTG ACCCATTACA AGTACGATCG CCCCGTGACA CTCGGGCCGC AGATCATCCG CCTGCGCCCG GCGCCGCACA GCCGGACCCG GGTGATCTCC CACGCGCTGA AGGTGTCGCC CGAGGGGCAT TTCGTGAACC ACCAGCAGGA CCCCTATGGC AACTGGATGG CGCGCTTCGT CTTCCCGGAG CCGGTGCGGG AGCTGAAGAT CGAGGTCGAC CTGGTCGCGG ACATGTCGGT CTACAACCCG TTCGACTTCT TCGTGGAGGA CACCGCCGAG GAATGGCCCT TTGCCTATCC CGAGGATCTC GCCGCGGATC TGAGCATCTA CAAGGCGCCC GAGGCGGCGG GGCCGCGCCT GCGGGCCTAT CTCGACGCGC TGCCGCGGGA CACGTCGCGG ACGGTGGATT TCCTCGTCGG GCTGAACGCG CGGATCGCGC GGGATATCGG CTATGTCATC CGGATGGAGC CGGGGGTGCA GACCGCTGAA GAGACCCTGA CCGTGGGCAA GGGGTCGTGC CGGGACAGCT CCTGGCTGCT GATCCAGATC CTGCGGCACC TGGGCTTCGC GGCGCGGTTC TGTTCGGGCT ACCTGATCCA GTTGAAACCG GATCTCGAGG CGCTGGACGG CCCGTCGGGG ACGGATCACG ATTTTACCGA CCTGCATGCC TGGGCCGAGG TGTACCTGCC CGGCGCGGGG TGGATCGGGC TCGACCCCAC CAGCGGGCTT CTGGCGGGCG AGAGCCACAT CCCGCTGGCC GCCACCCCCC ATTACCGCAA CGCCGCCCCG ATTGCCGGCG TGGCCAGTGC CGCCAATGTC GAGTTCAACT TCGACATGCA GGTCACGCGC ACCGCCGAGC ATCCGCGCAT CACCAAGCCG TTTTCCGATA CCGCGTGGGA GGCTCTGAAC GATCTGGGCC GCAAGGTCGA CGCCAAGCTC GATGCCGGCG ACGTGCGGCT GACCATGGGG GGCGAGCCGA CCTTCGTGTC GATCGACGAT TTCGAGAGCG CGGAATGGAA CACCGCTGCC GTGGGCCCGA CCAAGCGGGC GCTGGCCGAT ACGCTGATCC GGCGGTTGCG CGACCGCTTT GCGCCGGGCG GGTTTCTGCA TTACGGGCAG GGCAAGTGGT ATCCGGGGGA GACCCTGCCG CGCTGGACCT TCTCGCTCTA CTGGCGCGGG GACGGCAAGC CGCTCTGGAC CAATGCGGAC CTGATCGCGC GGGAGAAGAC CGACGCCACC CCCGGCCCGG AGGCGGCCGA ATCCCTGCTG TCGGGCATCG CGGCAGCCCT TGGCGTGCCC GAGGACAACG TGACCCCCGC CTATGAGGAC CCGGCCGAGT GGCTGGTGCG CGAGGCGCAG CTGCCCGAGA ACGTGAGCCC CGAGAATTCC AAGCTGAAAG ACCCCGAAGA GCGCCACCGC ATCGCGCGGG TGTTCGAGCG CGGCCTGACC GAGCCGTCGG GCTACGTCCT GCCGGTGCAG CGCTGGCAGG CGAAGGCGGG CGCGGGCTGG CGCTCGGAGC GCTGGTCGCT CCGGCGTGGC AAGCTGTTCC TGACCCCGGG CGACAGCCCC GTGGGCTACC GGCTGCCCCT GGGCTCCCTG CCGCATGTGC CGGCGGCCAG CTACCCCTAC ACCTATCCGG CCGACCCGTT TGCGGAGCAG GCCCCCCTGC CCGAGCCCGA CCTCGTGGCG CGCGAAGCCG AGCGTCTGCG CGACCGGCGC GAACGCGCCC CTGCCAGCAG CACAGCCTCC GAAGGCGGCG AGACCCAGAC CGCGCAGTAC CTGACCGAGA TCGGCGGCCC GGTGCGCACC GCGATCAGCG TCGAGCCGCG CGAGGGCAAG CTCTGCGTCT TCCTGCCGCC GGTGGAGTCG GCGGCGGATT ACCTCGAGTT GCTCGCCGCG ACCGAAGCCG CCGCCACCGC CCAGGGCACG CCCGTCCATA TCGAAGGTTA CGCGCCGCCG GGCGACAGCC GCCTCAACGT GATCCGCGTG GCCCCGGACC CGGGCGTGAT CGAGGTGAAC GTCCATCCCG CGCACAGCTG GGACGACTGC GTCGCGATCA CCGAAGCGGT CTATGAAGAG GCGCGGCAAG CCCGGCTTGG CGCGGACAAG TTCATGATCG ACGGGCGCCA TACCGGCACC GGCGGCGGCA ACCATGTGGT CGTCGGCGGG GCCACCCCCG AGGACAGCCC CTTCCTGCGG CGCCCGGACC TGCTGCGCTC GCTGATCCTG CACTGGAACC GGCATCCGAG CCTCTCTTAC CTGTTCTCCG GCGTTTTCGT GGGCCCCACG AGCCAGGCGC CGCGCATCGA CGAGGCCCGG CATGACAGCC TTTACGAGTT GGAAATCGCG CTCGCCAACA TCCCCGATCC GGACAAGGGC GCGGCCCCGA CGCCCTGGCT GGTGGACCGG CTCTTGCGCA ACGCGCTGAT CGACGTGACG GGCAACACGC ACCGGTCCGA GATCTGCATC GACAAGCTCT ATTCCCCCGA CGGCCCCACG GGGCGACTGG GCCTCGTGGA GTTCCGCGGC TTCGAGATGC CGCCCAACCC GCGTATGTCG CTGGCCCAGC AGGTGTTGAT CCGCGCCCTT ATCGCCCGGT TCTGGGAGGC GCCGCTGTCG GGGCCGCTGA CCCGCTGGGG CACGCAGCTC CATGACCGGT TCATGCTGCC CCATTTCCTG TGGGAGGATT TCACCGAGGT CTGCCGCGAT CTGGCCGCGC ACGGCCTGCC CGTGGACCCG GCCTGGTTCA CCGCCCAGTC GGAGTTCCGC TTCCCGTTTT GCGGCGAGGT CACGTACGAG GGCATGACGC TGGAGCTGCG CCAGGCGCTG GAGCCGTGGC ATGTGCTGGG CGAGACGGGG GCGATCGGCG GCACGGTGCG CTACACCGAC AGTTCGGTGG AGCGCTTGCA GGTCAAGCTG ACCGGGGCGG ACCCCGCGCG CTACACCGTG ACCGCCAATG GCCGCCGCGT GCCCCTGCGC CCGACCGCCT GCCCCGAAAC CCGGGTGGCG GCGGTCCGCT ACAAGGCGTG GCAACCGGCC GAGAGCCTGC ATCCGACCCT GCCCGTGCAG GCCCCGCTGA CCTTCGACAT CTTCGACAGC TGGACCGGGC GGGCCCTGGG CGGCTGCCGC TACCACGTGG CCCATCCGGG CGGGCGCAAT TACGACACCT TTCCGGTCAA CGGCAACGAG GCCGAGGCCC GCCGCCTTGC CCGGTTCGAA GCGCACGGTC ATACTGCAGG CGACCACACG CCGCTTGCGG AAACCCCGCA CCCGGAATTC CCGATGACAC TGGACCTGAG ACGGCCCCCG AATGTCTGA
|
Protein sequence | MSITASIYHL THYKYDRPVT LGPQIIRLRP APHSRTRVIS HALKVSPEGH FVNHQQDPYG NWMARFVFPE PVRELKIEVD LVADMSVYNP FDFFVEDTAE EWPFAYPEDL AADLSIYKAP EAAGPRLRAY LDALPRDTSR TVDFLVGLNA RIARDIGYVI RMEPGVQTAE ETLTVGKGSC RDSSWLLIQI LRHLGFAARF CSGYLIQLKP DLEALDGPSG TDHDFTDLHA WAEVYLPGAG WIGLDPTSGL LAGESHIPLA ATPHYRNAAP IAGVASAANV EFNFDMQVTR TAEHPRITKP FSDTAWEALN DLGRKVDAKL DAGDVRLTMG GEPTFVSIDD FESAEWNTAA VGPTKRALAD TLIRRLRDRF APGGFLHYGQ GKWYPGETLP RWTFSLYWRG DGKPLWTNAD LIAREKTDAT PGPEAAESLL SGIAAALGVP EDNVTPAYED PAEWLVREAQ LPENVSPENS KLKDPEERHR IARVFERGLT EPSGYVLPVQ RWQAKAGAGW RSERWSLRRG KLFLTPGDSP VGYRLPLGSL PHVPAASYPY TYPADPFAEQ APLPEPDLVA REAERLRDRR ERAPASSTAS EGGETQTAQY LTEIGGPVRT AISVEPREGK LCVFLPPVES AADYLELLAA TEAAATAQGT PVHIEGYAPP GDSRLNVIRV APDPGVIEVN VHPAHSWDDC VAITEAVYEE ARQARLGADK FMIDGRHTGT GGGNHVVVGG ATPEDSPFLR RPDLLRSLIL HWNRHPSLSY LFSGVFVGPT SQAPRIDEAR HDSLYELEIA LANIPDPDKG AAPTPWLVDR LLRNALIDVT GNTHRSEICI DKLYSPDGPT GRLGLVEFRG FEMPPNPRMS LAQQVLIRAL IARFWEAPLS GPLTRWGTQL HDRFMLPHFL WEDFTEVCRD LAAHGLPVDP AWFTAQSEFR FPFCGEVTYE GMTLELRQAL EPWHVLGETG AIGGTVRYTD SSVERLQVKL TGADPARYTV TANGRRVPLR PTACPETRVA AVRYKAWQPA ESLHPTLPVQ APLTFDIFDS WTGRALGGCR YHVAHPGGRN YDTFPVNGNE AEARRLARFE AHGHTAGDHT PLAETPHPEF PMTLDLRRPP NV
|
| |