Gene Dshi_0857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_0857 
Symbol 
ID5710547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp870602 
End bp873940 
Gene Length3339 bp 
Protein Length1112 aa 
Translation table11 
GC content70% 
IMG OID641266767 
Producttransglutaminase domain protein 
Protein accessionYP_001532203 
Protein GI159043409 
COG category[E] Amino acid transport and metabolism
[S] Function unknown 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases
[COG4196] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.250645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.314962 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATCA CGGCCAGCAT CTATCACCTG ACCCATTACA AGTACGATCG CCCCGTGACA 
CTCGGGCCGC AGATCATCCG CCTGCGCCCG GCGCCGCACA GCCGGACCCG GGTGATCTCC
CACGCGCTGA AGGTGTCGCC CGAGGGGCAT TTCGTGAACC ACCAGCAGGA CCCCTATGGC
AACTGGATGG CGCGCTTCGT CTTCCCGGAG CCGGTGCGGG AGCTGAAGAT CGAGGTCGAC
CTGGTCGCGG ACATGTCGGT CTACAACCCG TTCGACTTCT TCGTGGAGGA CACCGCCGAG
GAATGGCCCT TTGCCTATCC CGAGGATCTC GCCGCGGATC TGAGCATCTA CAAGGCGCCC
GAGGCGGCGG GGCCGCGCCT GCGGGCCTAT CTCGACGCGC TGCCGCGGGA CACGTCGCGG
ACGGTGGATT TCCTCGTCGG GCTGAACGCG CGGATCGCGC GGGATATCGG CTATGTCATC
CGGATGGAGC CGGGGGTGCA GACCGCTGAA GAGACCCTGA CCGTGGGCAA GGGGTCGTGC
CGGGACAGCT CCTGGCTGCT GATCCAGATC CTGCGGCACC TGGGCTTCGC GGCGCGGTTC
TGTTCGGGCT ACCTGATCCA GTTGAAACCG GATCTCGAGG CGCTGGACGG CCCGTCGGGG
ACGGATCACG ATTTTACCGA CCTGCATGCC TGGGCCGAGG TGTACCTGCC CGGCGCGGGG
TGGATCGGGC TCGACCCCAC CAGCGGGCTT CTGGCGGGCG AGAGCCACAT CCCGCTGGCC
GCCACCCCCC ATTACCGCAA CGCCGCCCCG ATTGCCGGCG TGGCCAGTGC CGCCAATGTC
GAGTTCAACT TCGACATGCA GGTCACGCGC ACCGCCGAGC ATCCGCGCAT CACCAAGCCG
TTTTCCGATA CCGCGTGGGA GGCTCTGAAC GATCTGGGCC GCAAGGTCGA CGCCAAGCTC
GATGCCGGCG ACGTGCGGCT GACCATGGGG GGCGAGCCGA CCTTCGTGTC GATCGACGAT
TTCGAGAGCG CGGAATGGAA CACCGCTGCC GTGGGCCCGA CCAAGCGGGC GCTGGCCGAT
ACGCTGATCC GGCGGTTGCG CGACCGCTTT GCGCCGGGCG GGTTTCTGCA TTACGGGCAG
GGCAAGTGGT ATCCGGGGGA GACCCTGCCG CGCTGGACCT TCTCGCTCTA CTGGCGCGGG
GACGGCAAGC CGCTCTGGAC CAATGCGGAC CTGATCGCGC GGGAGAAGAC CGACGCCACC
CCCGGCCCGG AGGCGGCCGA ATCCCTGCTG TCGGGCATCG CGGCAGCCCT TGGCGTGCCC
GAGGACAACG TGACCCCCGC CTATGAGGAC CCGGCCGAGT GGCTGGTGCG CGAGGCGCAG
CTGCCCGAGA ACGTGAGCCC CGAGAATTCC AAGCTGAAAG ACCCCGAAGA GCGCCACCGC
ATCGCGCGGG TGTTCGAGCG CGGCCTGACC GAGCCGTCGG GCTACGTCCT GCCGGTGCAG
CGCTGGCAGG CGAAGGCGGG CGCGGGCTGG CGCTCGGAGC GCTGGTCGCT CCGGCGTGGC
AAGCTGTTCC TGACCCCGGG CGACAGCCCC GTGGGCTACC GGCTGCCCCT GGGCTCCCTG
CCGCATGTGC CGGCGGCCAG CTACCCCTAC ACCTATCCGG CCGACCCGTT TGCGGAGCAG
GCCCCCCTGC CCGAGCCCGA CCTCGTGGCG CGCGAAGCCG AGCGTCTGCG CGACCGGCGC
GAACGCGCCC CTGCCAGCAG CACAGCCTCC GAAGGCGGCG AGACCCAGAC CGCGCAGTAC
CTGACCGAGA TCGGCGGCCC GGTGCGCACC GCGATCAGCG TCGAGCCGCG CGAGGGCAAG
CTCTGCGTCT TCCTGCCGCC GGTGGAGTCG GCGGCGGATT ACCTCGAGTT GCTCGCCGCG
ACCGAAGCCG CCGCCACCGC CCAGGGCACG CCCGTCCATA TCGAAGGTTA CGCGCCGCCG
GGCGACAGCC GCCTCAACGT GATCCGCGTG GCCCCGGACC CGGGCGTGAT CGAGGTGAAC
GTCCATCCCG CGCACAGCTG GGACGACTGC GTCGCGATCA CCGAAGCGGT CTATGAAGAG
GCGCGGCAAG CCCGGCTTGG CGCGGACAAG TTCATGATCG ACGGGCGCCA TACCGGCACC
GGCGGCGGCA ACCATGTGGT CGTCGGCGGG GCCACCCCCG AGGACAGCCC CTTCCTGCGG
CGCCCGGACC TGCTGCGCTC GCTGATCCTG CACTGGAACC GGCATCCGAG CCTCTCTTAC
CTGTTCTCCG GCGTTTTCGT GGGCCCCACG AGCCAGGCGC CGCGCATCGA CGAGGCCCGG
CATGACAGCC TTTACGAGTT GGAAATCGCG CTCGCCAACA TCCCCGATCC GGACAAGGGC
GCGGCCCCGA CGCCCTGGCT GGTGGACCGG CTCTTGCGCA ACGCGCTGAT CGACGTGACG
GGCAACACGC ACCGGTCCGA GATCTGCATC GACAAGCTCT ATTCCCCCGA CGGCCCCACG
GGGCGACTGG GCCTCGTGGA GTTCCGCGGC TTCGAGATGC CGCCCAACCC GCGTATGTCG
CTGGCCCAGC AGGTGTTGAT CCGCGCCCTT ATCGCCCGGT TCTGGGAGGC GCCGCTGTCG
GGGCCGCTGA CCCGCTGGGG CACGCAGCTC CATGACCGGT TCATGCTGCC CCATTTCCTG
TGGGAGGATT TCACCGAGGT CTGCCGCGAT CTGGCCGCGC ACGGCCTGCC CGTGGACCCG
GCCTGGTTCA CCGCCCAGTC GGAGTTCCGC TTCCCGTTTT GCGGCGAGGT CACGTACGAG
GGCATGACGC TGGAGCTGCG CCAGGCGCTG GAGCCGTGGC ATGTGCTGGG CGAGACGGGG
GCGATCGGCG GCACGGTGCG CTACACCGAC AGTTCGGTGG AGCGCTTGCA GGTCAAGCTG
ACCGGGGCGG ACCCCGCGCG CTACACCGTG ACCGCCAATG GCCGCCGCGT GCCCCTGCGC
CCGACCGCCT GCCCCGAAAC CCGGGTGGCG GCGGTCCGCT ACAAGGCGTG GCAACCGGCC
GAGAGCCTGC ATCCGACCCT GCCCGTGCAG GCCCCGCTGA CCTTCGACAT CTTCGACAGC
TGGACCGGGC GGGCCCTGGG CGGCTGCCGC TACCACGTGG CCCATCCGGG CGGGCGCAAT
TACGACACCT TTCCGGTCAA CGGCAACGAG GCCGAGGCCC GCCGCCTTGC CCGGTTCGAA
GCGCACGGTC ATACTGCAGG CGACCACACG CCGCTTGCGG AAACCCCGCA CCCGGAATTC
CCGATGACAC TGGACCTGAG ACGGCCCCCG AATGTCTGA
 
Protein sequence
MSITASIYHL THYKYDRPVT LGPQIIRLRP APHSRTRVIS HALKVSPEGH FVNHQQDPYG 
NWMARFVFPE PVRELKIEVD LVADMSVYNP FDFFVEDTAE EWPFAYPEDL AADLSIYKAP
EAAGPRLRAY LDALPRDTSR TVDFLVGLNA RIARDIGYVI RMEPGVQTAE ETLTVGKGSC
RDSSWLLIQI LRHLGFAARF CSGYLIQLKP DLEALDGPSG TDHDFTDLHA WAEVYLPGAG
WIGLDPTSGL LAGESHIPLA ATPHYRNAAP IAGVASAANV EFNFDMQVTR TAEHPRITKP
FSDTAWEALN DLGRKVDAKL DAGDVRLTMG GEPTFVSIDD FESAEWNTAA VGPTKRALAD
TLIRRLRDRF APGGFLHYGQ GKWYPGETLP RWTFSLYWRG DGKPLWTNAD LIAREKTDAT
PGPEAAESLL SGIAAALGVP EDNVTPAYED PAEWLVREAQ LPENVSPENS KLKDPEERHR
IARVFERGLT EPSGYVLPVQ RWQAKAGAGW RSERWSLRRG KLFLTPGDSP VGYRLPLGSL
PHVPAASYPY TYPADPFAEQ APLPEPDLVA REAERLRDRR ERAPASSTAS EGGETQTAQY
LTEIGGPVRT AISVEPREGK LCVFLPPVES AADYLELLAA TEAAATAQGT PVHIEGYAPP
GDSRLNVIRV APDPGVIEVN VHPAHSWDDC VAITEAVYEE ARQARLGADK FMIDGRHTGT
GGGNHVVVGG ATPEDSPFLR RPDLLRSLIL HWNRHPSLSY LFSGVFVGPT SQAPRIDEAR
HDSLYELEIA LANIPDPDKG AAPTPWLVDR LLRNALIDVT GNTHRSEICI DKLYSPDGPT
GRLGLVEFRG FEMPPNPRMS LAQQVLIRAL IARFWEAPLS GPLTRWGTQL HDRFMLPHFL
WEDFTEVCRD LAAHGLPVDP AWFTAQSEFR FPFCGEVTYE GMTLELRQAL EPWHVLGETG
AIGGTVRYTD SSVERLQVKL TGADPARYTV TANGRRVPLR PTACPETRVA AVRYKAWQPA
ESLHPTLPVQ APLTFDIFDS WTGRALGGCR YHVAHPGGRN YDTFPVNGNE AEARRLARFE
AHGHTAGDHT PLAETPHPEF PMTLDLRRPP NV