Gene Dshi_1334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1334 
Symbol 
ID5711892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1386187 
End bp1387725 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content58% 
IMG OID641267246 
Producttetratricopeptide TPR_2 repeat protein 
Protein accessionYP_001532677 
Protein GI159043883 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.278248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0416734 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGGTT CCTGGCTTTC AAAGGTATTG TTTCCTATAA CGATCTGCAC GCTGATTGGT 
GGGCTTTCAT TGCAGGATGC CTTTGCCCGG TCATTACAGA TTGACGATCT GACCAGGGAG
CGGGAACAGA GTAGCATCCT GCGCAGTCTG CAACTTGACG CCTTCAAGGA TCCGGAGCTT
CTCCAACTTT TCAAAGACGC CTCCAGCGCC TTGCGGGAGG AGCGGTACGA GGACGCAGAG
GCCGCAGCCC TCAGCCTGAC AGAAAAGGCA CCAGACGCGC CACAAGGCTG GCACATGTTG
GGAATGGCCC GCGTGAAAAA CGGGTCCACT TCCGGTGCGC TTCACGCCCT AGACGCCGCG
GCAGCGCGTT ACAAGAACAA CTCGGACCCT TTGGTCATCA AGGGAGATGT GCTGCGCGGG
CTGGGTCGGT TTGACGATGC CGAGGCCGCT TATCGTCGCG CCGTCGAGAT TAATCCGGAC
GATCTGCGCG GGGTCGAGGG GCTTGGTGCG ATGTTAGAGG CTGGGGGCAA AGCGGACGAA
GCGATCACCG TCTATCAATC TGCTCTTGCA ACGAACCCTT CGGATATCGC CTTCGCGTTG
AATACCGCGC GCCTCCAGGT GGCACTGGAC GATCTTTCCG CCGCACGGGA GACCCTAAAG
AGGTTCGAGG CCTCAAATCC CGAAAACTCC CAGGTCAAAA TTGCACTCGG GCGAGTTGAA
TTCCTTTCTG GAAACAATGA AGCCGCGACG GCCTATTTCG ATCAGGCGAT CCGCATCGCG
CCCGAGAATT CGACCCAGTA TCTGCTAAAG GCACGCGCCC AGCTGAATGC AACCGAGTTT
GCAGCGGCAG AAGAAACATT GCGCGGCGCG CGTGAGATCT TTCCGGATGA TCCGCAAGTG
CCCTTTGAAC TCGCCAGTCT GTACGGAGTC ACGCGCGACT ACGCCCGGGC TGCCGAAGTG
TTCGAGGCAG GCGCCGAAAG GTGGCCTGAT ATCATGGGAT TCCACTCTGG ACTCCTCAGG
GCCTCTTATC GCCTGAATGA TTTTGATGCG GCCCTGTCCG CTGCAAGAGT GCTTTCAAGC
CAGCCCGGCG CCTCTGCCGT CGACCACATC TGGCATGCAC TCGTCCTCGA GAGGCTCGAG
CGGGTCGATA GCGCGATCAC CTCGTATGAT ACTGCACTGG AGCTTGATCC AACAAACTGG
CTTGCCGCCA ACAATCTGGC AAACCTTTTG TTTGACCGTG CCCCAGACAG GGCATTGGAG
CTTGCACGGC TGGCTCATCG GACAGCGTCC GAAAACATCT CGGTAAACAG GACACTCGCC
TGGGCAGAAT TTTCCGCCGG CAACACCGAG GCCGCCCTCG CGCTCTACGA CTCTCTGACG
CCAGAAGCTT CCGATGATCC GATCCAACTC TTCCGGCACG GGCAGGTTCT GATTGAAGCC
GAAGAGGCCG AAGCAGGCCG CATCCTCATC CAAAAAGCTC TTGCACTTGA TGCTGAATTC
AGGGGAGCAG AAGATGCTCG GAAGCTGCTC GCGGAATGA
 
Protein sequence
MFGSWLSKVL FPITICTLIG GLSLQDAFAR SLQIDDLTRE REQSSILRSL QLDAFKDPEL 
LQLFKDASSA LREERYEDAE AAALSLTEKA PDAPQGWHML GMARVKNGST SGALHALDAA
AARYKNNSDP LVIKGDVLRG LGRFDDAEAA YRRAVEINPD DLRGVEGLGA MLEAGGKADE
AITVYQSALA TNPSDIAFAL NTARLQVALD DLSAARETLK RFEASNPENS QVKIALGRVE
FLSGNNEAAT AYFDQAIRIA PENSTQYLLK ARAQLNATEF AAAEETLRGA REIFPDDPQV
PFELASLYGV TRDYARAAEV FEAGAERWPD IMGFHSGLLR ASYRLNDFDA ALSAARVLSS
QPGASAVDHI WHALVLERLE RVDSAITSYD TALELDPTNW LAANNLANLL FDRAPDRALE
LARLAHRTAS ENISVNRTLA WAEFSAGNTE AALALYDSLT PEASDDPIQL FRHGQVLIEA
EEAEAGRILI QKALALDAEF RGAEDARKLL AE