Gene Dshi_4071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_4071 
Symbol 
ID5714623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009958 
Strand
Start bp3168 
End bp6497 
Gene Length3330 bp 
Protein Length1109 aa 
Translation table11 
GC content56% 
IMG OID641276980 
Productparallel beta-helix repeat-containing protein 
Protein accessionYP_001542276 
Protein GI159046607 
COG category 
COG ID 
TIGRFAM ID[TIGR01965] VCBS repeat 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.041867 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGGTT CCTCCTACTT CAAGGATTAC ACTGCCCATC AAAACGCCAC CATCCTTGTT 
TCCAGCACAG AAGAACTCAA TGCCGCTGTG GCGGAGCTTG CTGGCACGAC CGGGGGTACG
GTTCTGCTCT CGGGTGAGGC TGGTCCCTAC CGATTGGATG CGCGCAACCT CGGCGATAGT
GCCGAGAGCG CGGTTCTGAT CACCTCCCAA GACTCGAACA ACCCCGCGCA ATTAAAGCAG
GTCTATATCG ACAACAGTGT CTATCTCAGC CTGACCGATG TGACGATCGA TAGCGGGGAT
TACGGATTCG AGCGGTCAGG CCATCTCCAT GACATCTATG TAAAGTCCGG AGCGCATCTG
CAATTCGTCG ATCTGACCAT GACCAGCACT GCCGAAGCCC CCCTCGGCTT CGATGACGAC
ACCGTGAAGG CCGAGGATGC CGTCTATCTG CGCAATGCCT CGGACGTTCT ATTCGCCGAC
AGCACGATCT CGAATTACTA CCACGGTATC AGTTTGGCAG GGGTACGCGA TACGATCGTA
ACCGGCAACG ACATCTCCGC CTTGCAGGGG GACGGTATTC GCGGCGGCGG CGTGAACAAT
GTTCTGGTCT CGGACAATCA CATCCATGAT TTCTTGGGTG CGACCAACGA GTACAACCAC
CCAGACATGA TCCAGTTCTG GACCGGGTTC CCAGAGGGGA TGACCAACAC CGACCTGACG
ATTACCGGTA ACCTGCTCAA TGCGGGCGAA GGCTCCGCTG CTCAGGGCAT TCTCGTACAG
AACGAGGCCT TCGATGACGC GGGAGACCCT TTGGAGGGAG TTTATGGTCA AAACCTGACG
ATCACGGACA ATGTCATCCA TTCGGGCATG TCTAATGGCA TCGGGATCAT CGGATACCAA
GGGGTCGAGG TTGCCAACAA TTCGGTGCTT TGGAACGAGG GCGCCGTGCT GTCCCAGACG
GCTGATGCCG TACCGGCCTC CAACCCTCCC TGGATCTTGG TACGCAACAG CTTGGAAGTC
GAGACCGCTG GCAATGTCGC TCACTTGGTC CGGATCGAAA CCGAGGACCA GGCCGAGACC
AATTACTTTC TGGACTACGA TAATCCGAGT GCGCCGAATT ATGCCTATGC CCATGTGATC
GGCCTTGCCG CCGATGGCCA GACGGACGTC TACGACTTGC AATTCCGGCC CGATAGCCCA
CTCAACGGGG TCTCTGGCGC TGAGGCCAGC ACTTGGACGC CCAGCGAGGC CCCGGTGAAC
GCCGTGATCT CCCAGCAGCC CAGTCTCGAG GACCGCGCAG CGGTGGAGTT GTCGGCCCGC
TATTCGACTG TGGATGGACA ACTGGTTGAT CCCGAGACTG CACAAGTCTT CTGGACCTTG
GCCGATGGCA CCGTCCTGGA GGGCCTGGAG GTCACGGTAA CTTTCGACAC GCCGGGGCTG
CACGAGGTCA CGCTCGATGT CCTTGCGCCT GATGGCAGTT CCGACAGCGC CACGAGATCC
GTCGACATCC GCGGAAACGG CCTGTTCGAG GCGGATTTCG CCCGTTCCGA AACCTTTGGC
GACGGTGTCG AGATTGTCGA TCCCAATGGC GAAGCGTTCT CCAAGAAGGG CGTGTTCACT
CTGGATGGCG ACAGTGTTTT CGAGGTGACC CGCGGAAACC CAGAGTTCGA GACGCTTCAT
ACCCTGTCCA TGGACGTCTC ATTCAAACCG GACGCGTTCG AAAAAAACGG CTGGCTGGTG
AAATGGCTCA AGGCGGTCGA TCTGCGGGTG ACAGACGAGA AGGGGCTATG GCTCAAAATC
GAGACCGATG CCGACGTTTA CGAGATCCGC ACCGAGGGTA ACCTTCTTGA GAAGGGCGAA
TGGGCTGACA TCGCGGTCGA TTTCGACAGC CATGCGGGGC AGATGCGCTT GTCTCTGGAC
GGCGAGGAAC TCGGCCGGAC CGACGTCGAA GGCACCATCT CGGGGTCGCA ATACGACCTC
ACCTTGGGAG AGGGCCGGGG GCGCAGCGCC GAAGGGGAGA TCGATTACTT CTCGATGACC
ACCCCGCCTG CAGGATCGGT GCTTGAGATC GGAGACCGGT TTCCGGAGGA ACCAGAAGAG
CCCGAAGACG AAACACCCCC GCCGGTGGGT GTGGACGATA GCTTCATCAC GGACGAGGAT
GTAAAACTCT CGGGCGATGT CTTGGATAAC GATATCAATG GCCAAGGCGC TATCCTAACG
GTGAGCCTAC TCAGCGATGT GTCCAACGGC ATACTACTGC TGAACAACGA CGGTACCTTT
GATTACACCG CCGCACCGGA CTTTAACGGC TCCGATAGCT TTACCTACAC AGTGTCGGAT
GGGTCGTTCA CTGATACGGC CACGGTCTCG ATCACAGTCA ACTATGTGAA CGACAATCCT
GTCCTGACCG CAGACACGGT GACGACGAAG GAGGACATCT CGGTTAATAT CGACGTGGTG
GTAAATGACA CAGATGTCGA TGGCGACACG TTGAGCGTGT CGGTGGTCGG CGCGGCTTCG
AACGGAACTA CATCAATAAA CCTCGACGGA ACAGTATCTT ATACTCCTAA TCTCCATTTC
TTCGGCACCG ACAGCTTTAT TTACACAGCC TACGACGGAC ATGGCGGCGT CAGGTCGTCA
TCGGTTACCG TCACTGTCGA TGCGGTTAAC GACGCGCCGG TGATCGAAAT CTCCAACTTC
GCGGTCGACG AAGAACAGAC AACTGTCGGT AAGATAATCG CTTCTGACGT CGAAGACCAC
GATTTAGACT TTGCCATCTC GGGCGGCGCG GATGCCGACC TCTTCGACAT TACGGAAGAC
GGCGAACTCA GCTTCCTCGT TGCTCCGGAT TTTGAAGCGC CATTAGATGT TGGTGGTACC
TCTGGCGACA ATATCTACGA AGTCGAGGTA TCGGCCTTTG ACACTAGGGG TGCGTTGAGC
AGTGCACTCT TCAACGTCGC AGTAAAGGAT GTAGATGAGG GCGTCAGGCC GATCGAGGTC
CTCGGGACGG GGGACAACGA CGTGCTGACC GGAACCGTGG CAGACGAGCG CCTCATCTCA
CATGGCGGCA GAATAGACCG GATGACCGGC GGTGGGGGTG CAGATGAATT TGTCTTCGGC
CCTGAACTCG ACAACAACCA ACGCGAACTC GATATTATCT ACGACTACGA CACCGATGAT
ACGATCATAC TGGGCTCTGA TGACTATCGA TTAATTTCTC TCGGAAATAG TGTATTGATC
CATCACAACG ACGACGGCGA CTTGATCTAT GTGCTAGGAA CAGAAGCAGA GAGTTTAATG
ATCGAAATTG AAAACTCCGC CGTTCTATAA
 
Protein sequence
MSGSSYFKDY TAHQNATILV SSTEELNAAV AELAGTTGGT VLLSGEAGPY RLDARNLGDS 
AESAVLITSQ DSNNPAQLKQ VYIDNSVYLS LTDVTIDSGD YGFERSGHLH DIYVKSGAHL
QFVDLTMTST AEAPLGFDDD TVKAEDAVYL RNASDVLFAD STISNYYHGI SLAGVRDTIV
TGNDISALQG DGIRGGGVNN VLVSDNHIHD FLGATNEYNH PDMIQFWTGF PEGMTNTDLT
ITGNLLNAGE GSAAQGILVQ NEAFDDAGDP LEGVYGQNLT ITDNVIHSGM SNGIGIIGYQ
GVEVANNSVL WNEGAVLSQT ADAVPASNPP WILVRNSLEV ETAGNVAHLV RIETEDQAET
NYFLDYDNPS APNYAYAHVI GLAADGQTDV YDLQFRPDSP LNGVSGAEAS TWTPSEAPVN
AVISQQPSLE DRAAVELSAR YSTVDGQLVD PETAQVFWTL ADGTVLEGLE VTVTFDTPGL
HEVTLDVLAP DGSSDSATRS VDIRGNGLFE ADFARSETFG DGVEIVDPNG EAFSKKGVFT
LDGDSVFEVT RGNPEFETLH TLSMDVSFKP DAFEKNGWLV KWLKAVDLRV TDEKGLWLKI
ETDADVYEIR TEGNLLEKGE WADIAVDFDS HAGQMRLSLD GEELGRTDVE GTISGSQYDL
TLGEGRGRSA EGEIDYFSMT TPPAGSVLEI GDRFPEEPEE PEDETPPPVG VDDSFITDED
VKLSGDVLDN DINGQGAILT VSLLSDVSNG ILLLNNDGTF DYTAAPDFNG SDSFTYTVSD
GSFTDTATVS ITVNYVNDNP VLTADTVTTK EDISVNIDVV VNDTDVDGDT LSVSVVGAAS
NGTTSINLDG TVSYTPNLHF FGTDSFIYTA YDGHGGVRSS SVTVTVDAVN DAPVIEISNF
AVDEEQTTVG KIIASDVEDH DLDFAISGGA DADLFDITED GELSFLVAPD FEAPLDVGGT
SGDNIYEVEV SAFDTRGALS SALFNVAVKD VDEGVRPIEV LGTGDNDVLT GTVADERLIS
HGGRIDRMTG GGGADEFVFG PELDNNQREL DIIYDYDTDD TIILGSDDYR LISLGNSVLI
HHNDDGDLIY VLGTEAESLM IEIENSAVL