Gene Dshi_3385 details


Gene Information

Locus tag: Dshi_3385
Symbol:
ID: 5712443
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 3561914
End bp: 3564349
Gene Length: 2436 bp
Protein Length: 811 aa
Translation table: 11
GC content: 60%
IMG OID: 641269314
Product: tetratricopeptide
Protein accession: YP_001534719
Protein GI: 159045925
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG3071] Uncharacterized enzyme of heme biosynthesis
TIGRFAM ID:
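
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function and constant names are illustrative and not part of the record. Under those assumptions, 3564349 - 3561914 + 1 = 2436 bp and 2436 / 3 - 1 = 811 aa, which agrees with the listed values.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3561914
END_BP = 3564349
GENE_LENGTH_BP = 2436
PROTEIN_LENGTH_AA = 811


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both flags are True when the record is internally consistent.
coords_consistent = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_consistent = coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_consistent, protein_consistent)
```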


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATTTT TTGGAACGGC CCGCTTGATG TTCGGCGTGC TTGGTGTGGT TGCACTCGCT 
GCGTGCGACA CGGCTGAGGA GCGCGCGGAA GGCCACTACC AAAAAGGCAT CGAACTCCTT
GAGGCTGGAG ATGTCGATCG CGCGCTTGTG GAACTGCGCA ATGTTTTCCA ACTCAACGGT
CTGCACCGCG ACGCCCGGGC GCTCTATGCG GCGACTGTTC TTGACCAGGG CAGGATCAGT
GAGGCTTTCG GGCAATACCT GAGACTGGTG GAGCAATACC CGGACGATCT GCAAGCGCGC
ATCGTTCTCG CCAACCTATC TATCGACGGA CGTCAGTGGG AAGAAGCAAC GCGTCATGTC
AACCGCGCCA GGCAAATTGC AGCGGACGAT TTCGAGGTTC AGGTCCTCGA TCTTGTCATG
CGGTATCGCG AAGCGGGCTT GAACGGCGAC GATGACACGC GCAGGACCCT GCTCACCGAG
GCGGAGGCCA AGTTGGCGCA GGCGCCAGAG AAACTCACGT TGCACAACAT TCTGATCGAC
GGTGCCATCT GGCAACAGGA TTTCGATCTT GCGCTACGGC GTGTCGATGC CGCGCTGGAA
TACAACAAGG AAGCGGAGCA GCTCTACATG TTTCGCCTCG CGGCGCTCGA ACAGCTCGGG
CGCACCGTCG AAATCGAAGA ACAGCTTCTC GAACTTGTTG CTCAGTATCC AGATGTGGAT
CGCTACAGGC GTCTCGTCTT GCAGTTCTAT CGCCAGCAGA ACGAGCCTGA AAAGGCCGAG
TCGTTCATCC GTTCCGTGGT TTCCCCGAGC GACGAAGATC CGACGGAGTA CGTGGCCTTC
GTGCGTTACC TGCGGGACAC TCGTGGCATT GAAGCGGCCC TTTCCGAGCT CGAGTTGGCG
AATGAGACAG TGCCGGATCG TCCCGTTCTT CAGGCGCTCG AAGCCTCGCT GATCTTCGAT
CTCGGGCGGC GGGATGAGGG GATCTCCAAG ATGCAGGCCA TCGTGGATGC TGCGAACGAG
TCCTCCGATA CGAATCGTTT CAAGGTTGCC TTGGCTCAGA TGCTTCTCGC GACTGGAAAC
GAGGTTGGCG CGCGTCAGCT CGTCGAGGAA ACTCTGGTGG AAGACCCGAC CAACGTGGAT
GCGATTAAGA TGCAGGCGAG CTGGCTCATC GACAGGGATG AGACCGAGCG CGCCATCAGC
TTGCTGCGGA CCGCGTTGGA CCAGACCCCT GACGATCCTC AACTGATGAC TCTGATCGCC
AACGCGCATA TGCGCAACGG GGACCGGGAA CTCGCGCGGG ATCTGCTGAG TCTGGCGGTG
GAGACCTCCA GCTATGCGCC GGAAGAGAGC TTGCGCTACG CCCGTTTGCT GGTGAGCACC
GAAGAGTTTC TGACCGCCGA AGATGTGCTC GTAAATGCCC TGCGCCGCGC GCCGTCGAAT
ACGCAACTTC TTTTGGCCTT GGCCGAGGTC TACTTGCAGT TGGAAGATTG GGCGCGCGCT
GAGCATGTCG AAACAACGTT GCGTGGCCTC GATCAACCGG GGGTGCGTGA AGCAGCCGAC
AGCATTCGTG TCGCGATCCT GAACGGTCAG CAGCGTCAGG GCGAAGCGCT CGACTTGCTG
TCGAACCTGG CGACGCAGGA TGGCGGAAAC TTCACAGCCC TGACAGCCAC GGTTCAGTCC
TTGTTGAATG CGGGCGAAGC GGAGCGCGCG CGCGCCATGG TCGATGAGGC CTTGGCGCAA
GACCCGGACA GTTTCCAGGT TCTCGTTCTC GATGCGGCGC TTAAAAATTC CGTGGGCGAC
TTCGAGGGTG CTGCGAAGTC TTATCGCGCT CTGTCTGCGC GCCAAGAGGC CGGAGAGAGG
ATCTGGTTGG AACTCGTGCG TACCTTGAAC CGTTTGCAAC GCCCTGGCGA GGCTCGCGCC
GCGCTCGCCG AAGGCTTGGA GCGTTTTCCG CAGGGCGCAA ACCTGCTCTG GGCTCAAGCC
TCGACGCTTG AACAGGCGGG CGATATAGAC GGCGCGATCG AGATCTACGA GCGTCTTTAC
GAGCAGTCCA GCGGATCCAT CGTCGTCTCT AACAACCTCG CGAGTCTGCT GTCGACACAC
CGGACGGACG AGGCGAGCAT CGAGCGCGCT TACCGGATTG CCCGTCGTCT GCGCGGCACC
GAGAACCCGG CGTTCCAGGA TACTTATGGC TGGCTTGCAT ACTTGCGCGG TGATTACGCA
GAGGCTGTCG AGTATCTCGA ACCTGCGGCC GCGGCCCTGT CGTCGGATGC GCTTGTACAG
TATCATCTCG CCATGGCTTA TCTCGCGGCA GAGCGGACAG AGGATGCCGC AGAGCAATTC
CGCAAGTCAC TGGCGCTTGT GGGGCCAGAC GAGACCCGGC CTCAGTTCGC CACCGCTCGA
GAAGAGTTGA ACAAACTCGA GACGGTTAAC CAGTGA
 
Protein sequence
MRFFGTARLM FGVLGVVALA ACDTAEERAE GHYQKGIELL EAGDVDRALV ELRNVFQLNG 
LHRDARALYA ATVLDQGRIS EAFGQYLRLV EQYPDDLQAR IVLANLSIDG RQWEEATRHV
NRARQIAADD FEVQVLDLVM RYREAGLNGD DDTRRTLLTE AEAKLAQAPE KLTLHNILID
GAIWQQDFDL ALRRVDAALE YNKEAEQLYM FRLAALEQLG RTVEIEEQLL ELVAQYPDVD
RYRRLVLQFY RQQNEPEKAE SFIRSVVSPS DEDPTEYVAF VRYLRDTRGI EAALSELELA
NETVPDRPVL QALEASLIFD LGRRDEGISK MQAIVDAANE SSDTNRFKVA LAQMLLATGN
EVGARQLVEE TLVEDPTNVD AIKMQASWLI DRDETERAIS LLRTALDQTP DDPQLMTLIA
NAHMRNGDRE LARDLLSLAV ETSSYAPEES LRYARLLVST EEFLTAEDVL VNALRRAPSN
TQLLLALAEV YLQLEDWARA EHVETTLRGL DQPGVREAAD SIRVAILNGQ QRQGEALDLL
SNLATQDGGN FTALTATVQS LLNAGEAERA RAMVDEALAQ DPDSFQVLVL DAALKNSVGD
FEGAAKSYRA LSARQEAGER IWLELVRTLN RLQRPGEARA ALAEGLERFP QGANLLWAQA
STLEQAGDID GAIEIYERLY EQSSGSIVVS NNLASLLSTH RTDEASIERA YRIARRLRGT
ENPAFQDTYG WLAYLRGDYA EAVEYLEPAA AALSSDALVQ YHLAMAYLAA ERTEDAAEQF
RKSLALVGPD ETRPQFATAR EELNKLETVN Q
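
The relationship between the two sequence blocks can be illustrated by translating the gene sequence codon by codon. The sketch below is one possible way to do that in plain Python, not the database's own method. It assumes the amino-acid assignments of NCBI translation table 11, which for sense and stop codons match the standard code, and it uses only the first and last lines of the gene sequence as input. `CODON_TABLE`, `clean`, and `translate` are hypothetical names introduced here.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first, second, and third base each cycle through T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map every codon to its one-letter amino acid ('*' marks a stop codon).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last display lines of the gene sequence; each full line holds 60 bases,
# so every line starts in frame.
FIRST_LINE = "ATGAGATTTT TTGGAACGGC CCGCTTGATG TTCGGCGTGC TTGGTGTGGT TGCACTCGCT"
LAST_LINE = "GAAGAGTTGA ACAAACTCGA GACGGTTAAC CAGTGA"

print(translate(FIRST_LINE))
print(translate(LAST_LINE))
```

The two printed fragments correspond to the first 20 and last 11 residues of the protein sequence. For full records, an established library such as Biopython is likely a better choice than a hand-built table.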