Gene Dshi_3388 details

Gene Information

Locus tag: Dshi_3388
Symbol:
ID: 5712446
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 3566584
End bp: 3568110
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 56%
IMG OID: 641269317
Product: polysaccharide chain length determinant protein
Protein accession: YP_001534722
Protein GI: 159045928
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily
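
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes one terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3566584, 3568110)
print(length, protein_length(length))  # prints "1527 508"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.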


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGATA TGCGGTACTA CACATCCGTC TTGTTGCGCC GCTCGCCGTT ACTGATCGTG 
GTCACCGGCG TCTGCGCAGT GTTATCTGTA CTAGTCGCTC GTGAGTTACC CGCAAAATAC
GAAGCCTCGG CAAGACTGTT GGTCGAATCG GCCCAGATCC CGGATCAGCT GGCGATGTCG
ACAGCAACTA CCGGCGCGCA GGAGCAGCTG GAAATCATCC AGCAAAGGCT GCTGACGCGG
GCCAATCTGC TCGATATAGC AAACCGGTTC GAAGTGTTTC CCGACATTAG GTCGATGAGC
CCGGATGAAG TCGTCCGTAG CATGCGGAGC AACACGCGCA TAAACCTGAC ATCCGGTCGT
GATCGCGCGA CACTCATGAC GATCTCCTTC ACCGATGATC GTCCTGTGAC GACGTCGGAC
GTTGTGAATG AGTACATCAC TCTGATCCAG CGTGAAGATA GCCGCCTCAG AACCAGCCGC
GCGTCGGAGA CCGAGCAGTT CTTCAAACAG GAAGTCGATC GGCTCGCCGT TCAACTTGAC
GAGAAAAGTG CCGCGATTCG ACAGTTCCGC TCCGAAAACT CCGATGCTCT CCCCGAAACG
TTTGAGTTTC GGCTTGAACG GTTAGCGGGT TTGCAGGAGC GATCCACAGC CTTCACCAGG
GACCTACTCA TCCTAAAAGA ACAGCGTCAG CAACTCAAAG AAGCTTTGGA CGGTTTGCGG
ACCGGGTTGC CCGAAGACGC CACGGAAGAG CTTTCTCAAG AAGAAGAAAA GCTCGTGGAT
CTCAGACGAG AGCTTGTGGA GGCAATGGAA ACTGCGGGCG AAACCGCACC GCGGGTTCGA
ATTCTGCAAT CCCGGATCGC CCAGGTCGAG CGGGTCATTC AGTCGCTCGG CGGCATCACG
GCGATCGAGA ACCCGACCCA GACTCTTCAG GCGCAGATTG AACAGATCAA CGCGCAGATC
GGGTTCTTGG AGGAGCAGCT TGGCACCACC CAATCCCAAA TTGAAGACGT GGAAGCCACG
CTCGATCTGA CACCTGGCAT CTCGGTGTCT TTGGAGTCCT TGCAACGGGA CTACCAAAAC
GTTCAGGAAC AGTACAACTC CGCGGTAGAG CGACTGGCGA TTGCGCAAAC CAGTGAGCGT
ATAGAATTGG CATCACGCGG CCAGCGCATC ATCGTTCTCG AACAGGCTGC GACACCGACT
GACCCGTCGA GCATGCCACC ACTTCTTGTC GCTGCCGGAG GTACCATCGG CGGTTTCTTT
CTGGCTCTGG GGTTGGCAAT CCTCCTGGAG GTCATGAGCA AGACGATCCG CCGACCTAGC
GACCTGACCA AAGGTCTTGG CATTGTGCCG CTGGCGACAC TCCCTTACGT CAAGACGAGC
CGTGAGATCG TTTTTGAACG GGTGGCCAAA GTTGCAATTA TTCTCGTGAT CGTCACCGGC
ATTCCGGCGA TGCTCTACGC GGTCCATATG CTTTATCTGC CTTTGGATCT GCTCGCCGAC
AGGATCCTGA CGCGCATTGG TGTCTGA
 
Protein sequence
MFDMRYYTSV LLRRSPLLIV VTGVCAVLSV LVARELPAKY EASARLLVES AQIPDQLAMS 
TATTGAQEQL EIIQQRLLTR ANLLDIANRF EVFPDIRSMS PDEVVRSMRS NTRINLTSGR
DRATLMTISF TDDRPVTTSD VVNEYITLIQ REDSRLRTSR ASETEQFFKQ EVDRLAVQLD
EKSAAIRQFR SENSDALPET FEFRLERLAG LQERSTAFTR DLLILKEQRQ QLKEALDGLR
TGLPEDATEE LSQEEEKLVD LRRELVEAME TAGETAPRVR ILQSRIAQVE RVIQSLGGIT
AIENPTQTLQ AQIEQINAQI GFLEEQLGTT QSQIEDVEAT LDLTPGISVS LESLQRDYQN
VQEQYNSAVE RLAIAQTSER IELASRGQRI IVLEQAATPT DPSSMPPLLV AAGGTIGGFF
LALGLAILLE VMSKTIRRPS DLTKGLGIVP LATLPYVKTS REIVFERVAK VAIILVIVTG
IPAMLYAVHM LYLPLDLLAD RILTRIGV
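
The sequences above are printed in blocks of ten characters, so the spaces and line breaks must be removed before any analysis. The following Python sketch shows one way to do that, to compute a GC fraction, and to translate the gene sequence into protein. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code for sense and stop codons. The helpers `clean`, `gc_content`, and `translate` are hypothetical names introduced for illustration, and only the first line of the gene sequence is used as sample input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = clean(
    "ATGTTCGATA TGCGGTACTA CACATCCGTC TTGTTGCGCC GCTCGCCGTT ACTGATCGTG"
)
print(translate(first_line))  # prints "MFDMRYYTSVLLRRSPLLIV"
print(round(gc_content(first_line), 2))
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full cleaned gene sequence to the same functions would give the whole-gene values for comparison with the Protein Length and GC content fields.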