Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3388 |
Symbol | |
ID | 5712446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 3566584 |
End bp | 3568110 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641269317 |
Product | polysaccharide chain length determinant protein |
Protein accession | YP_001534722 |
Protein GI | 159045928 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGATA TGCGGTACTA CACATCCGTC TTGTTGCGCC GCTCGCCGTT ACTGATCGTG GTCACCGGCG TCTGCGCAGT GTTATCTGTA CTAGTCGCTC GTGAGTTACC CGCAAAATAC GAAGCCTCGG CAAGACTGTT GGTCGAATCG GCCCAGATCC CGGATCAGCT GGCGATGTCG ACAGCAACTA CCGGCGCGCA GGAGCAGCTG GAAATCATCC AGCAAAGGCT GCTGACGCGG GCCAATCTGC TCGATATAGC AAACCGGTTC GAAGTGTTTC CCGACATTAG GTCGATGAGC CCGGATGAAG TCGTCCGTAG CATGCGGAGC AACACGCGCA TAAACCTGAC ATCCGGTCGT GATCGCGCGA CACTCATGAC GATCTCCTTC ACCGATGATC GTCCTGTGAC GACGTCGGAC GTTGTGAATG AGTACATCAC TCTGATCCAG CGTGAAGATA GCCGCCTCAG AACCAGCCGC GCGTCGGAGA CCGAGCAGTT CTTCAAACAG GAAGTCGATC GGCTCGCCGT TCAACTTGAC GAGAAAAGTG CCGCGATTCG ACAGTTCCGC TCCGAAAACT CCGATGCTCT CCCCGAAACG TTTGAGTTTC GGCTTGAACG GTTAGCGGGT TTGCAGGAGC GATCCACAGC CTTCACCAGG GACCTACTCA TCCTAAAAGA ACAGCGTCAG CAACTCAAAG AAGCTTTGGA CGGTTTGCGG ACCGGGTTGC CCGAAGACGC CACGGAAGAG CTTTCTCAAG AAGAAGAAAA GCTCGTGGAT CTCAGACGAG AGCTTGTGGA GGCAATGGAA ACTGCGGGCG AAACCGCACC GCGGGTTCGA ATTCTGCAAT CCCGGATCGC CCAGGTCGAG CGGGTCATTC AGTCGCTCGG CGGCATCACG GCGATCGAGA ACCCGACCCA GACTCTTCAG GCGCAGATTG AACAGATCAA CGCGCAGATC GGGTTCTTGG AGGAGCAGCT TGGCACCACC CAATCCCAAA TTGAAGACGT GGAAGCCACG CTCGATCTGA CACCTGGCAT CTCGGTGTCT TTGGAGTCCT TGCAACGGGA CTACCAAAAC GTTCAGGAAC AGTACAACTC CGCGGTAGAG CGACTGGCGA TTGCGCAAAC CAGTGAGCGT ATAGAATTGG CATCACGCGG CCAGCGCATC ATCGTTCTCG AACAGGCTGC GACACCGACT GACCCGTCGA GCATGCCACC ACTTCTTGTC GCTGCCGGAG GTACCATCGG CGGTTTCTTT CTGGCTCTGG GGTTGGCAAT CCTCCTGGAG GTCATGAGCA AGACGATCCG CCGACCTAGC GACCTGACCA AAGGTCTTGG CATTGTGCCG CTGGCGACAC TCCCTTACGT CAAGACGAGC CGTGAGATCG TTTTTGAACG GGTGGCCAAA GTTGCAATTA TTCTCGTGAT CGTCACCGGC ATTCCGGCGA TGCTCTACGC GGTCCATATG CTTTATCTGC CTTTGGATCT GCTCGCCGAC AGGATCCTGA CGCGCATTGG TGTCTGA
|
Protein sequence | MFDMRYYTSV LLRRSPLLIV VTGVCAVLSV LVARELPAKY EASARLLVES AQIPDQLAMS TATTGAQEQL EIIQQRLLTR ANLLDIANRF EVFPDIRSMS PDEVVRSMRS NTRINLTSGR DRATLMTISF TDDRPVTTSD VVNEYITLIQ REDSRLRTSR ASETEQFFKQ EVDRLAVQLD EKSAAIRQFR SENSDALPET FEFRLERLAG LQERSTAFTR DLLILKEQRQ QLKEALDGLR TGLPEDATEE LSQEEEKLVD LRRELVEAME TAGETAPRVR ILQSRIAQVE RVIQSLGGIT AIENPTQTLQ AQIEQINAQI GFLEEQLGTT QSQIEDVEAT LDLTPGISVS LESLQRDYQN VQEQYNSAVE RLAIAQTSER IELASRGQRI IVLEQAATPT DPSSMPPLLV AAGGTIGGFF LALGLAILLE VMSKTIRRPS DLTKGLGIVP LATLPYVKTS REIVFERVAK VAIILVIVTG IPAMLYAVHM LYLPLDLLAD RILTRIGV
|
| |