Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0705 |
Symbol | |
ID | 5711140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 711826 |
End bp | 713256 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641266614 |
Product | hypothetical protein |
Protein accession | YP_001532052 |
Protein GI | 159043258 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4421] Capsular polysaccharide biosynthesis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCCAAC GCCCCCAAGC CGTTCCCGAA CCGCTCTGGC GACAGACCGT CAAACGATGT GCCCTGCGGC CGAACGGCAT GGCGACCTCG CTGATGAACG CGATGCGGGC CAAATGCGAT CTGGCGGCGC GTCTGCCGAT GGACGAGGCG TTCGAGGTTT TCTGCGCCTT GTCGGAGCTG GATGCGGCCC ATGCGTTGCG CCGGCGCCCG CTGCGCGATC TCGGTGCGGT GCTGGAGGCG GAGGCGGCCG AGCCGGTCAT CGTGCTGAAT CCCGGCGGTG AGCCGCGGGT GAACCGCGCG CCCGAGATCC ATGGGCCGGG CCATGTACCC GAGCTGCGCA ACACCGGGCG GCGGATCCTG GCGGGATGGC TGGAGGATGC GACGGTCTTT GCCCGCTCGG CGCTTGTGGC GCGGGGAGAG GAATTGCTGC GCGATGCGCA GCACGACGAG TTGACCAGTC TGCCCGAAGA GTTGGCCTTC GATCCGGTCG TCTTTCGCAG GACCGGCGCG GCGGAGGTGG CCTTCATCGA GGATACCGGT CCGGGCCGCT GCCTGCATCT GCCCCGGGCG CTCTCGCTGC TGGGGATCAA CGCGGACTCG TTCGGGCACT GGATGCTGGA GCAGCTGCCG CGGTTTCTGG CGACGCGGGC GCTTCTGGGC GCGGCAACGC CGCCGCTGCT CGTGGATGCC GATATCCCGG CGCAGCATGC GGAGGCGTTG CGCTTCTTCG GCGGGGAAGA CGGCCCCGAG ATCATCGAGG TGCCGCGCGG CCTGCGGGTC CGGGTCGACC GGCTGTTCTG TGTGCTCGAT TGGGCCTATG CGCCGCATCT GATCACCACG GACGAGGGGC TCGACGTCTC GAAGGTGCAT GCGGTGGTCC CGTGGGTTGC CGGGGTTTAC GCGCAGGCCG GCGCGCGGGC CGATGCGCGG ATCGCCGAGC TCGGGGTGGC GGTTGCGCCG CAGATCCGGG CGCAGCGTCG GGTGTTCTGG GCGCGAAAGC CTGCCCGCCA CCGGTCCATC GCCAACTGGG AAGCGCTGCG CGACAGGCTG GACGAGCTCG GCTATGTCAC GGTCTTTCCC GAGGAAATGG GCTTTGCCGC GCAGGTTGCC ACCTTGCGTG CGGCGGATCG GATCGTGGTG CAGAACGGGT CCGGGTCGCT GGGGCTGCCG CTGGCCCGGC CGGGGACGCG GGTGCTCTAT CTCAGTCATC CGGAGATGTC GCGCTTTGCC TGGCAGTCGG AGGCCTTCGC CTGCCTCGGG TTCGACCTGC GGGTGCTGTC GGGGCCGTTC ACCGAAAGGT CACAGCCCTG GGTGGACCAG TCGAACTACG AGATCGAGAT GCCGGTCGCG GAAGCCGCGC TCGCCTGGAT GGAGGCGGAC CTACCGCACG GGCGCGGCCG TGCGATGCAG ACCGCGAGAC CGCGCCCATG A
|
Protein sequence | MIQRPQAVPE PLWRQTVKRC ALRPNGMATS LMNAMRAKCD LAARLPMDEA FEVFCALSEL DAAHALRRRP LRDLGAVLEA EAAEPVIVLN PGGEPRVNRA PEIHGPGHVP ELRNTGRRIL AGWLEDATVF ARSALVARGE ELLRDAQHDE LTSLPEELAF DPVVFRRTGA AEVAFIEDTG PGRCLHLPRA LSLLGINADS FGHWMLEQLP RFLATRALLG AATPPLLVDA DIPAQHAEAL RFFGGEDGPE IIEVPRGLRV RVDRLFCVLD WAYAPHLITT DEGLDVSKVH AVVPWVAGVY AQAGARADAR IAELGVAVAP QIRAQRRVFW ARKPARHRSI ANWEALRDRL DELGYVTVFP EEMGFAAQVA TLRAADRIVV QNGSGSLGLP LARPGTRVLY LSHPEMSRFA WQSEAFACLG FDLRVLSGPF TERSQPWVDQ SNYEIEMPVA EAALAWMEAD LPHGRGRAMQ TARPRP
|
| |