Gene Information |
Locus tag | Dshi_1992 |
Symbol | |
ID | 5712987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2108817 |
End bp | 2109965 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641267916 |
Product | hypothetical protein |
Protein accession | YP_001533332 |
Protein GI | 159044538 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
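The start and end coordinates, the gene length, and the protein length listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and a complete CDS ending in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Dshi_1992.
length_bp = gene_length(2108817, 2109965)
print(length_bp, protein_length(length_bp))  # prints "1149 382"
```

The strand does not affect this calculation, because the coordinates are given as a start/end pair on the replicon regardless of orientation.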
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.073088 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.53449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTAG CTTATGTCTC GACCGATCCC GGCATATCGC CCACCGGCAC CAAGGGGGCG TCGATCCATG TGCGCGCGAT CCTCGGGGCG TTGCTGCGCA TGGGCGCAGA GGTGACATTG TTCGCGCCCC CGTCCCGCGC GCCGTTGCCG GAGGATTTGG CCGCGGTGAC CTGGGTGCCG CTGCCGAAAC CGGCCAAGGG CGCGCCCGAG GTCCGCGAAC GCGCGCTGAT CGCGGCCAAT GCGCGCCTGG CGCAGGCGAT GGAGGACCAC GGACCTTTCG ATCTGATCTA TGAGCGGCAC GCCCTGTTTT CGGACGCGGC CATGCAATTC GGTGCGGCGC GCCGGATCCC CAGCGTGCTG GAAGTCAACG CGCCGCTTCT GGAAGAACAG CGCCGCCACC GGGTTCTGCA GAATTCGGAC GAGGCGGCGG CTCGTGCCCG GTCCTCGATC TCGGCGGCGG ATCGGATCAT CGCCGTCTCC GATGCGGTCG GCGCCTATGC CGAAGGCTTC GGCGCCCGGT CGGTCAAGGT CGTGCCGAAT GGCGTCGATG CGGACCGCTT TGCGGTGCCA CCCGGGTTCC GGCCGCCTTT CACCCTCGGG TTTGTCGGCA CGCTCAAGCC CTGGCACGAT GTGGCCTGCC TGATCGATGC GCTGACGCTG GTCCGGCGCT CGGTGCGCGA TGCGCGGCTG CTGGTGGTCG GCGACGGTCC GGAGCGCGCG GCCCTCGAGG CGCAGGCGCG CGAGGGCGGT CTTGCCGACG CGGTCGACTT CCATGGCGCG GCGCCGTCGC AGGACATCCC GGCGCTGCTG GCCCGGATGC ATGTGGGGCT CGCCCCCTAT CGCGGGGGGG ATCCGTTCTA TTTCTCGCCG CTCAAGATCT ACGAATACAT GGCGGCGGGC CTGCCCGTTC TCGTCAGTGA CCGGGGCAAC ATGCGCGATG TGGTCCTGCC GCCCCGGGCG GGCGCGGTGG TGCCGCCCGA TGACCCCGCT GCGCTGGCCG AGGCGATCAT CCACCTGGCG CAGAACCCGT CGGTCGGGCG CGCGCAGGGC CAGCGCGGGC GCGCCCATGT GATCCGCACC GCCAGTTGGG ATCACGTCCT GCGGGCGAGC CTGAACGGCC TGCCGCTCCC TTCGGTCCTG GCCGCCTGA
|
Protein sequence | MRVAYVSTDP GISPTGTKGA SIHVRAILGA LLRMGAEVTL FAPPSRAPLP EDLAAVTWVP LPKPAKGAPE VRERALIAAN ARLAQAMEDH GPFDLIYERH ALFSDAAMQF GAARRIPSVL EVNAPLLEEQ RRHRVLQNSD EAAARARSSI SAADRIIAVS DAVGAYAEGF GARSVKVVPN GVDADRFAVP PGFRPPFTLG FVGTLKPWHD VACLIDALTL VRRSVRDARL LVVGDGPERA ALEAQAREGG LADAVDFHGA APSQDIPALL ARMHVGLAPY RGGDPFYFSP LKIYEYMAAG LPVLVSDRGN MRDVVLPPRA GAVVPPDDPA ALAEAIIHLA QNPSVGRAQG QRGRAHVIRT ASWDHVLRAS LNGLPLPSVL AA
|
| |
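The sequences in this record are displayed in space-separated blocks of ten characters. A minimal sketch for turning such a display string back into a contiguous sequence and computing a GC percentage follows. The helper names are hypothetical, and the GC figure here is simply the share of G and C among all bases, assuming no ambiguity codes are present.

```python
def join_blocks(display: str) -> str:
    """Remove the whitespace separating the 10-character display blocks."""
    return "".join(display.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (no ambiguity codes assumed)."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two display blocks of the gene sequence, used as a small example.
first_blocks = join_blocks("ATGCGCGTAG CTTATGTCTC")
print(len(first_blocks), gc_percent(first_blocks))
```

Applied to the full gene sequence, the same two helpers give a value that can be compared with the GC content field of the record.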