Gene Information |
Locus tag | Dshi_2002 |
Symbol | |
ID | 5712997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 2121899 |
End bp | 2123377 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641267926 |
Product | hypothetical protein |
Protein accession | YP_001533342 |
Protein GI | 159044548 |
COG category | |
COG ID | |
TIGRFAM ID | |
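The coordinate and length fields above are related by simple arithmetic, sketched below as a consistency check. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(2121899, 2123377)
print(length_bp)                  # 1479
print(protein_length(length_bp))  # 492
```

Both values agree with the Gene Length and Protein Length fields of this record.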
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.583711 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.178712 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGCCA TTGCGCTGCT GGCCTGCCTT GCGGGCCCGG CTCAGTCCGA AATCCGGTTC GAGGATCTGT CGGGCACCCT GCCGCCCCAT GTCTATTCCG GCGGGTGGGA GCATTTCGTC GGCGGCGGTC TTGCCGTGCT GGATTGCGAC GGCGACGGGT TGCCGGAGCT GTTCGCCGCG GGCGGCGAGG CCCCGGCGAT CCTGCTGCGC AACCGCGGCG GCATGCGGTT CGCGCCCGGT CTGTTGCCAA AGATCATCGG CGTGACCGGG GCCTATCCGC TCGACATCGA CGGGGACGGA TGGCGCGATC TCTTCGTGCT GCGGGTGGGG CCGAACCTGG TGCTGCGCGG CGGGCCGGAC TGCGCCTTCA CCGAGGCCAC CGCGGAGTTG GGGATCGACC CCGGCGACGC CTGGTCCACC GCCTTCACCG CCTGGGTCGC GCCAGGCGAC GAGCGCCCGA CACTTTTCGT GGCCAACTAT GTCGACCGCG ACGACCCCGA CGGGCCGTTC GAGGCCTGTG ACGACCATCA AGTTCTGCGC CCCGGCCAGG CGGGCGGGTT TCGCAGCGAC AGTTTCGGGC CGGGCTTCTG CACCCTCTCG GCCCTGGCGG CGCGGGATGC GCGCGGGCGG ATGACCCTGC GGCTGTCCAA TGACCGGCAT TACTACGTGC GCGGCGGGTA CGAGCAGATG TTCGACCTGG CCGAGCGGCG GTTTCTGGGC CCGGAGGATG GCTGGCCGCA GGTGGCGCTC TGGGGCATGG GGATCGCCAG CCGGGATCTG ACCGGCGACG GGCGCGACGA GGTGATGCTG ACCTCCATGG GCGACCAGCT GATGCAGATC GCACAGGCCG ACGGGACCTA TGCCGCCGCC CCTTTCGCCA TCGGCACCTA TGCCCAGCGG CCCCACACGG GCGAGGATGG ACGCCCCTCC ACCGGCTGGC ATGCGGAGTT CGGGGATGTG GACAATGACG GACGCGCGGA TTTGTTTCTG GCCAAGGGCA ACGTGGACCA GATGCCTGGG CTGGCGATGC AGGACCCCAA CAACCTTCTG CACCAGCGCG CGGACGGGCG GTTCGAGGAG GTCTCGGTCG CCGCGGGGGT GGCCACCACG GCGCGGTCGC GGGGCGCGGC CCTGGCGGAT CTGGACGGCG ACGGGCGGCT CGACCTCGTG GTGGTGAACC GGCGCGCGCC GATGGAGCTC TATCGCAATG TTTCGCAGCA AACGGGCCGC TGGCTGGCGG TGGACCTGTC AGCGCTCGGG CTGGCGGAGA TCGGCGCACA GGTGACGGTG ATCACGAACG CGGGCGCGCA GGTGCAGCAG CGCCTGATCG GGGGCGGTCA TGCAGGCGGC AGTGCGGCGC CGCTGCATTT CGGGCTGGGC GAAGCGACGC AGGCGCAGGT CGAGCTGCGC GATGCCGCCG ACCGGGTGAT CTGGCAGGGC GAGAGCGCTG CGGACCGGGT GCTGCGCGTG GAGCCCTGA
|
Protein sequence | MRAIALLACL AGPAQSEIRF EDLSGTLPPH VYSGGWEHFV GGGLAVLDCD GDGLPELFAA GGEAPAILLR NRGGMRFAPG LLPKIIGVTG AYPLDIDGDG WRDLFVLRVG PNLVLRGGPD CAFTEATAEL GIDPGDAWST AFTAWVAPGD ERPTLFVANY VDRDDPDGPF EACDDHQVLR PGQAGGFRSD SFGPGFCTLS ALAARDARGR MTLRLSNDRH YYVRGGYEQM FDLAERRFLG PEDGWPQVAL WGMGIASRDL TGDGRDEVML TSMGDQLMQI AQADGTYAAA PFAIGTYAQR PHTGEDGRPS TGWHAEFGDV DNDGRADLFL AKGNVDQMPG LAMQDPNNLL HQRADGRFEE VSVAAGVATT ARSRGAALAD LDGDGRLDLV VVNRRAPMEL YRNVSQQTGR WLAVDLSALG LAEIGAQVTV ITNAGAQVQQ RLIGGGHAGG SAAPLHFGLG EATQAQVELR DAADRVIWQG ESAADRVLRV EP
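The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how they might be normalized, checked for GC content, and translated. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares; table 11 differs in its set of permitted start codons, which this sketch does not handle. The names `normalize`, `gc_fraction`, and `translate` are hypothetical helpers written for illustration.

```python
# Standard codon table built in T, C, A, G order; table 11 uses the
# same amino acid assignments (assumption stated in the text above).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def normalize(seq):
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases; expects a non-empty sequence."""
    seq = normalize(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = normalize(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence from this record.
print(translate("ATGAGAGCCA TTGCGCTGCT"))  # MRAIAL
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field once its spaces are removed, and `gc_fraction` on the same input can be compared against the GC content field.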