Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2556 |
Symbol | |
ID | 5713453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2720713 |
End bp | 2721852 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641268479 |
Product | hypothetical protein |
Protein accession | YP_001533890 |
Protein GI | 159045096 |
COG category | [R] General function prediction only |
COG ID | [COG3500] Phage protein D |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00000358753 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTGGAGC TGATCAAGAC CCAGAGCCGT GCGCCCGGCG AGTGCCTGAT CCATATCGGG GACGCGGAGA TCGTCGATCT CTACCCGTTT CTGATGGAGG TGACGGTCGA TACGGCGCGG GAGGCGGCCT CCGAAGCCGT CCTGAAATTC GAGACGCGCC GGGATCTCGA CGGCAGCTGG ATCGTGCAGG ATGACGACCG GATCCGTCCC TGGAAGCCGC TTCGGATCGA GGCCGCGTTC GGGGACGAAA CGGAAGAGGT CATGCGCGGC TATATCCGCC AGATCGATGT GTCCTTCCCC GAGGATACAG GGGGCGCGAC GGTCACGTTG GCGGTGCAGG ATGACAGCCT CGCCCTTGAT CGCACCGCGC GAACCGAGGC GTGGGGCGCG GAGGGGCGGA CCACGGATCA GGCCATCGTC ACGGACATCC TGTCGCGCAA CGACCTTGTC CCGGAGGGCA TGCCGGCCCT GGGCCAGACC GACCTCGCGG TGACCCAGGA CGACACCGAT GCGGCCTTTC TGCGCAAGCG GGCGGAGGTG AATGGTTTCG AGCTGATCTA CCGCCGGGGC GCGGTCTATT TCGGACCCCG GCGCTTGACG GCGGCGCCGC AGGCCACGGT GAAGGTCTAT GCCGGGCCGG ACACCACCTG TTTGAGCTTT GCGGTCACCG ATGACGGGAT GAAGCCCGAC GGGGTGGAGT ATGACGTGGC GAGCGCCGAG GGCGCCAGGA CCGAGACCCG GCGTCTGGCT CCGAACCTCG ATGCGCTGGG CCCGGAGCCA GCAAGCTCGG TCGCGGCGCT GGACGACGGG TTCGTCTGGA AGATCCGCAA GGAGGGTGAG AGCGACGCGG CCAAGGCCGA AACCCTTGCG CAGGAGAAGG CCAACGCCAA TGCCATGAAG ATCAGCGGCA AGGGTGTTCT GGATGGCGCG CTCTATGGTC ATGTGCTGCT GACCGGGCTG CCCGTGGGCG TGGACGGGGT GGGCAACCGC CATTCGGGCA TCTGGTACGT GGACCGGGTG CGCCACGTGT TCGACACCAC GGGCTACCGG CAGGAGTTCG AGTTGCAGCG CAACGCCTAT GGCGACAACC TGCCCGAAAC AGGCGATCCG CTGGCGCGGC TGCGGGGGGT CGGCACATGA
|
Protein sequence | MLELIKTQSR APGECLIHIG DAEIVDLYPF LMEVTVDTAR EAASEAVLKF ETRRDLDGSW IVQDDDRIRP WKPLRIEAAF GDETEEVMRG YIRQIDVSFP EDTGGATVTL AVQDDSLALD RTARTEAWGA EGRTTDQAIV TDILSRNDLV PEGMPALGQT DLAVTQDDTD AAFLRKRAEV NGFELIYRRG AVYFGPRRLT AAPQATVKVY AGPDTTCLSF AVTDDGMKPD GVEYDVASAE GARTETRRLA PNLDALGPEP ASSVAALDDG FVWKIRKEGE SDAAKAETLA QEKANANAMK ISGKGVLDGA LYGHVLLTGL PVGVDGVGNR HSGIWYVDRV RHVFDTTGYR QEFELQRNAY GDNLPETGDP LARLRGVGT
|
| |