Gene Information |
Locus tag | Dshi_1797 |
Symbol | |
ID | 5712785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1868767 |
End bp | 1870593 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641267717 |
Product | hypothetical protein |
Protein accession | YP_001533140 |
Protein GI | 159044346 |
COG category | [S] Function unknown |
COG ID | [COG2861] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
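
The coordinate and length fields in this table are related by simple arithmetic, and the following Python sketch cross-checks them. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1868767
END_BP = 1870593


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len)                     # 1827
print(protein_length_aa(gene_len))  # 608
```

Both results agree with the Gene Length (1827 bp) and Protein Length (608 aa) rows of the record.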
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000479521 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.132264 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAGAG GCATACTCTC GGGCCTGTTC TGGGGCGGTG CGGCCAGCAT GATCGTGCTG ATGGCCGCGT CGGTTTATTT TCCCCTGCGG GACATCAGCG ACCGCACCGC GCCGCGCGCG CCGGTGCGCG AGAACGTGTT GCTCGCCGAC CCGCCCCCCC TCCCGCCGGG GGCGGAGGTG CAGCCGAGCG TTGCGGAGCC TGAACCTTCC GTCACGACCG ATCGTGTCGC GGCCCCGTTG CAGACCCCGC GTCCAGACAG CGGCCCGCCG CCCGATGCCC CTGCACCGGC CGCGCCGACG GTTGCCGAGG CGCCGCCCGC GCCGCCGACG GACCCCGCCG CGCCCGCTGG CCCGGGCCCG GAACTTGCCG AGACCGCCCC GGCGAATGTT GCGCAGCCCG CGCCCGGCCA AGCGGTCCCA ACCGCGCCTG GGCCCGATAC CCCGTCGGCC ACCGCGCCGC GGATGGCCGA CGTGCCGCCG GGCCTGCCGG TGCCCGGTGC CGTGCGCGAC CGCGCCCCGG GAGAGGCGCT CGCAGCCCTG CAAACCCCTT TGCCGGTCCC CGAGGTTGCC CAGGAGCCGG GCGATTTGAC GGGCCCTGCC GCGATCCTGC CCGCCCCCGG CGCGCGGCAA ATCGGCGAGG CGGCGACGCG GACGCCGGAC CGGCCCGCGG ATCTGCCTGT GCCTGGCACG GCCGCCGAAC CACCGGTCGC GTCGGCGCCT GTCGCGCCGC CCCGGCCGCC GGAGCTCGCC GCGGAGCCAT CGATGCCGTC CGGCCCGCCG GAGCCGGTCG AGCTTGCCCG TCCCGACAGC TCGCCGCCGC GGCTGGCCCT GGCACCCGCA GACCTGCCGC TGCCTCCCGT GGCCGCGGCC CCCATGGAAG CGTCGGAGCC GCCCGTCCCG GCCGCAGGCC CTCTGCCCGC CGACCTGCGC CCGGCCGCCG ACACGCCTGC ACCGCCCGCC GACGACCGGC GCACGGCCCT GGTCGTGCCG CCGCGGGAGC TGCCCCGCCG TCTGGTGCTC GGCAGCGACC AGAGCTTCGG CACCCGCGGC CCTGGCCTGT CCTCGCGGAT CCCCCGGATC GGCGAGACCG CCGCGGTGGC CGAGGCCGAC GATCCCCTGA CGCCAGAGCC GGAGGTGCCC GCGCCGCTTG GTGCCCTGGC CCGAAATGCG CTGCCTTTCG AAGGGGCGGA GGGTGTGCCG CGGCTCGCCC TGGTGCTGCG CGCGACCTCT GACGCGCGGG CGATCACCGA TGTGCTGAGC CGGATCGCCG ATCCCGTGGC CGTGGCGCTC GACCCGACAT GGCCGGAGGC GGATGCGCGC GCGGCCGAGC TGCGCGCGGC GGGGCACGAG GTGCTGATCA CCCTGACGGG GCTACCCGAC CCGGTCGAGC CGCGCGACAT CGATACCGCC CTCGCGGTCC ATATCGCGCG CCTGCCGGGC GCGATGGGGG TCTGGCTGCC CCGGACGAGC CCGGTCTTCG GCGATCGGGA ACTGCTGCGC CACCTGGTGG CGGTGTTGGG CGACACGGGC CACGGGCTGG TGGCGCCGCT CAGCGGGCTC GACGCGGTGG GGCAGGAGGC GCGGGCGATC GGTCTGCCCG CGATTTCGGT GGGCCGGGTC CTTGGCGGGT CCGGAGAGGG TGAGGACGCC CTGCGCAGAA GCCTCGACCA GGGGGCGTTG CGCGCCGGCG CGGACGGGCA GGCGGTCCTG CTGGGCGAGA CCCGGGCCGA GACGCTGTCG GCCCTGCGCG ACTGGAGCGC CGCGCAGGAC CCGGATGCCT TGCGCCTTGC ACCGATCTCC 
GCGCTTTTGC TGGCCCCGGG GTCCTAG
|
Protein sequence | MGRGILSGLF WGGAASMIVL MAASVYFPLR DISDRTAPRA PVRENVLLAD PPPLPPGAEV QPSVAEPEPS VTTDRVAAPL QTPRPDSGPP PDAPAPAAPT VAEAPPAPPT DPAAPAGPGP ELAETAPANV AQPAPGQAVP TAPGPDTPSA TAPRMADVPP GLPVPGAVRD RAPGEALAAL QTPLPVPEVA QEPGDLTGPA AILPAPGARQ IGEAATRTPD RPADLPVPGT AAEPPVASAP VAPPRPPELA AEPSMPSGPP EPVELARPDS SPPRLALAPA DLPLPPVAAA PMEASEPPVP AAGPLPADLR PAADTPAPPA DDRRTALVVP PRELPRRLVL GSDQSFGTRG PGLSSRIPRI GETAAVAEAD DPLTPEPEVP APLGALARNA LPFEGAEGVP RLALVLRATS DARAITDVLS RIADPVAVAL DPTWPEADAR AAELRAAGHE VLITLTGLPD PVEPRDIDTA LAVHIARLPG AMGVWLPRTS PVFGDRELLR HLVAVLGDTG HGLVAPLSGL DAVGQEARAI GLPAISVGRV LGGSGEGEDA LRRSLDQGAL RAGADGQAVL LGETRAETLS ALRDWSAAQD PDALRLAPIS ALLLAPGS
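
As an illustration of how the two sequence fields relate, the sketch below translates a nucleotide string codon by codon and computes its GC fraction. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand) and uses the standard amino-acid assignments, which translation table 11 shares. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and written only for this example.

```python
from itertools import product

# Codon table built in TCAG order; table 11 uses the same amino-acid
# assignments as the standard code (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 27 bases of the gene sequence shown above.
print(translate("ATGGGTAGAG GCATACTCTC GGGCCTGTTC"))   # MGRGILSGLF
print(translate("GCGCTTTTGC TGGCCCCGGG GTCCTAG"))      # ALLLAPGS
```

The two printed fragments match the first ten and last eight residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein and the GC content field to be checked the same way.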