Gene Information |
Locus tag | Dshi_2903 |
Symbol | |
ID | 5710754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 3059093 |
End bp | 3061177 |
Gene Length | 2085 bp |
Protein Length | 694 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641268829 |
Product | hypothetical protein |
Protein accession | YP_001534237 |
Protein GI | 159045443 |
COG category | |
COG ID | |
TIGRFAM ID | |
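The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon. Both are assumptions that happen to match the listed values, not something the record states. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3059093
end_bp = 3061177

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2085 0 694
```

The result agrees with the listed Gene Length (2085 bp) and Protein Length (694 aa).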
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.177101 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGCG GTACGGTGAT CTTCGACCCC TATCTTCCCT GGGCGGTGCT GGGCGCGCTC GGCGTGCTGA TGGCGGCGCT TCTGCTCTTT GCGGGGGCGC GGGGGCTTCT GGGCGCCTGG GTGCGCGGGC TGGCGACGGC GGCGCTGCTC CTGGCGCTGG CGAACCCGTC CTTGCAGGAA GAGGATCGCA GCCCGCTGAC CGATATCGTG CTGGTGGTGG TGGACGAGAC CGCGAGCCAG CGGATTTCCG ACCGTACCGA GCAGACCGAG GCGGCGCTCG AAGCGCTGAG CGCCGAGATC ACCGCCCGCA CCAACACCGA GCTGCGCGTG GTCCGGGTGG GCGACGGGCC GGGCAATGCG GGTACGCTGC TGGTCGCGGC GATGACCGAG GCGCTGTCGG AGATCCCGCA GGCGCGGCTG GCGGGCACGA TCCTGCTGAG CGACGGGCAG GTCCATGACC TCGGCCCGAT GCCCAACCTG CCGGCGCCCT TGCACCTGCT GCTGACCGGG CGGGAAGGCG ACTGGGACCG CCGCCTCGTG GTCAAGACCG CGCCCGCCTT CGGCATCCTG GGCGAAGAGG TCGAGCTGAC CGTGCGGATC GAGGACCAGG GCGACGTGCC CGCCGCCGCC CAGGGGCAGG TGTCGCTGGA GATCGCCGTG GACGGGGGGG AGCCCTTTGC CGTGGACGTG CCTGTGGGCC GGGACATCAC GGTGCCGCTG TCGCTCGACC ATGGCGGCAT GAACGTGGTG CAGCTGACCG TGCCCGAGGC CGAGGGCGAG TTGACCGACC GCAACAACGC CGCCGTGGTG CAGATCAACG GGGTGCGTGA CCGGTTGCGG GTGCTGCTGG TCTCGGGCGA GCCCCATGCG GGGGAGCGGA CCTGGCGCAA CCTGCTGAAA TCCGACAGCG CGGTGGACCT GGTGCATTTC ACCATCCTGC GCCCGCCGGA GAAACAGGAC GGTGTGCCGG TCTCGGAATT GTCGCTGATC GCGTTCCCGA CGCGGGAGCT GTTCCTGGAG AAGATCGACG ATTTCGACCT GATTATCTTC GACCGTTATC GGCGGCGCGG GATCCTGCCG CAGATCTACC TCGCGAACAT TGCGCAATAT GTCGAAGCGG GCGGTGCGGT CCTGGTGGCC GCGGGGCCGG ATTTCGCCTC GGCCAATTCG ATCTATCGCT CGCCGCTGGC GGATATCATC CCCGGCGAGC CCACGGCACG GGTGATCGAG GAAGGCTTCA CGCCGGAGAT CAGCGATGTG GGCAACCGGC ATCCGGTGAC CGCCGGGCTT GCGCCGACCA ACCTGGCCGA GGGGGAAGCG CCCTGGGGCC GCTGGTTCCG CCTGATCGAG GTGGAGCCGA TGGCGGGCGA GGTGGTGATG ACGGGCCCGG GCGACCGGCC GCTCCTGATG CTGGACCGGG TGGGCGAGGG GCGCGTGGCG CTCTTGGCGT CGGATCACGC CTGGCTGTGG TCGCGGGGCT ACGAGGGCGG CGGGCCGCAG CTGGAGCTGC TGCGCCGGTT GGCCCATTGG ATGATGAAGG AGCCCGAGCT GGAGGAAGAG GCGCTGACCG CCACCGCCGA GGGCCAGACC ATGACCATCA CCCGCCGCGC CCTGACCGAC GGGGAGCGGG AGGTGACGGT GATCGGCCCG GACGGGGTCG AAACGGTCGT GCCCCTGACC GAGATCGCGC CGGGGCGCTG GAGCGCGGAG TTCACCGGCG CCGAGCCAGG GCTCTACCGT CTGAGCGACG GGGAGCTGGA CGCGGTGGTG GGCCTTGGCC CGTCTGCGCC GCGGGAGTTC 
GAGGAAACCC TGGCCAGCGG CGACAAGCTC GCGGCGGCGG TGGATGCGAC CCGGGGCGGG GTGCTGGCGC TGGAGAACGG TGTGCCGCGG CTGCGCGATG TGGGCGAGGG CCGAAATGCC GCCGGGCGCG GCTGGATCGG CCTGACCCCG CGGGAGGCGG CGCTGACCAC GGACCTGCGG ATCAGCCCGC TCATCGCCGC GTGGCTGTTC CTGGCGCTTG TGGCGCTCCT GACCATCGCG GCCTGGCTGC GCGAAGGGCG GCGAAGCACG CCGAAGGCCG GTTGA
|
Protein sequence | MNSGTVIFDP YLPWAVLGAL GVLMAALLLF AGARGLLGAW VRGLATAALL LALANPSLQE EDRSPLTDIV LVVVDETASQ RISDRTEQTE AALEALSAEI TARTNTELRV VRVGDGPGNA GTLLVAAMTE ALSEIPQARL AGTILLSDGQ VHDLGPMPNL PAPLHLLLTG REGDWDRRLV VKTAPAFGIL GEEVELTVRI EDQGDVPAAA QGQVSLEIAV DGGEPFAVDV PVGRDITVPL SLDHGGMNVV QLTVPEAEGE LTDRNNAAVV QINGVRDRLR VLLVSGEPHA GERTWRNLLK SDSAVDLVHF TILRPPEKQD GVPVSELSLI AFPTRELFLE KIDDFDLIIF DRYRRRGILP QIYLANIAQY VEAGGAVLVA AGPDFASANS IYRSPLADII PGEPTARVIE EGFTPEISDV GNRHPVTAGL APTNLAEGEA PWGRWFRLIE VEPMAGEVVM TGPGDRPLLM LDRVGEGRVA LLASDHAWLW SRGYEGGGPQ LELLRRLAHW MMKEPELEEE ALTATAEGQT MTITRRALTD GEREVTVIGP DGVETVVPLT EIAPGRWSAE FTGAEPGLYR LSDGELDAVV GLGPSAPREF EETLASGDKL AAAVDATRGG VLALENGVPR LRDVGEGRNA AGRGWIGLTP REAALTTDLR ISPLIAAWLF LALVALLTIA AWLREGRRST PKAG
|
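The sequences above are printed in space-separated blocks of ten characters, so any calculation on them has to strip the whitespace first. The following is a minimal sketch of how a GC percentage such as the one in the Gene Information table could be computed from the gene sequence. The function name `gc_percent` is hypothetical, and the sample covers only the first two blocks of the gene, so its result says nothing about the full-length value.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the block-separating spaces and any line breaks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence, used only as a small sample.
sample = "ATGAACAGCG GTACGGTGAT"
print(gc_percent(sample))
```

To reproduce the figure for the whole gene, the full Gene sequence field would be passed in place of `sample`.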