Gene Information |
Locus tag | Dshi_3863 |
Symbol | |
ID | 5714392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009956 |
Strand | - |
Start bp | 72755 |
End bp | 73885 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641276776 |
Product | hypothetical protein |
Protein accession | YP_001542072 |
Protein GI | 159046401 |
COG category | [S] Function unknown |
COG ID | [COG2327] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
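The length fields in the table above are consistent with its coordinates. As a minimal sketch, assuming Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon (neither is stated on this page), the arithmetic can be checked as follows. The function names are hypothetical and exist only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon codes for one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Start bp / End bp rows of this record.
length = gene_length_bp(72755, 73885)
print(length, protein_length_aa(length))
```

With the values from this record the two results match the Gene Length (1131 bp) and Protein Length (376 aa) rows.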
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0984668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAACC CCTATATTTC CAGCCTCTAC CCCGAGGTCC CCGGCCCGCC CTTCCCCAAG ACGCAAGAGT ACCTGCAATG CGCGGGCAAC AACCTGGGCA ACTTCATGTT CTGCTCCTCG GTCCGCCGGA TCGTGCGCAC CACGACCCAC CCGCGCGGGG ATTTTCGCCG GCTCGACCTC AAGACCATCG CCGCGGAATG CGACGGGATC GTCATCGCGG CGGCCAACTG GCTTCAGCCC AAGCAGAACT ATGGCGGCCT GGCCGACCAG ATCGAAAAGG CGAACGTGCC CGCGGCGATC ACCGGCATCG GCGCCCAGAG CTCCGGCGGC AAGATCCCCG AATTGCTGCC CGGCATGCTG CGGCTGCTCA AGGTGGTCTC CGAGCGGTCC CACTCGATCT CCGTGCGTGG CCCGTTCAGC GCCGAGGTGC TCAACCACTA CGGCATCCAG AATGTCACCG TCACCGGCTG CCCGTCGCTG CTGTGGCACC GGGACCACCC CGCCGAGATC ACCCGCCTGC CCCGGGACGG CCGGGTCGGC CGGGTCACGC TCAACGGCAC CCTGCACCGC TTCGACATCC CCAAGACCCC AGGCAAGGTG GTCAAGCTGA CCCGGTTCAC CCTCCTGCAG GCCATGGCCT GGGGCTGCGA CTACGTGGTG CAGAACGAAC GCCCCTTCCT GCAGGCCCAT CTGGGCGAGC TCGCCGAGGA CGACCAAGAC AGCTGGGACT TCCTGCATTA CGTGTTCGAC GAGCCGGACC GCGCGATTCT GAAAACCTAC CTGGAACGCC ATATCCAGGC CTTCCCGAAT ATTCCCGAAT GGATGGCCTA TTGCGCGAAT CATGACCTGG TGCTGGGCAG CCGCCTGCAC GGGGTGATCG TGGGGCTGCT CTCGGGAACA CCGGGGGTGC TGATCACCCA TGACAACCGG ACAGAGGAAA TGGGCCGCTT TGCCGGCATT CCGACCATCA CCGCCGAGGA TTTCATGTCG CGGCCCAAGA TCGACCCGGA CGCGATCCTC GCCGAGGCCG ATTTCGACGC CTTCAATGCC CGGCAGAAAG ACTATTTCCG GGACTTTGTG GCCTGGTTCG ACGCCAACGA GATCCCGCAT CGGTTGACCG TCACCCCATA G
|
Protein sequence | MKNPYISSLY PEVPGPPFPK TQEYLQCAGN NLGNFMFCSS VRRIVRTTTH PRGDFRRLDL KTIAAECDGI VIAAANWLQP KQNYGGLADQ IEKANVPAAI TGIGAQSSGG KIPELLPGML RLLKVVSERS HSISVRGPFS AEVLNHYGIQ NVTVTGCPSL LWHRDHPAEI TRLPRDGRVG RVTLNGTLHR FDIPKTPGKV VKLTRFTLLQ AMAWGCDYVV QNERPFLQAH LGELAEDDQD SWDFLHYVFD EPDRAILKTY LERHIQAFPN IPEWMAYCAN HDLVLGSRLH GVIVGLLSGT PGVLITHDNR TEEMGRFAGI PTITAEDFMS RPKIDPDAIL AEADFDAFNA RQKDYFRDFV AWFDANEIPH RLTVTP
|
| |
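The sequences above are printed in space-separated groups of ten characters. A minimal sketch of how such a string could be cleaned up and sanity-checked is shown below; the helper names are hypothetical, and the start-codon check is a simplification that accepts only ATG, although translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def ungroup(seq: str) -> str:
    """Remove the whitespace between the 10-character display groups."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an ungrouped nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, recognised stop."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Short fragments copied from the start and end of the gene sequence above;
# paste the full "Gene sequence" field to check the whole record.
head = ungroup("ATGAAGAACC CCTATATTTC")
tail = ungroup("CGGTTGACCG TCACCCCATA G")
print(head, tail[-3:], round(gc_fraction(head), 2))
```

Applied to the complete Gene sequence field, `gc_fraction` can be compared against the GC content row and `looks_like_complete_cds` against the Gene Length row.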