Gene Information |
Locus tag | Dshi_0991 |
Symbol | |
ID | 5710506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1014095 |
End bp | 1017562 |
Gene Length | 3468 bp |
Protein Length | 1155 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641266901 |
Product | hypothetical protein |
Protein accession | YP_001532334 |
Protein GI | 159043540 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
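The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; both are assumptions about how this record is encoded, not something the record states. The function names are hypothetical.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the "Start bp" and "End bp" rows of this record.
length = gene_length_bp(1014095, 1017562)
print(length, protein_length_aa(length))
```

Under those assumptions the result agrees with the "Gene Length" (3468 bp) and "Protein Length" (1155 aa) rows.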
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAACCA GGGAGCGGCA AGGACTGATT TCACGCTTTT TCTCGCGGGC CAGCGAGACT GACCGTGACG TGCGCACGAA GGAGGGACCT CCTCAATCTC GCCACGAGGC CTGGGGCGAT AACGCAACGC GTTCGGTCAG CGGCTTGGTC AAGTCCGTTC TACAGCAACC AAGTTGTTTG ATTGTGACAG GTTATCAGGA CTTCCTGTCG TCACTCACCA TTTTGCTCGA AACAATTGAA AACCTTGGCG AACGCCCAGA AGGCAGCATC CGCATCGTCT TTGGCTCCAA CTCCGAAACA CGACAGGTAT TGGGTGGCAG CGGCCGGTCA GTAGCCGAAG AAGCGCGCCA ACACTTCCTT GGTGCAAGGG GCTTTTCGGT TGTTGATTTG GCCGATCTGC GGGCAGTTCT TGCAATGGAT GCAATCGAGC GCGGAATCAT CAAGTTCCGC ATTTTCGATC CCGATCTGGC AGAAGAAAAA CTAGGGCGCC GTCCACCAAT GCTTCACGCG AAGCTTTTTG TAGGTGGGGA CAAGGCACTT TCGGGAAGTG CCAATTTCTC GATCAACGGT TTGCGCCGCA ATCTGGAATT CATGGACGAC GCCGATGCCT GGCCCGAACT TGCGACTGCG CGTTGCGCGG CCGCTGAGCA GTATTGGGAC ATGGGGCGCG ATTGGACTGA GACAGCGCTG GAAATCCTGC GAGCGCTCAT CCGCATAGTT TCACCCGAAG AAGCAGTGGC CAGAACGGTT CACGAAGCCG CGACCTTCAC CCCCTGGCGT GTCGCGGGCG AAACCAGCAC CGGACGTCCG CCTCAACCCT TCCAGGCGGA TCTTATCTAT GAGGCTGCAG GGACGGTCTA TGAACACGGC TTTGCCTTTG TCGAAGCGCC AACTGGTGCC GGGAAAACCG ACATTGGGAA ACACCTGGCA ACAGTTCTAC CGGTCAGCCA TGGGCAGACG GTCTTTTCCT GGGGAGAGCG CGCCGATCAA CAACGGCTTG GCTCGCTCGC TCTGATACCT GCCAGCGTTC TGAAGAACTG GACAACGAAC GCGCCTGCGA ACTTCAAACC GATCAAGCAC AGTCACCTGT CGCGGCGAGG TAAGGAGGAG ACTGCGGAAT TGGACGAGAT CAACCGGGCG GTCCGTTCGT CTGCATCAAT GATCGTCGAT GAAAGCCACC GACTGAGCTC GCGATACCTT GCACCTTCTG CACGTTCGCT TGTCTTCGAG CGCAGTCCGG CGATCTGGAC CGCCTGTTTG TCGGCCACGC TAATGGGAAA CCAAGGCCTG GACGGCTTGC TTGCATTCCA TGAGAAGCGC GCCTCGATCT ATGTGCCTCC GCCCATCACG GAGCAGATCA ACCAACACAT GGCCAAGGTT CGAAAGCGGG TAGAGCTCGT GCGACACTTC GACCAGATGA ACCGCCGGAT CGAAGATCAA AGCGTGCAGG ACGACCTGTT CGACAGCGCG AAAGCACTAC AGGTCGAAGT CAACAAAGTC GAACGCCAGC TAGAGACCGG CGGGCTGCAA ATCAGTGCGC TTCAGGAAGG ATTGGCCGAC GCCCTGGCGC CATACGTCGT GCGGCGCCAA CGGGATTGCA TTGGGGAGAG TGCGGATCGA AAGTCCGGCG CTTTCGTTTA CCCAACGATC CGAAGCCACC GAGAGGATAC GGCGCTGAGT GATCAGCAGC GCCAAATCAT TGAGCGGATC AAGGCATTGG CGGAGTGCAT TACGACTGGG GTCACCCTTG TGTCTGCTGA TCCCCAACGC GCTGCCCACA CAGAAATCAA ACTTCACGAC 
AAATCGCGTA TTCACATCCG GAACTTCCTC GCTTTGCTGC GCGCGTCGAT AACCTTTGCA CGCGAAGAAT GGGCCCGCGA GCGCGACAGT GAAGCCGATC AGCGCGGGCG AGCATCAATT GGAGAAAACC TAAGACGCGC CGAGAAACAG AATGCCCGCG GGATTGCCGT TCCGGATGGA GCGCTGCCCG AGGAAGTCGA TGCCGAAACT GAGGACAGCG AAACTCCGAT TTGCGACCGG ATCAGTAGGC TTCTGAATCA TCCCAACCTC GACAGCATTG ATGAATCCCG GGCCGGCGTC ATGCGAGACA TCCTTCGAAA GCATGGCCAT GCCATCTTTC TTGCTGAGCG GGTCGGCGTG CTCGAGGTCT ATGCCAGGCT TCTTGCTGGG CATCGCGGAA GAGGACCCGA AGTCTTCGTG GTCGCACCTG GTGCACGGAT CAAGACCGGT AAGAAGCTGC ATCACATCCG ATCGGGAGCA GATGCGCAGG AGTATTTCGG CATCGATGGG AAACATGTCG ACCCGCTAAA GCCACGGGCC ATGTTCCTCA CATTCCAGAT GGCGGAAGGG ATCAACCTCC AAATCGCCTC TGCACTTGGG ATCATTGGCG TCACGTCCGA CGTCAAGAGC CTCATTCAGG GGTTGGGCCG GATCGACCGG ATCGACAGTC CCAATTCCCG CATCCACTAC TACACCTTCG ACCTGCCCGG GCTGGTTCTG TCATCGGATC ACAAGGCGCG CGCTCGTGTC GCCAGCATCG CTCTCCTGTC AGGTGTTGGC GCCGCAGACG TACCATCAGA GCTGGTTGAG TTTTCTGCCG GTGATCTGAC CGATCTTGTT CTCGAACAGG TCAAGAAACC GCGCATCCTC CGTTCCAACA ACTACTTCGA CCAGTTGGAA GCTCTTCGGC GTGTATTGCC AGCTGACGTT CTGGGCCGGG TCTGGAACGC AAAACCTCGC GGTCTCTGGG GAGCCGAACT CTGCCTTCTG TCCTCAGTTG AACCAAACAC CGTCCTCCTC TTGGGCGGTC GCACAGGCAG CCCCCAAGAT CCTACTGTCC TACCGCCTCG CTTGATTGCC GTTCGAGATG TCGATGGGCG CGCAGAGATC ATTGGCGACC AGGTTGAAGC GGCTAGGCTC CTATCCGCCG CCTATGCCGA AACGCGCCGC CGAGGCATGC AAAGCCACCG CCCAGAACTG AACGCGATAT CCGCTACGTT CAGCCGTCTC GGCAGCACCC TTGCGCATCT GACCCACTGG GATGTTCGGC CAGCTCGAAC GGTTTCACTT CTTTCCTCTC TCGCTAATTT TCTGTCAGGC CACGACACAG GTGATGCTGG TCAAGGGCTC TTCGGTTCCT TAACTCTGCC GACGCTGGAA AAGCTGGCAG AAGCCTGGGC ACACGAACTC GATGTTTTCT GGATCGAGGC GAAGGAGACG GTTAGCCAAA GGAGTGCGTC TGGCGGCGAA ATACCCGACT ACCTCGGGAT CGACGCGATC TGCCGAGCTT TCCAAGAACA GCCAGACGAT GTCCGCTCGG CTGTAAGTGA AAGGATGAAG GAGCTCTTGG CTCGGTGCAC GGCATTGTCA GAGGGGCAAT CGATCGATGT ACTGAGCCGG GTTGCCGTCA TTTTTGAAGT CGAGCCTGAC CGCAGACTAG GCCGTTGA
|
Protein sequence | MGTRERQGLI SRFFSRASET DRDVRTKEGP PQSRHEAWGD NATRSVSGLV KSVLQQPSCL IVTGYQDFLS SLTILLETIE NLGERPEGSI RIVFGSNSET RQVLGGSGRS VAEEARQHFL GARGFSVVDL ADLRAVLAMD AIERGIIKFR IFDPDLAEEK LGRRPPMLHA KLFVGGDKAL SGSANFSING LRRNLEFMDD ADAWPELATA RCAAAEQYWD MGRDWTETAL EILRALIRIV SPEEAVARTV HEAATFTPWR VAGETSTGRP PQPFQADLIY EAAGTVYEHG FAFVEAPTGA GKTDIGKHLA TVLPVSHGQT VFSWGERADQ QRLGSLALIP ASVLKNWTTN APANFKPIKH SHLSRRGKEE TAELDEINRA VRSSASMIVD ESHRLSSRYL APSARSLVFE RSPAIWTACL SATLMGNQGL DGLLAFHEKR ASIYVPPPIT EQINQHMAKV RKRVELVRHF DQMNRRIEDQ SVQDDLFDSA KALQVEVNKV ERQLETGGLQ ISALQEGLAD ALAPYVVRRQ RDCIGESADR KSGAFVYPTI RSHREDTALS DQQRQIIERI KALAECITTG VTLVSADPQR AAHTEIKLHD KSRIHIRNFL ALLRASITFA REEWARERDS EADQRGRASI GENLRRAEKQ NARGIAVPDG ALPEEVDAET EDSETPICDR ISRLLNHPNL DSIDESRAGV MRDILRKHGH AIFLAERVGV LEVYARLLAG HRGRGPEVFV VAPGARIKTG KKLHHIRSGA DAQEYFGIDG KHVDPLKPRA MFLTFQMAEG INLQIASALG IIGVTSDVKS LIQGLGRIDR IDSPNSRIHY YTFDLPGLVL SSDHKARARV ASIALLSGVG AADVPSELVE FSAGDLTDLV LEQVKKPRIL RSNNYFDQLE ALRRVLPADV LGRVWNAKPR GLWGAELCLL SSVEPNTVLL LGGRTGSPQD PTVLPPRLIA VRDVDGRAEI IGDQVEAARL LSAAYAETRR RGMQSHRPEL NAISATFSRL GSTLAHLTHW DVRPARTVSL LSSLANFLSG HDTGDAGQGL FGSLTLPTLE KLAEAWAHEL DVFWIEAKET VSQRSASGGE IPDYLGIDAI CRAFQEQPDD VRSAVSERMK ELLARCTALS EGQSIDVLSR VAVIFEVEPD RRLGR
|
| |
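The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares. Alternative start codons permitted by table 11 are not handled, which does not matter here because the start codon is ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGGGAACCA GGGAGCGGCA AGGACTGATT"))
print(translate("CGCAGACTAG GCCGTTGA"))
```

The two fragments translate to MGTRERQGLI and RRLGR, matching the first ten and last five residues of the protein sequence listed above.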