Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1652 |
Symbol | |
ID | 5713217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1717206 |
End bp | 1718564 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641267568 |
Product | putative solute-binding protein 1 family |
Protein accession | YP_001532995 |
Protein GI | 159044201 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.182094 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00000465499 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGACACA CACTTCACGC GAGTGCGGCT GCGCTCGCCC TGTCTGCTGG CATGGCCGGA GCCGGAGGCC ATCTTGCGTT CACGCCGGGA GAGGGTGAGT TCAACTGGGA CAGCTATCAA GCGTTCGCCG AGGCCACCGA CCTGTCCGGG CAGGACCTGT CGATCTTCGG GCCCTGGCTC GCCGGGGAGG CCGATGCATT CTCAAACCTT GTGGCCTTCT TCAACGAAGC GACCGGGGCA AATGCGACCT ATGTGGGCTC CGACAGTCTC GAGCAGCAGA TCGTGATTGA CGCCGAGGCG GGTTCCGCTC CGGACCTGAC CGTGTTTCCA CAGCCGGGTC TGGCGACCAC CATGGCAGCG CGCGGCTTCC TGACCCCGCT TCCCGATGGC ACCGACGACT GGCTGCGTGA GAATTATGCC GCCGGGCAGT CCTGGATCGA TCTTGGCACC TATGCGGACG GGTCGGGCAA CGACCAGCTC TACGGCTTTT TCTTCAATGT AAACGTGAAG TCGCTGGTCT GGTACATCCC CGAGAACTTC GAGGATTTCG ATTACGAAGT TCCCGAAACC ATGGAAGAGT TCAAAGCGTT GATGGACCAG ATGGTCGAGG ACGGTCAGAC GCCGCTTTGC GTCGGACTGG GCTCTGGGGG GGCTACGGGC TGGCCGGCGA CCGATTGGGT TGAGGATCTG ATGCTGCGCA CCCAGCCGCC CGAGGTCTAT GATGCTTGGG TGTCCAACGA GATGCCCTTC GACGACCCGC GCGTGGTTGC GGCGATCGAG GAGTACGGCA GCTTCACCCG CAATGACGAT TACGTGGTGG GCAATGCCAA CGACACCGCG TCTGTCGATT TCCGCGAAAG CCCGCTGGGC CTGTTTGCTT CGCCCCCCGC CTGCATGATG CACCGCCAGG CGAGCTTCAT TCCCGCCTAT TTCCCCGAGG GCACCGAGCT GGGCGAGGAT GCGGATTTCT TCTACTTCCC GGCCTTTGAG GAAAAGGACT TGGGGCGTCC GGTTCTGGGT GCCGGTACGC TGTTCGCGAT CACCAACGAG AACCCGGCTG CAAGCGCCTT CATCGAGTTT CTCAAGACGC CCTTCGCCCA TGAGATCATG ATGGCGCAGG ATGGGTTCTT GACCCCGTTC AAGGGCGCGA ACCCCGCGGC TTATGCCAGC GATACGCTGC GCGGGCAGGG CGAGATCCTG ACCAATGCGA CCACCTTCCG CTTCGACGGC TCGGACCTGA TGCCTGGCGG CGTCGGGGCA GGGACCTTCT GGACTGGTAT GGTCGATTAC TCCTCCGGTG CGAAATCCGC CGCCGACGTG GCGAGCGAGA TCCAGGCCTC CTGGGAATCT CTCAAGTAA
|
Protein sequence | MRHTLHASAA ALALSAGMAG AGGHLAFTPG EGEFNWDSYQ AFAEATDLSG QDLSIFGPWL AGEADAFSNL VAFFNEATGA NATYVGSDSL EQQIVIDAEA GSAPDLTVFP QPGLATTMAA RGFLTPLPDG TDDWLRENYA AGQSWIDLGT YADGSGNDQL YGFFFNVNVK SLVWYIPENF EDFDYEVPET MEEFKALMDQ MVEDGQTPLC VGLGSGGATG WPATDWVEDL MLRTQPPEVY DAWVSNEMPF DDPRVVAAIE EYGSFTRNDD YVVGNANDTA SVDFRESPLG LFASPPACMM HRQASFIPAY FPEGTELGED ADFFYFPAFE EKDLGRPVLG AGTLFAITNE NPAASAFIEF LKTPFAHEIM MAQDGFLTPF KGANPAAYAS DTLRGQGEIL TNATTFRFDG SDLMPGGVGA GTFWTGMVDY SSGAKSAADV ASEIQASWES LK
|
| |