Gene Information |
Locus tag | Dshi_1834 |
Symbol | |
ID | 5712825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1913539 |
End bp | 1914555 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641267757 |
Product | putative sialic acid-binding periplasmic protein |
Protein accession | YP_001533177 |
Protein GI | 159044383 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
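The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon; neither convention is stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1913539
end_bp = 1914555
gene_length_bp = 1017
protein_length_aa = 338

# Assumption: both coordinates are inclusive, hence the +1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which
# occupies one codon but contributes no amino acid.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span (bp):", span_bp, "matches table:", span_bp == gene_length_bp)
print("protein (aa):", expected_protein_aa,
      "matches table:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the span is a whole number of codons and both derived values agree with the listed Gene Length and Protein Length.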
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.557031 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATTTC TTTCCCGGAT GACGTCCAGT GCGGCAATTG CACTGACCGC GACCATCATG AGCATGGGCG TGGCCGATGC AAAGAACTTC AAGATCGCCG TCGGTGACAG CGGCGGCAGC AGCCAGGAGG CCACCGGTCT CGCCTTCGTC GAAGCCCTCG AAGAACTTTC GGGCGGTGAG CACACCGGAA CCCTGTTCCT GAACGGACAG CTCGGCTCCG AGCAGGACAC CGTCAACGAG GCCGCCATCG GCACGCTCGA CATGTCGATC CTGGCCATCA ACAACGTCAC GCCCTTCTCG CCCACCGTGG GCGTGTTCTC GCTGCCCTAT GTCATCCTCA GCCTCGAAGA TGCCGAGAAG CTGACCCAGG GTCCGATCGG CGCAGAGCTG ACCGAGAACA CCATCAACGA CGCAGGCGTG CGCATCATTG CCTGGACCTA CACCGGCTTC CGGCGCCTGA CCAACTCCAA GCAGCCGGTC ACCTCGGTCG CCGACCTGCA GGGGTTGGTG ATCCGCGTTC CCAAGAACGA GATCATGATC GACACCTACA AGGCCTGGGG CATCAGCCCG ACGCCGATGG CATGGTCCGA GACCTTCGCC GGCCTGCAGA CCCAGGTCGT CGATGGCCAG GACAATCCCT ACATCACCAT CAACGCGATG AAGTTCTACG AGGTACAGAA GTACGTCACG AACCTGCGCT ACATCTTCTC GATCGAGCCG CTGATCATCT CCGAGCAGGT GTTTCAGGAA CTCTCCGCCG AAGACCAGGA GATCGTTCTC GAAGCCGGCA AGCGCGCCAC GGCCGCCTCG TCCCAGTTCC TCCGCGAGAA GGAAGCCGAG ATCAAGGAAC TGCTCATCGA AAAGGGCATG GAGATCATGG ATCCGGTGAA CAACGAGGAA GAGTTCATCA CGCTCGCGAC CGAGGCCGTC TGGCCGAAAT TCTACGACAG CATCGGCGGC ATCGAGAAGA TGAATGCCGT CCTGGCCGAG ATCGGCCGCG ATCCCGTCAC CGAATAA
|
Protein sequence | MTFLSRMTSS AAIALTATIM SMGVADAKNF KIAVGDSGGS SQEATGLAFV EALEELSGGE HTGTLFLNGQ LGSEQDTVNE AAIGTLDMSI LAINNVTPFS PTVGVFSLPY VILSLEDAEK LTQGPIGAEL TENTINDAGV RIIAWTYTGF RRLTNSKQPV TSVADLQGLV IRVPKNEIMI DTYKAWGISP TPMAWSETFA GLQTQVVDGQ DNPYITINAM KFYEVQKYVT NLRYIFSIEP LIISEQVFQE LSAEDQEIVL EAGKRATAAS SQFLREKEAE IKELLIEKGM EIMDPVNNEE EFITLATEAV WPKFYDSIGG IEKMNAVLAE IGRDPVTE
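The sequences above are printed in space-separated blocks of 10 characters, so the spaces have to be stripped before any analysis. The following is a minimal sketch, not the database's own tooling: `clean`, `gc_fraction`, and `translate` are hypothetical helper names, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs only in the start codons it permits, and this gene starts with ATG). Although the gene lies on the minus strand, the listed gene sequence starts with ATG and ends with TAA, so it appears to be given in coding orientation and is translated here without reverse-complementing.

```python
# Codon table in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence row, used as a short example.
prefix = clean("ATGACATTTC TTTCCCGGAT GACGTCCAGT")
print(len(prefix), round(gc_fraction(prefix), 3))
print(translate(prefix))
```

Translating this 30 bp prefix gives the first 10 residues of the listed protein sequence. Pasting the full gene sequence row into `clean` allows the same check against the GC content and Protein Length fields.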