Gene Information |
Locus tag | Dshi_2343 |
Symbol | |
ID | 5713998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2475833 |
End bp | 2476918 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641268267 |
Product | hypothetical protein |
Protein accession | YP_001533680 |
Protein GI | 159044886 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
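The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not the database's own method. It assumes that Start bp and End bp are inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2475833, 2476918)
print(length)                     # 1086, matching the Gene Length field
print(protein_length_aa(length))  # 361, matching the Protein Length field
```

Both results agree with the Gene Length and Protein Length rows, which supports the inclusive-coordinate assumption for this record.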
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.264147 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCGTC GTTCTTTTTT GCGGACATCG GCACTGGGCG GGACCGCGGC TGCGGCAAGC GCAGGGTTGG CCGCACCGGC CATCGCGCAG AGCACGCGCA CACTGACCAT GGTGACGTCC TGGCCGCGCG GCTTCGCGGT TCTGGATGAT GCTGCGACCT ACTTCAACGA GGCGGTCAAT GCGATGTCGG GCGGCAGCCT GATCATCGAG AAAAAGGCCC CGGGTGAGCT TGTCGGTGCG TTCGAGGTAT TCGACGCGGT GGCCGCAGGT CAGGCCGACA TCTACCATTC GGCCGACTAC TACTGGATCG GACAGCATCC GGGCTACGCA TATTTCTGCG CGGTGCCTTT CGGCGCGACG GCACAGGAAC TGACCAACTG GTACTACCAT GACGGGGGTC AGGACCTGCA CAACGAGCTG GGCGCGATCT TCGGCATGCA CTCCATGCAG TGCGGCAACT CCGGCTCCCA GTCCGGCGGC TGGTTCCGCA ACGAGATCAC CTCGGCCGAA GACTTCAACG GTCTGCGCTT CCGGATGCCG GGCCTCGGCG GGCAGGTGCT CGGCAAGCTC GGCGCCTCGG TGCAGAACAT CCCCGGTGGC GAGCTCTACC AGGCGCTGTC TTCGGGTGCC CTCGACGGTC TGGAGTGGGT CGGTCCCGCC GCGGACGAGA AGGCCGGCTT CCAGGAGGTC GCCAAGATCT ACTACACCGC CGGCTTCCAC GAGCCGGGCT CCGGGCTGAC CGCATCGGTC AACCTGGACG TCTGGAACGA GCTGTCGCCG GAACATCAGG CGATCATGGA CAACGCGGCC AAGGCCACGA CGAACTACCA GCTGAGCCAG ACACTGGCCA TGAACGGCGC AGCCTTGGCG CGTCTGCAGG CCCAGGGCGT GCGCACCCTG CAATTCCCCG ACGATGTCTG GGATGCGTTC GGCGCGGCCT CCAAGGAAGT TCTCGACGAA AACATGGGCG ACGAGCTCTA TGCCAAGATC CGCAACAGCT TCGACGCCTC GCTGGCCAAG AGCTCGGACT GGCTGCTGAA GTCGGACGCC TATTTCGTGG AGCAGCGCAA CCGCGTGCTC GGCTAA
|
Protein sequence | MDRRSFLRTS ALGGTAAAAS AGLAAPAIAQ STRTLTMVTS WPRGFAVLDD AATYFNEAVN AMSGGSLIIE KKAPGELVGA FEVFDAVAAG QADIYHSADY YWIGQHPGYA YFCAVPFGAT AQELTNWYYH DGGQDLHNEL GAIFGMHSMQ CGNSGSQSGG WFRNEITSAE DFNGLRFRMP GLGGQVLGKL GASVQNIPGG ELYQALSSGA LDGLEWVGPA ADEKAGFQEV AKIYYTAGFH EPGSGLTASV NLDVWNELSP EHQAIMDNAA KATTNYQLSQ TLAMNGAALA RLQAQGVRTL QFPDDVWDAF GAASKEVLDE NMGDELYAKI RNSFDASLAK SSDWLLKSDA YFVEQRNRVL G
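The sequences above are displayed in space-separated blocks of ten characters, so any calculation on them needs the whitespace stripped first. The following sketch shows one way to compute a GC fraction from such a string. The function name `gc_fraction` is hypothetical, and whether the GC content field in this record was derived in exactly this way is an assumption.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide string, ignoring whitespace."""
    # Remove the display spacing between 10-base blocks and normalise case.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the gene sequence, as displayed in the record.
print(gc_fraction("ATGGATCGTC GTTCTTTTTT"))  # 0.35
```

Passing the full Gene sequence value to the same function gives a figure that can be compared against the GC content row.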