Gene Information |
Locus tag | Dshi_1098 |
Symbol | |
ID | 5711066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1126077 |
End bp | 1127306 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641267009 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_001532441 |
Protein GI | 159043647 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACTCG TGCCCCTTGC CGAAGCCGCC GAACTGCCCC GTTGGCGGCA ACCGGTGACG CTGCTGTTCC TGATGGCCTT TGCGATGCCC ATCGCGTTTG CCACCTGGAG CGCGCTTCTC AACAATTTCG TGATCGAGCG GGCGGCCTTC ACCGGTGTGG AGATCGGCTG GCTGCATACG GTGCGGGAGA TCCCGGGCTT CCTCGCCGTG GGTGTCATCG CCCTGATCAT GATCATCCGC GAACAGGTTC TGGGCCTCGC CTCGCTCTGC CTTTTGGGGC TGGCCAGTGC CATGACCGGG TTTTTCCCGA CCTATTACGG CATTCTGGCG CTGACGCTGC TCAGTTCCAT CGGGTTTCAC TATTATGAGA CGGTCAATCA GTCCCTGCAA CTGCAATGGA TCGCCAAGGA CCGCGCCCCG CAGACCCTGG GCTGGATCGT GGCGGCGGGC TCGGCGGCGT CGCTCCTGAG CTATGGCACG CTGGTCGTCA GCTGGAAGGC CTTCGATCTG AGCTACAGCT TCGTCTACCT GACATCGGGC GGGATCACGG CGGCGATCGC GCTGTTTTGC ATGTTCGCCT ATCCGCAATT CGAGAGCCCC AATCCGCAGA ACAAGCGCAT GGTGCTGCGG CGGCGTTACT GGCTTTACTA CGCGCTGCAA TTCATGTCCG GGGCGCGGCG GCAGATTTTC GTGGTCTTCG CCGCCTTCAT GATGGTCGAG CATTTCGGCT TCGAGGTGCA TCAGGTCACG GCCCTGTTCC TGATCAACTT CCTTGCCAAC ATGCTCTGTG CGCCGCTCAT GGGCAAAGCC GTCGCGCGGT TCGGGGAGCG CAACGCGCTC GCCTTTGAAT ATACCGGCCT GATCTGTGTC TTCCTGGCCT ATGGCGGCAT CTATTGGCTG GGCTGGGGGG TGATGGTGGC CGCGGCGCTC TATGTGCTGG ATCACCTGTT CTTCGCACTG GCTTTCGCGC TGAAGACCTA TTTCCAGAAG ATCGCGGACC CCGCGGACAT CGCGCCCACG GCCGCGGTGG CCTTCACGAT CAACCATATC GCGGCGGTGT TCCTGCCGGC CTCGCTGGGC TACATGTGGG TCGTGTCGCC GGCGGGCGTT TTCTGGCTGG CCGCTGCCAT GGCTGCGACC TCTTTGTTCC TGTCGTTGCT GATCCCGCGC CATCCGGGCC CGGGGCACGA GACGATCTTC CAGTCCCGCC TCGCCACCCC GGCGGAGTAG
|
Protein sequence | MRLVPLAEAA ELPRWRQPVT LLFLMAFAMP IAFATWSALL NNFVIERAAF TGVEIGWLHT VREIPGFLAV GVIALIMIIR EQVLGLASLC LLGLASAMTG FFPTYYGILA LTLLSSIGFH YYETVNQSLQ LQWIAKDRAP QTLGWIVAAG SAASLLSYGT LVVSWKAFDL SYSFVYLTSG GITAAIALFC MFAYPQFESP NPQNKRMVLR RRYWLYYALQ FMSGARRQIF VVFAAFMMVE HFGFEVHQVT ALFLINFLAN MLCAPLMGKA VARFGERNAL AFEYTGLICV FLAYGGIYWL GWGVMVAAAL YVLDHLFFAL AFALKTYFQK IADPADIAPT AAVAFTINHI AAVFLPASLG YMWVVSPAGV FWLAAAMAAT SLFLSLLIPR HPGPGHETIF QSRLATPAE
|
| |
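The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon. The function names are hypothetical and are not part of any database tooling.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; spaces and other symbols are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return 100 * sum(b in "GC" for b in bases) / len(bases)


# Values copied from the Gene Information table for Dshi_1098.
length = gene_length(1126077, 1127306)
residues = protein_length(length)
print(length, residues)
```

With the values above, the computed lengths agree with the reported Gene Length (1230 bp) and Protein Length (409 aa). `gc_percent` can be applied to the Gene sequence field as-is, since it skips the spaces between the 10-base blocks.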