Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0528 |
Symbol | araH2 |
ID | 5711981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 512265 |
End bp | 513269 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641266430 |
Product | ribose/xylose/arabinose/galactoside ABC-type transport system protein |
Protein accession | YP_001531875 |
Protein GI | 159043081 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACAT CTGTGAAAGA CTTCACCGCC AGCCCGGAGT TCCGGTTGCT GATCATCATG GTTTTTGTCT TCGGCTTGAT GTCAGTCCTG TCGCCGGATC GCTTTCTGTC ATCTCAGAAC CTGACCTCGA TGGCGTTTCA GTTTCCCGAG TTCGCGATCC TCGCACTGGC CATGACCATC GCCATGATGA CCGGCGGGAT CGACCTCTCG GTGGTGGGCA TCGCCAACCT GTCGGCGGTT GTCGCCGCAC TGATCCTCAC CCATTTTTCG AATGCCGAAA TGCCCGCCGC GCAATCCGCG ATCTGGCTCG CAATCGCGAT CACCGCAGCG CTGTGCATCG GGGCGATTGC CGGTTTGATC AACGGATCGC TGGTGGCATT CTTCGGCCTT CCGCCCATTC TCGCCACCCT GGGATCCGGT CTTGTGTTTA CCGGTTTTGC CATTGCGATG ACCGGTGGCA GCGCAGTCAT GGGCTTTCCC GACACGGTCG CCTTGATCGG AAACGCGACC TTGGCGGGTG TGCCTGTCCC TCTCATCCTG TTTGCGGTTC TGGCGTTCCT GCTGCACCTG GTCCTGACCC GCACCGCCTT CGGATTGCGC GTGACGATGT ACGGGGCCAA TCCGCTCGCG GCGCTTTATG CCGCCATCGA CATCAACCGC ATGCTGCTCA AGGTCTACGT GATCGCCGGG ATGTTCGCGT CGGTTGCCGG GCTCATCATA ATGAGCCGTG CCAATTCAGC GAAGGCCGAT TACGGGTCTT CCTACCTGCT GCTGGCGGTT CTGATCGCCG TGCTGGGCGG GGTGAACCCC TATGGTGGCT ATGGTCGCGT CATTGGCGTG GTGTTGGCGG TGCTGTCTAT GCAGTTCCTG TCCAGCGGCC TGAACATGCT TGGGGTGTCA AATTTCGCAC GCGAGTTGAT CTGGGGCGCG CTGCTAATCC TCGTCATGGT GATCAACACC CATGCCGTGA CCGCCCTGCG CAGTAGCCTG AAGCCACCAA AATAA
|
Protein sequence | MNTSVKDFTA SPEFRLLIIM VFVFGLMSVL SPDRFLSSQN LTSMAFQFPE FAILALAMTI AMMTGGIDLS VVGIANLSAV VAALILTHFS NAEMPAAQSA IWLAIAITAA LCIGAIAGLI NGSLVAFFGL PPILATLGSG LVFTGFAIAM TGGSAVMGFP DTVALIGNAT LAGVPVPLIL FAVLAFLLHL VLTRTAFGLR VTMYGANPLA ALYAAIDINR MLLKVYVIAG MFASVAGLII MSRANSAKAD YGSSYLLLAV LIAVLGGVNP YGGYGRVIGV VLAVLSMQFL SSGLNMLGVS NFARELIWGA LLILVMVINT HAVTALRSSL KPPK
|
| |