Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_2456 |
Symbol | |
ID | 8420318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 2817806 |
End bp | 2818819 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 645039059 |
Product | Extracellular solute-binding protein |
Protein accession | YP_003199316 |
Protein GI | 258406574 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACGCC GTATCGCCTT GTCACTGATG CTCCTCATGG TTTTCCTGGG GACCACTGCC GTCTCTACCT CAGCCACCAC ATTGACCTAT GCCAATTTTC CTCCCGCCCC CACATTTCCC TGCGTTCAGA TGGAGCGCTG GGCCGATGAA GTCGAAAAAC GCACCGACGG GGCATTGACC ATCGAGACTT TCCCCGGTGG CACCCTGCTC GGGGCCAAAC AGATCTGGCG TGGCGTCCAA TACGGTCAGG CCGACATCGG CTGCATCAGC CTGGCCTACC AGCCGGGTCT TTTTCCTTTG ATGTCCGTTA TGGAACTGCC ACTCGGGCTT CCTTCAGCTG AAACCGCGAG TACCCTCATG TGGGATCTCT TTACCAGCTA TTCCCCCGAA GAGTTCGACA AGGTCAAAGT CCTGACCATG TTCACCTCAG CCCCCTCTAA TATCATGAGC AAAAAACCGC TTCCCGACCT GGCCAGCCTC CAGGGCGTGG AATTGCGTGG CTCCGGTACC GCGTCCCGCA TCCTCGAGGC CCTGGGGGCC ACCCCGGTCT CCATGCCCAT GCCGGACACC CCTCAGGCCT TGCAAAAAGG TGTTGTCCAG GGCCTTTTCT CCTCGCTGGA AGTCCTTAAA GACCTCAATT TCGCCGCCTA TTGCCAGCAC GTGACCCGTA CCGATCTCCA GGTCTATCCC TTTGCCGTGA TCATGAACAA ACGGGTTTGG GAGGATTTGC CCGAGTCAAC CAAGAAGATT TTGAACGAGC TTGGACCGGA ACAGGCGGCC TGGACCGGCC GGTATATGGA CAATCACGTC CAGAAAGCGC TCGCCTGGGC CCAAAAGGAA CACGGTCTGA CCACCCACGC CTTGTCGGCA ACTGCATTGG AAGCCGTCCA ACCCAAGCTC GACAAACTCA TTGAGGAATG GGTCCAGGAC GCCTCGGCCA AAGGACTTCC TGCCAAAGCG GTTTTGCGCG ATATCAGCGC CCGTCTGGAC AAAGCTGAGG CGAAAGGGGA ATAG
|
Protein sequence | MQRRIALSLM LLMVFLGTTA VSTSATTLTY ANFPPAPTFP CVQMERWADE VEKRTDGALT IETFPGGTLL GAKQIWRGVQ YGQADIGCIS LAYQPGLFPL MSVMELPLGL PSAETASTLM WDLFTSYSPE EFDKVKVLTM FTSAPSNIMS KKPLPDLASL QGVELRGSGT ASRILEALGA TPVSMPMPDT PQALQKGVVQ GLFSSLEVLK DLNFAAYCQH VTRTDLQVYP FAVIMNKRVW EDLPESTKKI LNELGPEQAA WTGRYMDNHV QKALAWAQKE HGLTTHALSA TALEAVQPKL DKLIEEWVQD ASAKGLPAKA VLRDISARLD KAEAKGE
|
| |