Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_2029 |
Symbol | |
ID | 8419874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 2327433 |
End bp | 2328260 |
Gene Length | 828 bp |
Protein Length | 275 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 645038617 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_003198891 |
Protein GI | 258406149 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00497658 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00148627 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCTTCG GCAAAATTCT CTTTCTGGCC GTCATTTTTG CCCTGTTCTC TGCCCAGGCA GTGCTTGCCG GCCCCACGTA CGACCAGGTC ATGAAAGACA AGGTCATCCG CGCCGGCTTG ATGACCGACT CCATTCCTGG TGCATTTTAC AACAAAGACA ACGAGTGGGT CGGCTTTGAC GTCGACATCG CGAAAGAAAT CGCCAAGCGA CTCGATTGCG AACTCAAACG GGTGCCGGTG ACCAACAAAA CCCGGATCGC CTTTGTTCAG CAGGGTCGCA TCGACATGTC TGTGGCCAAC ATGACTCATA AACGCGAACG GGACAAATCG ATCGACTTTT CGATCACCTA CTTTTTCGAC GGCCAAAAGC TCCTCGTCAA AAAGGGCAGC TTCGCGAACT GGGACGAGAT CGTAGGCCAG AAGATTGCCA CCATGCAGGG CACGACCTCG GAGGTCAATC TGAAAAACAA GCTGGAAGAG CTCGGTGACA CCAACGCCGA CGACAACGTC ATCTCCTTCC AGAAGGAATC GGAATGCTTC CAGGCCCTGG AAATGGGCCG CGTTGCCGGC TGGTCCACAG ACTCGACCAT CCTTCTGGGC TATGCCGCGA AGCGTCCGGG AGAATACGAA CTCATCGGCG ACTTTATCAG CGACGAACCC TATGGCATCG GCCTGCCTGA AGACGATTCC AAATGGCGCG ACACCATCAA CTTCACCATC CAGGACATGT GGCTGGATGG GACCTACATG GACATCTACA ACAAATGGTA TGGTCCGGAC ACTCCGTACT CCTTCCCCAT GACTGAACAA ATCGAAGTCT GGCCCTAG
|
Protein sequence | MRFGKILFLA VIFALFSAQA VLAGPTYDQV MKDKVIRAGL MTDSIPGAFY NKDNEWVGFD VDIAKEIAKR LDCELKRVPV TNKTRIAFVQ QGRIDMSVAN MTHKRERDKS IDFSITYFFD GQKLLVKKGS FANWDEIVGQ KIATMQGTTS EVNLKNKLEE LGDTNADDNV ISFQKESECF QALEMGRVAG WSTDSTILLG YAAKRPGEYE LIGDFISDEP YGIGLPEDDS KWRDTINFTI QDMWLDGTYM DIYNKWYGPD TPYSFPMTEQ IEVWP
|
| |