Gene Dret_0073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0073 
Symbol 
ID8417877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp94117 
End bp95109 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content56% 
IMG OID645036638 
ProductExtracellular solute-binding protein 
Protein accessionYP_003196953 
Protein GI258404211 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.303636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGG CGCTTTGGAC GGTATTTATC GTCGGACTTG CCCTGAGCCT CTGTACTGCG 
GCTCAGGCCC GGACCTGGAA AGTGTCCCAC GTTCGCCCCC AGGACACTGC CATTGACAAA
GATCTCAACG CTTTCGTGCA GGACGTTGAC GAGGCCACCA ACGGAAAAAT CAACATCAAA
GTTTACGCTG CCAGTTCCTT GGGTGACTAC ACCGTCGTGC AGGAACGGGT CGGCCTCGGC
GCTGTGGAAA TGGCCTGCCA GCCCCCGGCG ACCGGTGCGG ACAAGCGGTT TCAGATCCAA
TACTTCCCAT ACTTGGTGAA AAACTACGAC CAAGCCAAGA AGAATTTTGG CCCTGACGGC
CCCTTGCGCA AAGAAATCGG CAAGCTCTAC GGTGAGCAGG GCATCGAACT TCTGGCTGCC
TGGCCGGTGT ACTTCGGCGG CATCGCCCTG AAAGAAGAAC CCAAGAACCC CGGTGACCCC
ACGGCCAAAA AAGGTCTCAA GGTCCGCGTT CCGCCCATGA AGACCTTCCA GATGCTGGCC
AATAACATTG GCTACATGGC GACACCGCTG CCGTTCTCGG AAGCCTTCAC CGCCGTGCAA
ACCGGTGTTG TCGACGGCGT GATCGGTTCC GGTGCTGAAG GGTACTATGC TTCCTTCCGC
GACGTGACCA ACTACTATGT CCCGATGAAC ACCCACTTTG AAGTCTGGTA CCTCATCGCC
AATGAACGCA TGGTGGAAGG GCTGGACAAG GACGAAATGG CTGGCTTGAA AGCCGCTGCC
CAGCGCTTTG AAGAAAACCG CTGGGACCAA GTGGTCGAAG ACCAGAAGAA AAATGAACAG
CGCCTGGCTG ATTACGGTGC TGAAATCATC GAAATTACTC CTGAAGACCT GACGAAGACC
GCCGAAATCG TGCGCGAAAA CGTCTGGCCT GAAATCCTGA GCGACGTTGG CACCGAATGG
GGCCAATCCG TTCTGGATAA CATCAAGGAG TAG
 
Protein sequence
MKKALWTVFI VGLALSLCTA AQARTWKVSH VRPQDTAIDK DLNAFVQDVD EATNGKINIK 
VYAASSLGDY TVVQERVGLG AVEMACQPPA TGADKRFQIQ YFPYLVKNYD QAKKNFGPDG
PLRKEIGKLY GEQGIELLAA WPVYFGGIAL KEEPKNPGDP TAKKGLKVRV PPMKTFQMLA
NNIGYMATPL PFSEAFTAVQ TGVVDGVIGS GAEGYYASFR DVTNYYVPMN THFEVWYLIA
NERMVEGLDK DEMAGLKAAA QRFEENRWDQ VVEDQKKNEQ RLADYGAEII EITPEDLTKT
AEIVRENVWP EILSDVGTEW GQSVLDNIKE