Gene Dret_0557 details


Gene Information

Locus tag: Dret_0557
Symbol:
ID: 8418369
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 669390
End bp: 670364
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 55%
IMG OID: 645037125
Product: Extracellular solute-binding protein
Protein accession: YP_003197432
Protein GI: 258404690
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID:
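
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 669390
END_BP = 670364
GENE_LENGTH_BP = 975
PROTEIN_LENGTH_AA = 324


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Residues encoded by a CDS whose last codon is a stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

If either comparison failed, the likely causes would be a different coordinate convention (for example 0-based, half-open) or a partial CDS without a stop codon.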


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCGGA AATTGATCTG TTTTCTCGGA ATCAGCCTCG CGTTAGCCGC CCCAGCGTAT 
GGCGCCGCCA TGAAAATGAA TTGCAATGCT ATTTATCCAG CGTCGAATTT CCACACCCAA
GGTGCTGAGC ATTTCGCCGA ATTGGTCCAC AAATATACTG ACGGCGATAT CCAGATCACG
GTCCACTCCG GCGGGAGTCT GGGGTTTGAA GGCAGCGAAC TGCTCAAAGC TGTCAAGGAC
GCCTCCGTGC CCATGTCGGA TATCCTGATG GGCGTGGTTG CCGGCAGTGA AGAAATTTTC
GGTTTGAGCA CATACCCGCG GATCGTCAGT TCGTATGCCG AGGCACGGGA ATTGTATGAG
GCTGCATTGC CTGCGTACAA AAAAGCCTGC CAAAAATGGA ACCAAAAATT CCTGTACGCC
GCTCCGTGGC CGCCCAGCGG TCTTTTCAGC CAGTCCAAGG TTCAATCCGC TGCGGATATC
GATGGACTCA AGACCAGGAC CTACGACAAG AATGGAGCGC AATTCTTGAA GAAGCTTGGC
GGCAACCCGG TGTCCATGCC CTGGGGCGAG GTGCCGTCGG CTCTGAATAC CGGTCTGATC
GATTCCGTTT TGACCTCGGC TACCTCCGGC AAGGACGGCA AGTTCTGGGA AGTCCTGGAC
CATTTCACCG CGCTGCATTT CGCGTATCCG CTCAATATGC TGACCATCAA TATGGACTAT
TGGAACGCCT TGTCCGCTGA ACAGCAGTCG GCGTTGGAAA AAGCGGCCGC AGAGACCGAG
TCCTTCCAGT GGGAAGCTTC GAAAAAGAGC AATCGTGACT CGTTGAAGGT CTTGGAGGAC
AACGGCCTGC GAATCACTGA GGTGGATGCG GCTCTGGCCG AAAAATTGGA CGCGGCTGCC
GCGGACATTT TTGAGGAATT CAAAGCCGAG GCGGACGAAG ATACCAAAAA GGCCCTTCAG
GCCATCGGGA TGTAA
 
Protein sequence
MLRKLICFLG ISLALAAPAY GAAMKMNCNA IYPASNFHTQ GAEHFAELVH KYTDGDIQIT 
VHSGGSLGFE GSELLKAVKD ASVPMSDILM GVVAGSEEIF GLSTYPRIVS SYAEARELYE
AALPAYKKAC QKWNQKFLYA APWPPSGLFS QSKVQSAADI DGLKTRTYDK NGAQFLKKLG
GNPVSMPWGE VPSALNTGLI DSVLTSATSG KDGKFWEVLD HFTALHFAYP LNMLTINMDY
WNALSAEQQS ALEKAAAETE SFQWEASKKS NRDSLKVLED NGLRITEVDA ALAEKLDAAA
ADIFEEFKAE ADEDTKKALQ AIGM
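
The relationship between the gene sequence and the protein sequence can be reproduced with a small script. The following is a minimal sketch in plain Python, assuming the sequence is pasted in the space-grouped form shown above; the helper names `clean`, `translate` and `gc_percent` are illustrative. The codon assignments are those of NCBI translation table 11, which are the same as the standard code. Special handling of alternative start codons is omitted, which is harmless here because the gene begins with ATG.

```python
from itertools import product

# Codon assignments for translation table 11, with codons enumerated in the
# conventional T, C, A, G order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence from this record.
first_row = "ATGTTGCGGA AATTGATCTG TTTTCTCGGA ATCAGCCTCG CGTTAGCCGC CCCAGCGTAT"
print(translate(first_row))  # MLRKLICFLGISLALAAPAY, the first 20 residues listed above
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the Protein Length and GC content fields of the record.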