Gene Dret_1361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1361 
Symbol 
ID8419190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1588922 
End bp1590472 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content56% 
IMG OID645037937 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003198227 
Protein GI258405485 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value1.66548e-10 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.279813 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGGG TCCTATGTAT TTTGAGTGTG CTGATTTTCA GCATACTCCT GGTCGGGAGT 
GTTCAGGCGA AGACGTTGCG CTTAGCCATG GACGCAGATC CCTATTCCTT GGATCCCCAT
GTGCAGCTGT CTGGCGGCAT GCTCCAGTAT TCCCATCTGG TCTTTGACCC GTTGGTGCGC
TGGACCAAGG ATATGGAATT CGAACCCCGC CTGGCCACGA GCTGGGAACG TATCGATGCG
ACCACCATGC GCTTCCACCT GCGCGAAGGG GTCACCTTCC ATTCCGGCAA CCCGTTTACG
GCCAAGGATG TTGTCTGGAC ATTGAAACGC TTAAAGAAAA GCCCGGATTT CAAAGGCTTG
TTTGAACCAT TCGAGGGAGC CAAAGCGGTG GACCAATACA CCGTGGATGT GGTGACCAAA
AAACCTTATC CTTTGCTGTT GAACATGGCG ACCTATATTT TCCCCATGGA CAGCGAGTTC
TACACCGGTG AGGACAAGAA CGGCAATCCG AAGGACGCGA TCAAGAAGAT CGGCTACACC
TTTGCCAACA CCCACGAGTC CGGTACCGGG AAGTACAAGG TTGTTGAGCG GCAGCAGGGC
GTCAAGGTCG TCTACGAGGC CTACGACAAC TATTGGGACG AGGACAGCGG CAATGTCGAC
AAGATTATCC TCACGCCGAT CAAAAAGGAC TCCACCCGCG TGGCGGCCCT TCTTTCCGGC
GACGTGGATT TCATCATGCC TGTGCCGCCG CAGGATTACG ACCGGCTGGA AAAACGGGAT
GGCATCGATC TGGTGACCAT GTCCGGTAGC CGGGTGATCA CGTTCCAGCT GAACCAGGAA
CGCCGCCCTG AATTTGCGAA CAAGAAAGTG CGCCAGGCCA TTGTCCATGC CGTGAACAAC
GTCGGCATTG CCCAGAAGAT CATGGAAGGC CGGGCCACTC CGGCCGCGCA GCAGGCCCCG
GAAGGGTTTG CCAGCTACCA GCCTGAGCTG ACGCCGCGCC ACGATGTGGC CAAGGCCAAG
GAACTCATGA AGGAAGCCGG CTATCCCGAC GGCTTTGAGT GCTCCATGAT TGCCCCGAAC
AATCGGTACG TCAAAGACGA AAAGATCGCT CAGGCTGTGG CGGCCATGCT CTCCAAGATC
GGGATCAAGG CCAATCTGAC CACCATGCCC AAGGCCCAGT ACTGGAACAA GTTCGACGCC
CAGGTGGCCG ACATTCAAAT GATCGGCTGG CACCCGGACA CCGAGGATTC GGCCAATTAC
ACCGAATTCC TGCTCATGTG CCCGAACAAG GAAACCGGAT ACGGCCAATA CAACAGCGGC
AACTACTGCA ACAAGGAAGT CGATCAGTTC ATTCTGGACG CCCAGACCGA GACCGATCAG
GAGAAGCGGA CCGCCATGCT GAAGAAGGTG GAGCGGATCC TCTATGAAGA TGCTGCGTTC
GTGCCCTTGC ACTGGCAGCA CCTCTCCTGG GCCGGCAAGG ACAATCTGAA GATCGAGCCC
ATCGTGAATA AGCAGAATTT CCCATATTTC GGGGACCTGG TTATCCAGTA A
 
Protein sequence
MKRVLCILSV LIFSILLVGS VQAKTLRLAM DADPYSLDPH VQLSGGMLQY SHLVFDPLVR 
WTKDMEFEPR LATSWERIDA TTMRFHLREG VTFHSGNPFT AKDVVWTLKR LKKSPDFKGL
FEPFEGAKAV DQYTVDVVTK KPYPLLLNMA TYIFPMDSEF YTGEDKNGNP KDAIKKIGYT
FANTHESGTG KYKVVERQQG VKVVYEAYDN YWDEDSGNVD KIILTPIKKD STRVAALLSG
DVDFIMPVPP QDYDRLEKRD GIDLVTMSGS RVITFQLNQE RRPEFANKKV RQAIVHAVNN
VGIAQKIMEG RATPAAQQAP EGFASYQPEL TPRHDVAKAK ELMKEAGYPD GFECSMIAPN
NRYVKDEKIA QAVAAMLSKI GIKANLTTMP KAQYWNKFDA QVADIQMIGW HPDTEDSANY
TEFLLMCPNK ETGYGQYNSG NYCNKEVDQF ILDAQTETDQ EKRTAMLKKV ERILYEDAAF
VPLHWQHLSW AGKDNLKIEP IVNKQNFPYF GDLVIQ