Gene Dret_1056 details


Gene Information

Locus tag: Dret_1056
Symbol:
ID: 8418881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1247979
End bp: 1248896
Gene Length: 918 bp
Protein Length: 305 aa
Translation table: 11
GC content: 58%
IMG OID: 645037628
Product: Substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_003197922
Protein GI: 258405180
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components
TIGRFAM ID:
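
The start, end, gene length, and protein length fields above are mutually consistent if the coordinates are read as 1-based and inclusive and the protein length excludes the stop codon. Both readings are assumptions, since the record does not state its conventions. A minimal sketch of the check, with hypothetical helper names:

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp):
    """Residue count for a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1247979, 1248896)
print(length)                     # 918
print(protein_length_aa(length))  # 305
```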


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.511062
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0333795
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTGAAAC TTCGCAATCT GTCTGTGTGC AGTCTTGTAC TGACCTTATT GGTTGTGGCC 
GGCGTGGCCC GGACAAGCTC TGCCGGGCAC GAAACACCGG TGACCTTGGC GTATGTCGAG
TGGTCTTCAG AGGTGGCGAG CACGAATCTG GTCAAAGCAG TTATTCAGGA GAAACTCGAC
CGCCCGTGCC GCATTGTGGC CATGTCGGCC GATGAGATGT GGCGGGCTGT GGCCCAGGGA
TCTGTGGACG GCATGGTATC GGCTTGGCTT CCGGGAACAC AGAGCGAATA TTTGCAACGC
TATGAAGGCC AGGTCATTGA TCTGGGACCG AACCTGGAGG GAACCCGGAT CGGTCTGGTC
GTTCCCAAGG TGACCACCGG CCGACAGACA GCCGGCAGTG GTCTCGAGAA CGAACCTTAC
ATCCCTGTGA CCTCTATCGC CGAGTTGCCG GAGTATGCAG ACAAATTCGA CGGCAAGATC
ATCGGCATTG ATCCGGAAGC CGGGATCATG CACCGGACCG AGGAAGCGTT GCGCGCCTAT
GGGCTGCACA ATTATACCCT GGTCTCCGGC AGTGAAGTGG CCATGACCGC TGAACTGGCC
GATGCCATCC GCAAACGGCA GTGGGTGGTG GTTACCGGCT GGGAACCGCA TTGGATGTTC
GCCCGCTGGC GCTTGGCCTT CCTCGACGAC CCCAAAGAAG TCTTTGGCGG ACGGGAAGCG
ATCCATACCG TTGTTCGCTC GGACTTGCGG GAGGATATGC CGGACGTCTA CAGGTTTTTG
GACAACTTCC AATGGACATC AGAGGATATG GAACAACTCA TGCTCTGGAT AGAAGAGCGA
AAGGGCGTTT ACCCATATGA AAGCGCCCGC CGCTGGATCC GGTACCATAA GGATCAGGTG
GCCTCATGGC TGCCGTGA
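
The record reports GC content as a percentage but does not say how it was computed. A minimal sketch, assuming the usual definition (G plus C bases divided by total bases) and a hypothetical helper name, shown on the first ten bases of the gene sequence:

```python
def gc_percent(sequence):
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    dna = "".join(sequence.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


print(gc_percent("GTGTTGAAAC"))  # 40.0
```

Pasting the full gene sequence into `gc_percent` would give the value to compare against the reported 58%.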
 
Protein sequence
MLKLRNLSVC SLVLTLLVVA GVARTSSAGH ETPVTLAYVE WSSEVASTNL VKAVIQEKLD 
RPCRIVAMSA DEMWRAVAQG SVDGMVSAWL PGTQSEYLQR YEGQVIDLGP NLEGTRIGLV
VPKVTTGRQT AGSGLENEPY IPVTSIAELP EYADKFDGKI IGIDPEAGIM HRTEEALRAY
GLHNYTLVSG SEVAMTAELA DAIRKRQWVV VTGWEPHWMF ARWRLAFLDD PKEVFGGREA
IHTVVRSDLR EDMPDVYRFL DNFQWTSEDM EQLMLWIEER KGVYPYESAR RWIRYHKDQV
ASWLP
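
The protein sequence begins with M although the gene sequence begins with GTG. This is expected under translation table 11, where GTG is one of several alternative start codons and an initiating codon is translated as methionine. The following sketch builds the codon table in plain Python (table 11 assigns the same amino acids as the standard code) and translates the first and last blocks of the gene sequence. The function name is hypothetical, and the start codon set is the one listed for table 11 in the NCBI genetic code tables.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG order for first, second, and third codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(sequence):
    """Translate a CDS, reading a recognized first codon as M and
    stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        aa = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate_cds("GTGTTGAAAC TTCGCAATCT GTCTGTGTGC"))  # MLKLRNLSVC
# Last 18 bases, ending in the TGA stop codon.
print(translate_cds("GCCTCATGGC TGCCGTGA"))               # ASWLP
```

Both fragments match the corresponding ends of the protein sequence listed above.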