Gene Dret_0128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0128 
Symbol 
ID8417932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp167972 
End bp168925 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content55% 
IMG OID645036693 
ProductSubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_003197008 
Protein GI258404266 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID[TIGR03414] choline ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0344817 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.869462 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGGTT TTTCCAAGAA AGTTCTTGTT GCATTGATCG CTTTTGCCGT TCTCACCATG 
AGCGCTTCAC AGGTCCTGGC GGCCAAAGAA GTGCGCTTCG CCAGCGTCAG CTGGACCGGC
GTGACCACCA AGACCGAACT GGCTGTGCGC ATCCTGCGCA GCCTGGGCTA CGAGGCCTCG
AACACCATGG TTTCCGTGCC CATTGCCTTC AAGGCCCTGG ACACCGGGGA GGCCGATATT
TTTCTCGGCA ACTGGATGCC CACCCAGGCC ACAATGGCCA ACAAATACTT CGACAAGGGC
ACCATCGAAC CGCTCGTGGC CAGTATGCCC GGAGCGAAAT ACACCCTGGC CGTGCCCACA
TACGCCTATG AAGGCGGCTT GCAGCACTTC AAAGACATCG CCAAATACGC CGATAAGCTG
GGGAATAAAA TCTACGGCAT CGAGGAAGGC AACGACGGCA ACCAGATCAT CCAATCCATG
ATCGACAAGG ACATGTTTGG ACTGGGCGAT TTCCAGCTCA TCCCTTCCAG TGAGGCCGGG
ATGCTCTCCC AGGTGCAGTC CTTCACCAAG GACGAACGCT GGATCGTCTT TCTGGGCTGG
GCCCCGCACC ACATGAACGA AATGATCGAC ATGAAGTATT TGGACGGAAG TACATCAGAG
ACCTTCGGCA AGAACGACGG TACGGCCACG GTCTACACCA TCGTGCGCGA CGGGTTTGTC
GAAGAAAACA AAAATGTCGC CAAGTTTTTG AAAAACCTCA TCTTCCCCAT CTCCATGATG
AACCAGATCA TGACCACCCT CCACGAAAAG GACGGGTTGA AACCCGTGGA TGCCGGCCTG
GATTGGGTCA AGGCCCATCC AGAGGTCTAC AAGGGATGGC TGGAAGGCGT GACCACCATT
TCCGGGGAAC CGGCTCTGCC GGCCTTTGAA CAATACCTGG AAACCGTCAA CTAA
 
Protein sequence
MSGFSKKVLV ALIAFAVLTM SASQVLAAKE VRFASVSWTG VTTKTELAVR ILRSLGYEAS 
NTMVSVPIAF KALDTGEADI FLGNWMPTQA TMANKYFDKG TIEPLVASMP GAKYTLAVPT
YAYEGGLQHF KDIAKYADKL GNKIYGIEEG NDGNQIIQSM IDKDMFGLGD FQLIPSSEAG
MLSQVQSFTK DERWIVFLGW APHHMNEMID MKYLDGSTSE TFGKNDGTAT VYTIVRDGFV
EENKNVAKFL KNLIFPISMM NQIMTTLHEK DGLKPVDAGL DWVKAHPEVY KGWLEGVTTI
SGEPALPAFE QYLETVN