Gene Dret_1033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1033 
Symbol 
ID8418856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1216265 
End bp1217332 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content56% 
IMG OID645037603 
ProductDRTGG domain protein 
Protein accessionYP_003197899 
Protein GI258405157 
COG category[R] General function prediction only 
COG ID[COG0857] BioD-like N-terminal domain of phosphotransacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00622763 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGGAT TGTACATTGG GTCCACAAGC GGATTTGCGG GCAAAAACAT GGTGACCATG 
GCCCTGGGGC TGCATTTGCA GAAGGAGGGT CATCTTGTCG GCTATATGAA GCCGATCGGA
GCGGTTCCGA GCAAAGGCAA CTCCCGGGAA GGCGATGCCG ACGCCTTTTT TGTCCAGGAT
GTCCTCGGGC TCCAGGAAGA CCCTAACCTG GTCACTCCGG TCCTGGTGAC TGAAGAATTC
AAGCGCGAGG CCTTTACCAG CTCCTGCCCC CAATTGTTGT CACGGGTGCA AACGGCCTAT
GAAACCCTGG AAAAGGGCAA GGATCTCGTC CTTGTCGGCG GGTCCGGCAG TTTCCTGTAC
TCCGGAAAAT ACTGCGGCGT CGACGGCCTT AGCGTCAGCA CCGGGCTCAA GACCAAAGTT
CTGCTGATCG ACCGCTTCCG CAGTGAATGC AACTACGATT ATCTGTTAAC GGCCAAGGAA
CTGTTGGGAG ACCGGTTGAT CGGTGTGATC CTCAATGACA TCCCTGCAGC GCAGATGGGC
GAATTACAAG GTGGTGTCGT CCCCCTGCTG GAGCGTCAGG GTGTCCCCGT GCTCGGTCTC
ATCCCCCACG ACCCACTCAT GGGGGCCATT AAAATCGCCG ATCTCGCTGA ACGCCTGGGC
GGACGGATTA TTTCCGCTCC TGGTAAAGCG GACCGGGTCA TTGAGAACTT CCTCATCGGC
ACCATGCAGG TCGAAAATTT TATGACCCAC TTCCGCCGGC ACCAGAATTC AGCGATTCTC
GTTGGCGGCG ACCGATCCGA TTTGCAGTTG GTGGCCCTGG AAGGCAAATG CCCTTGCCTC
ATATTGACTG GCAACCTGTA CCCCAATGAT ATCATCCTCA CTCGCTCTGA AGTCCTGGAA
ACCCCGCTCA TCGTCGTTCG TGAAGACACC TATAGCGTGG CCCAGAAAAT GGAGCGCATT
CTCGGCTCCG TCAAATTGCG GGACATGATC AAAATCAACC ACGGCGCCCA ACTCGTCAAC
AGCGCTGTCG ATTTCGCGGC CATCAAACGG GCCCTGCAAC TCCAGTAA
 
Protein sequence
MPGLYIGSTS GFAGKNMVTM ALGLHLQKEG HLVGYMKPIG AVPSKGNSRE GDADAFFVQD 
VLGLQEDPNL VTPVLVTEEF KREAFTSSCP QLLSRVQTAY ETLEKGKDLV LVGGSGSFLY
SGKYCGVDGL SVSTGLKTKV LLIDRFRSEC NYDYLLTAKE LLGDRLIGVI LNDIPAAQMG
ELQGGVVPLL ERQGVPVLGL IPHDPLMGAI KIADLAERLG GRIISAPGKA DRVIENFLIG
TMQVENFMTH FRRHQNSAIL VGGDRSDLQL VALEGKCPCL ILTGNLYPND IILTRSEVLE
TPLIVVREDT YSVAQKMERI LGSVKLRDMI KINHGAQLVN SAVDFAAIKR ALQLQ