Gene Dret_1034 details


Gene Information

Locus tag: Dret_1034
Symbol:
ID: 8418857
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1217503
End bp: 1218711
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 56%
IMG OID: 645037604
Product: acetate kinase
Protein accession: YP_003197900
Protein GI: 258405158
COG category: [C] Energy production and conversion
COG ID: [COG0282] Acetate kinase
TIGRFAM ID: [TIGR00016] acetate kinase
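
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 1217503
end_bp = 1218711
gene_length_bp = 1209
protein_length_aa = 402

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# A CDS of N bp holds N // 3 codons; the final codon is assumed to be the stop.
codons, remainder = divmod(span, 3)
predicted_protein_length = codons - 1

print(span == gene_length_bp)                         # coordinates agree with gene length
print(remainder == 0)                                 # whole number of codons
print(predicted_protein_length == protein_length_aa)  # agrees with protein length
```

Under those assumptions all three checks print `True`: 1218711 - 1217503 + 1 = 1209 bp, which is 403 codons, or 402 residues plus a stop codon.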


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00869824
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.811028
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACTC TCGTCATTAA TTCCGGAAGT TCCTCCATTA AATACAAACT TTTCGATATG 
GAGAGCGAGG CTGTCCTTGC TGCCGGCGTT ATTGAGCGCA TCGGCGAAGA GAGCAGCAAG
CTCGAGCACA AAAAATATCC TGGCACCGAA CGCGAGGGCA AGACGGAACA AAACGATCGC
GTGGCCAATC ATGAAGAAGG CTTGGGCAAG GTTGTCGCCC TGCTCACGGA CGCCGACTAC
GGGGTCATCC GCGACCGCGC GGAAATCGAT GCCGTGGGAC ACCGCGTCGT GCATGGCGGG
GAAGCCTTCC ACGCCCCGAC CGTGATCGAC GATTCGACCA TTGCGGCCAT CGAGGCCAAC
GCCTCCCTGG CCCCCTTGCA CAATCCGGCC AACCTGACCG GCATCAAGGT CGCCCGGGAG
ATGTTCAAGG ATGTGCCTCA GGTGGCTATC TTTGACACCG CTTTTCATCA GAGCATGCCG
GCCAAGGCCT ATCAATACGC TATCCCCTAC GCCCTGTATA AGGAATTGGG GATCCGCCGC
TACGGGTTCC ACGGTACCTC CCATCGCTAT GTGACGAAAA AGGCCGCCGC ATTGCTCGGG
AAAAAGGAAG ACGAGGTCAA TCTGATTACC GTCCACCTGG GCAACGGCAG TTCCATGAGC
GCCATCAAAA ACGGCAAATG CGTGGACACC TCTCTGGGCA TGACTCCCTT GGCCGGCTTG
GTCATGGGAA CCCGCTGCGG GGACATCGAC CCCGCGGTCC ATGCCTTTTT AGCCAAGCAA
AAAGGCATGA GCATCGAGGA AATCGATACC CTGTTCAATA AAGAAAGCGG ACTCAAGGGC
ATCTGCGGCA TGAACGACAT GCGCGATATC CATACGGCCC GGGAAAAGGG CGACGCCCAG
GCCCAACTTG CCGTGGACAT GCTCACCTAC CGCAATAAGA AATACATCGG CTCTTATCTG
GCCGTACTGG GCCGCGTGGA CGCCATTGTC TTTACCGCCG GCATCGGGGA AAACGACTCT
GACGTCAGAG CTTTGAGCCT GGAAGGGTTG GAAAACTTCG GGATCGAAGT CGATCAGACC
AAAAACGGTG AACGCAAAAA AGAGGCCCGG TTTATCAATA CCGAGTCCAG CACGGTGAAA
GTCATGATCG TGCCCACGGA CGAAGAGTTG GAAATCGCGC AACAGACCAT GGAGATAGTC
GGCCAATAG
 
Protein sequence
MKTLVINSGS SSIKYKLFDM ESEAVLAAGV IERIGEESSK LEHKKYPGTE REGKTEQNDR 
VANHEEGLGK VVALLTDADY GVIRDRAEID AVGHRVVHGG EAFHAPTVID DSTIAAIEAN
ASLAPLHNPA NLTGIKVARE MFKDVPQVAI FDTAFHQSMP AKAYQYAIPY ALYKELGIRR
YGFHGTSHRY VTKKAAALLG KKEDEVNLIT VHLGNGSSMS AIKNGKCVDT SLGMTPLAGL
VMGTRCGDID PAVHAFLAKQ KGMSIEEIDT LFNKESGLKG ICGMNDMRDI HTAREKGDAQ
AQLAVDMLTY RNKKYIGSYL AVLGRVDAIV FTAGIGENDS DVRALSLEGL ENFGIEVDQT
KNGERKKEAR FINTESSTVK VMIVPTDEEL EIAQQTMEIV GQ
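
The gene and protein sequences above are printed in blocks of ten characters. As a minimal sketch of how they relate, the Python below strips the block spacing, computes a GC fraction, and translates codons until the first stop. It assumes the standard amino acid assignments, which translation table 11 shares with table 1 (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_fraction` and `translate` are hypothetical, not part of any database tool.

```python
# Codon table in NCBI order (bases T, C, A, G); table 11 uses the same
# amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to print sequence blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last printed lines of the gene sequence in this record.
first_line = "ATGAAGACTC TCGTCATTAA TTCCGGAAGT TCCTCCATTA AATACAAACT TTTCGATATG"
last_line = "GGCCAATAG"

print(translate(clean(first_line)))  # MKTLVINSGSSSIKYKLFDM
print(translate(clean(last_line)))   # GQ
```

Both results match the start and end of the listed protein sequence. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the reported GC content.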