Gene Htur_5149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_5149 
Symbol 
ID8745697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013747 
Strand
Start bp43684 
End bp44760 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content69% 
IMG OID646515506 
Productglycerophosphoryl diester phosphodiesterase 
Protein accessionYP_003406453 
Protein GI284176176 
COG category[C] Energy production and conversion 
COG ID[COG0584] Glycerophosphoryl diester phosphodiesterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.735215 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGATC CCAACGGCGA CGAGAGACGA ACCGGCCATC CAGACACCTC GGGGTCCGCG 
CTCCGGCGGC GGTCCGTTAT CGCCGCTGCG GGGGCGTCGG CGATCGGCAT GGTCGGCGCT
GCGAGCGCCG ATCGCGGCCG CGGGACCGAG CGCGATCGAT CCGCCGGCTC CGACAATCGA
CAGCGCAACC GATCGCGCGA ACGCGGGTTC GTCGATCGGA CCGACGAGCC GGATCTGATC
GCCCACCGCG GATTCGCCGG ACTCTACCCC GAGAACACCG TCGGCGCCGT CGAGGCGTCG
GCCCGCGGTA TCCGGTCGCC GTACGCGCCG TCCCGCGGGG CGAACATGAT CGAAATCGAC
GTCGTTCCGA CCGCCGACGG CGACGTCGTC GTCTTCCACG ACGACCGTCT CGCCGAGCGC
GACGGCGGCG AGCGCGGCCT CACCGACACC GAGGGCGTCG TCTGGGAGAC CGACACTGAG
ACCGTCACGA GCGCCGAAGT GCTCGAGAGC GGCGAGACCG TTCCCCGACT GCGCGAGACT
CTCGCGGCGA TTCCGTCCCA CGTCGGCGTC AACGTCGAGC TGAAGAACCC GGGCTCGTTC
GACGTTCGAT TCGCCGAGTC GCTCTCGAGC GAGGAACTCG CGGGGCAGAA AGAGCTCTGG
CAGCCGTTCG TCACCGACGT GCTCGCGGTC GTCGACGACT TCGACCACGA GTACCTCTTC
TCGTCGTTCT ACGAGGCGGC GCTAGCGACG ACCCGCGAGG CGTCGGACTA CCCGGTCGCG
CCGCTGCTCT GGGACTCCGT CGAAGCCGGC CTCGAGGTCG CCCGCCGCTA CGAGGCCGAG
GCGATCCATC CGCCGTACGA TATGATCCGC GATACGCCGT TCTACGCCGA CCAGCACTAC
GCGGAGGACG CCGGCTGGGA CGAGATCGAC CTCCTCGCGG TCGCCAACGA GGAAGGGCGG
GACGTGAACG TCTTCACCCT CGAGACCTGG TACCAGGCCG ACCAGTTGGC GGCGGCCGGC
GTTGACGGGC TGATCAGCGA CTACGCCGAC GTGCGCCGGT TCGGCGTGAC GAACTGA
 
Protein sequence
MSDPNGDERR TGHPDTSGSA LRRRSVIAAA GASAIGMVGA ASADRGRGTE RDRSAGSDNR 
QRNRSRERGF VDRTDEPDLI AHRGFAGLYP ENTVGAVEAS ARGIRSPYAP SRGANMIEID
VVPTADGDVV VFHDDRLAER DGGERGLTDT EGVVWETDTE TVTSAEVLES GETVPRLRET
LAAIPSHVGV NVELKNPGSF DVRFAESLSS EELAGQKELW QPFVTDVLAV VDDFDHEYLF
SSFYEAALAT TREASDYPVA PLLWDSVEAG LEVARRYEAE AIHPPYDMIR DTPFYADQHY
AEDAGWDEID LLAVANEEGR DVNVFTLETW YQADQLAAAG VDGLISDYAD VRRFGVTN