Gene Htur_3497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3497 
Symbol 
ID8744117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3597814 
End bp3598854 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content71% 
IMG OID646514078 
Productluciferase family oxidoreductase, group 1 
Protein accessionYP_003405032 
Protein GI284166753 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03558] luciferase family oxidoreductase, group 1 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCTCT CGATCGTCGA TCTCGCACCG ATGCCGGAGG ACGGTACCGC GACGGAGGCG 
TTCGAACACA CGATCGAACG CGCCCGGCGG GCCGAGCGAC TCGGCTACTC GCGGTTCTGG
GTGGCCGAAC ACCACGACTT CACCGACTCG GTGGCGAGCA CGACGCCGGA GGCCCTGATC
CCCTACGTCG CCGCGAAGAC GGAGGACATC CGGGTCGGCT CGGGCACCGT CCTGTTGAAC
CACTACAGCC CGTACAAGGT CGCGGAGACG TTCGGCGTCC TCGACGCCTT GGAGCCCGGC
CGGATCGACC TCGGCCTTGG CCGGGCGACG GGAAACCCCG CGAGCGATCT CGCCCTCCAG
CCGGATCGCA GCCAACGGCG GCGAACCGGC GACGATCAGG CGGAGAAGGT CGAGGAGGTC
GCCAACCATC TCTACGGCGG CTTCGACGAC GACCACCCGT TCCGCGACCT CGAGGTACCC
CGATCGGGCG ACTCCGCGCC CGAAATCTGG GTCCACGGCT CGAGTCCACA GAGCGCGACG
ATCGCCGGCG AACTGGGACT GCCGTACTGT TTCGCCGCGT TCATCCGCCC CGAGCCGGCG
GTACAGGCGT TCGAGACCTA CCGGGAGCAC TTCGAGCCCT CGCCGGACGG CGCCGGCCTC
GAGGCGCCCC GCGGCGCCAT CGCGGTGAAC ATGACCTGTG CCGAGACGGA CGAAGAGGCC
GCGCGGCTCC GCGCGACCGC CGAGGCCTCG TCGCGACTGC TCCGCAGCGG GCGGGTCGAC
CGACTCCCGA TTCGGTCGGT CGACCGGGCG ATCGACGTCC TCGGCGACGC TCCCGACCCG
ACGCCGACGG ACATCGAGCC CGGCGAGTGG CCTCGGCACC TCTCCGGCGG ACCGGAGACG
GCCCGCGAGA TCCTCGAGGA ACTGACCGCA CAGGCCGGGG TCGACGAGGT CGTGATCCAG
AGTCAGCACG CCGACCCCGA GACGACGCTG CGCTCGCACG AACTGCTCGC CGACGCCGTC
GGCCTCGAGG CGCGCGAATA G
 
Protein sequence
MELSIVDLAP MPEDGTATEA FEHTIERARR AERLGYSRFW VAEHHDFTDS VASTTPEALI 
PYVAAKTEDI RVGSGTVLLN HYSPYKVAET FGVLDALEPG RIDLGLGRAT GNPASDLALQ
PDRSQRRRTG DDQAEKVEEV ANHLYGGFDD DHPFRDLEVP RSGDSAPEIW VHGSSPQSAT
IAGELGLPYC FAAFIRPEPA VQAFETYREH FEPSPDGAGL EAPRGAIAVN MTCAETDEEA
ARLRATAEAS SRLLRSGRVD RLPIRSVDRA IDVLGDAPDP TPTDIEPGEW PRHLSGGPET
AREILEELTA QAGVDEVVIQ SQHADPETTL RSHELLADAV GLEARE