Gene Htur_1410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1410 
Symbol 
ID8742001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1462642 
End bp1464012 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content67% 
IMG OID646511988 
Productprotein of unknown function DUF21 
Protein accessionYP_003402971 
Protein GI284164692 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGGCC TCGAGGCGAC CGCGGTACTG GCCGACGTCG CTCGCCCGGT CGCCGCGTCG 
GGAGTCGTTG GTTTCGAGCC GTCGACGACC CTCGTGGCCG CCGGCGGCGT CGCGGCGCTG
CTGGTGTTGC TGGTCCTCTC GGGGTTTTTC TCCTCGGCCG AGATCGCCAT GTTCTCGCTG
GCCCACCACC GCATCGAGGC GCTCGTCGAG GACGGGGCGT CCGGCGCCGA GACCGTCCAG
GCGCTGAAGG ACGACCCCCA CCGACTGCTG GTGACGATCC TCGTCGGGAA CAACCTCGTC
AACATCGCGA TGTCCTCGAT CGCGACGGGA CTGTTCGCGA TGTACACGAG CCAGGGGCGG
GCGATGCTGG CGGCGACGTT CGGCGTGACG GCCGTCGTCC TGCTGTTCGG TGAGAGCGCG
CCCAAGTCCT ACGCTATCGA GAACACCGAA TCGTGGGCAC TGTCGGTCGC TCGTCCCCTC
AAAGTCTCGG AGTACGCGCT GTTCCCGCTC GTGGTCACGT TCGACGCGCT GACACGGGTG
CTTAACCGCC TGACCGGCGG CACGGCCGTC GAGGAGTCGT ACGTCACCCG CGAGGAGATC
CGGGAGCTGA TCCGGACCGG CGAGAGCGAA GGGGTCATCG AGGCCGACGA ACGCGAGATG
CTCCAGCGCG TGTTCCGGTT CAACGACACC ATCGCCAAAG AGGTGATGAC GCCGCGATTG
GACGTCACCG CCGTCGCTCG AGAAGCGACC GTCGACGAAG CCGTCGCGAA GTGCGTCGAG
AGCGGCCACA CCCGCCTGCC GGTCTACGAC GGCGATCTCG ATACCGTCGT CGGGATCGTC
GCGCTCGGCG ACCTCGTCGG TGATCGCGAG TCGACCGACG ACGGCTTGCT CGAGGCCCAC
GTCGAGGAGA CGCTGCACGT CCCCGAGAGC AAACACGTCG ACGAGCTGTT CCGCGAGATG
CGCCAGCAGC GGGTCGAACA GGTCGTCGTC ATCGACGAGT TCGGGACGAC GGAGGGGATC
GTCACCACCG AGGACATCGT CGAGGCCGTC GTCGGCGAGA TCCTCGAGAC CCAGGAGGAC
GACCCGATCG AGACCGTCGA CGACCGAACC GTCCGGGTCG ACGGCGAGGT GAACATCGAG
GCCGTCAACG ACGTCACCGG CGTCGAGTTC CCAGAGGGCG AGGAGTTCGA GACGATCGCC
GGCTTCGTCT TCAACCGCGC CGGCCGACTG GTCGAACCCG GCGAAACGTT CGCCTACGAC
GGCGCCGAAC TGACCGTCGA ACGCGTCGAC GATACGCGCA TCAGGCGGGT GCGCATCAGC
GAGTCGGAGC CTTCGGTAAC GGACGGCTCC GGTGTCGCCG CCTCGAGTTA G
 
Protein sequence
MPGLEATAVL ADVARPVAAS GVVGFEPSTT LVAAGGVAAL LVLLVLSGFF SSAEIAMFSL 
AHHRIEALVE DGASGAETVQ ALKDDPHRLL VTILVGNNLV NIAMSSIATG LFAMYTSQGR
AMLAATFGVT AVVLLFGESA PKSYAIENTE SWALSVARPL KVSEYALFPL VVTFDALTRV
LNRLTGGTAV EESYVTREEI RELIRTGESE GVIEADEREM LQRVFRFNDT IAKEVMTPRL
DVTAVAREAT VDEAVAKCVE SGHTRLPVYD GDLDTVVGIV ALGDLVGDRE STDDGLLEAH
VEETLHVPES KHVDELFREM RQQRVEQVVV IDEFGTTEGI VTTEDIVEAV VGEILETQED
DPIETVDDRT VRVDGEVNIE AVNDVTGVEF PEGEEFETIA GFVFNRAGRL VEPGETFAYD
GAELTVERVD DTRIRRVRIS ESEPSVTDGS GVAASS