Gene Htur_1006 details

Gene Information

Locus tag: Htur_1006
Symbol:
ID: 8741591
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 1037323
End bp: 1039572
Gene Length: 2250 bp
Protein Length: 749 aa
Translation table: 11
GC content: 71%
IMG OID: 646511584
Product: von Willebrand factor type A
Protein accession: YP_003402573
Protein GI: 284164294
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1239] Mg-chelatase subunit ChlI
TIGRFAM ID:
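
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1037323
END_BP = 1039572
GENE_LENGTH_BP = 2250
PROTEIN_LENGTH_AA = 749


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```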


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTGCAG AGTCCGGAGA CAAAAAGCTG TCGTCACTCC CCTTTCCGGC CATCGTCGGC 
CAGAACGAGT TGAAGCGGGT GCTGCTCGCC GTCGCCACCA ACGACGGCCT CGACGGCGCC
CTGATCGTCG GCGAGAAGGG GACCGCGAAG TCGACCGCGG TGCGAGGGCT CGTGGATCTC
CTCCCCGAAC AGCGGGCCGT GGCCGACTGT CCGTACGGCT GTGCGCCCGA CGCGCCGAAT
CTGCAGTGCG ACGACTGCCG CGAACGCGAT CCGGAAGCGA TGCCCGTCGA AACCCGGCCA
GTTCCGCTCG TGACGCTTCC GCTGGGTGCG ACTCGAGACC GCGTCGTCGG GACCCTCTCG
GTCGAGGACG CGCTGGCCGG CGCGGCCGAC TTCGATCCCG GACTGCTCGC CCGCGCTCAT
CGGGGCATCC TCTACGTCGA CGAGGTCAAC CTGCTGGACG ACCACCTCGT GGACGTCATT
CTCGACGCGG CCGCCAGCGG GGTCAATACG GTCGAGCGTG ACGGAGTCAG CGTCTCTCAC
CCCGCCGAGT TCACCCTCGT CGGGACGATG AACCCCGAGG AGGGCGACCT CCGCCCCCAA
CTGCGGGACC GCTTCGCCCT GCAAGCCACC GTCGAGGGCT GTCGTGAGAT CGACCAGCGC
GTCGAGATCA TCGATCGGGC GCTCGAGACG GACGCCAGTG GCGCTGGCGG ATCTGACGGG
CCGGAACCCT GGACCGAGTA CGCCGACGAG ACGGCGACCC TGCGCGAGGA ACTCGCGGCG
GCCCGCGACC GCCTTGCCGA AGTCACGCTC CCGGACGATT TCAAGGCGGA GATCGCCGAC
CTCTGTCTCG AGGCCGGCGT CGACGGCCAC CGCGGCGACG TGGCGACCGC TCGGACGGCG
ATGACCATTG CCGCGCTCGA GGGCCGAACG ACGGTCATCG AGTCCGACGT TCACGAGGCC
GCCAGCTACA CGCTTCCCCA CCGCCTCCGG AGCACGCCGT TCGAAGACGA ACCCGACGTC
GAGGACCTGC TCGAGGACCG CTTCGACGAG GAGTCGACGG ACGAGAGCGA AAGCGAGGAC
GATGCGGACG GCGACGAGTC CGCGGACGGA GGCGACGGAG ACGAGTCCGA GACCGAGAGC
GACGACGGTG AGACGGGAAC CGAACCCGGC GGCGAGCGCA GCGACGGTGA CGGTGACGAC
GGCGAATCCG ACGGGAACGA CGGCGACGCC GATTCCGAGA CCGACGACTC GAGTCCCGAG
CGTCGACCCG ACGAGGGCGA CGACCCGACC GGCGGTGAGC CGTCGCCCGC CGAAGCCGGC
AACGGGAGCG ACGAGACCGG CGACTCGAGC GAATCCGATT CGAACGACGG TGACGGGAGC
GACGGAGACG GCGAAAGCGG CGACGAGGGA GACGAGAACG CCCAGCCGCT CGTCCCCGGC
CAGCAGCGCG CCGACGTCGC CGCGGTGGGC GAGGCCACGT CGCCGGACCT CGAGACGCCC
GACGCCGAGA GCGCGGACGC GGCGACGGCG AGCGGTTCGC GAGCGAGCAC GGCCCCAAGC
ACGGACAACC GCGGTGCTCG AGTTCGGACC GAACCCGCGT CGGGGGACGG CCCTATCGAC
GCGGCGGCGT CGGTTCGCTC CGCCGCGACC CGCGGCGACT CCCGGGTACA GAAGCGGGAC
TTGCGTCAGT CGGTCCGCAC CGGCGACACG TCGGCGACGA TCGTCTTCGC GGTCGACGCC
AGCGCCTCGA TGCGCCCCGC GATGCGCACC GCGAAGGGCG TCGTCCTTGA TCTGCTGCGG
GACAGCTACG AGCACCGCGA TCGGGTCGCG TTCGTCGCCT TCGCCGGCGA AGACGCCGAC
GTCCTCCTGC CGCCGACCGA CAGCGTCTCG CTGGCCGCGC GCCACCTCAA GGAACTCCCC
TCGGGCGATC GGACGCCCCT TCCCGCGGGC CTCGAGACCT CGCGGCGAGT CCTCGAGCGC
GCGGAGACGG ACGCCTCGGT CGTCGTCTGC GTGACCGACG GCCGGGCGAA CGTCGCCGAC
GGCAGTCCGA CCGAGGCGAC GCGACGGGCG GCCCGAGGGC TCGCGGCCGA GGACGCGACC
GTGATCGTCG TCGACGCCGG CGACGACTCT CGGGCCGGCC TCTCGGAGCT GGTCGCCGCC
GAAACGGGGG GCGAAGTGGT CGCCCTCGAG GCGCTGTCGT CCGAGACGGT CCGAGCGGCG
GCGGACCACG CCGCATCGGG GGGACAGTAA
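
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted in the layout shown here (blocks of ten bases separated by spaces and line breaks). The helper names are hypothetical. Only the first line of the sequence is embedded to keep the example short, so its result will not necessarily match the 71% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence; paste all lines to check the whole gene.
first_line = "ATGGTTGCAG AGTCCGGAGA CAAAAAGCTG TCGTCACTCC CCTTTCCGGC CATCGTCGGC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```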
 
Protein sequence
MVAESGDKKL SSLPFPAIVG QNELKRVLLA VATNDGLDGA LIVGEKGTAK STAVRGLVDL 
LPEQRAVADC PYGCAPDAPN LQCDDCRERD PEAMPVETRP VPLVTLPLGA TRDRVVGTLS
VEDALAGAAD FDPGLLARAH RGILYVDEVN LLDDHLVDVI LDAAASGVNT VERDGVSVSH
PAEFTLVGTM NPEEGDLRPQ LRDRFALQAT VEGCREIDQR VEIIDRALET DASGAGGSDG
PEPWTEYADE TATLREELAA ARDRLAEVTL PDDFKAEIAD LCLEAGVDGH RGDVATARTA
MTIAALEGRT TVIESDVHEA ASYTLPHRLR STPFEDEPDV EDLLEDRFDE ESTDESESED
DADGDESADG GDGDESETES DDGETGTEPG GERSDGDGDD GESDGNDGDA DSETDDSSPE
RRPDEGDDPT GGEPSPAEAG NGSDETGDSS ESDSNDGDGS DGDGESGDEG DENAQPLVPG
QQRADVAAVG EATSPDLETP DAESADAATA SGSRASTAPS TDNRGARVRT EPASGDGPID
AAASVRSAAT RGDSRVQKRD LRQSVRTGDT SATIVFAVDA SASMRPAMRT AKGVVLDLLR
DSYEHRDRVA FVAFAGEDAD VLLPPTDSVS LAARHLKELP SGDRTPLPAG LETSRRVLER
AETDASVVVC VTDGRANVAD GSPTEATRRA ARGLAAEDAT VIVVDAGDDS RAGLSELVAA
ETGGEVVALE ALSSETVRAA ADHAASGGQ
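
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the standard codon table by hand, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it allows. The function and constant names are illustrative. Only the first and last lines of the gene sequence are used, and both are in frame because every full line holds 60 bases.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate in-frame DNA, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_line = "ATGGTTGCAG AGTCCGGAGA CAAAAAGCTG TCGTCACTCC CCTTTCCGGC CATCGTCGGC"
last_line = "GCGGACCACG CCGCATCGGG GGGACAGTAA"
print(translate(first_line))  # MVAESGDKKLSSLPFPAIVG
print(translate(last_line))   # ADHAASGGQ
```

These outputs match the first 20 and the last 9 residues of the protein sequence above.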