Gene Htur_2472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_2472 
Symbol 
ID8743081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp2535599 
End bp2537110 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content66% 
IMG OID646513058 
Producthypothetical protein 
Protein accessionYP_003404023 
Protein GI284165744 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGACC GAACGCCCCG GCTCGGACTC GAGACGTTCG AGGAGGGCGA CGCGTGGGAT 
CACACCGACA CGGTCGAAGC CGTCGACGAA CACGCGATTG TCCGGGGACC GATCGCCGAC
CGTCCGGACG AGGGCGAGTA CGACGACGAA CTCTACCACG CGAACGATCA GGGGATCACG
TGGCGCTGGG ACGCCTCGAG CGAGGACTGG ACGTACTTCG GCGGCAAGGG CTGTTCGGAG
CAGCCGATAC CGGGGACGAG TCACTTCGAG GCGGCGGAAC TCGTCCACGC GCGCACCGAG
GAGACCCCCG TCTGGAACGT CGAAGCCCAC GGGATCGAGG GCGACGGCGA GACGGAAGTC
GGGGCGGCCG TCCACGACCT CCTCGCGGAC GTCGCTGAGG CCGGCGGCGG GATCGTCTAC
TTCCCGCCCG GCCGATACCT CCTCGAGCGG ACGCCGCTGA TCGGCGACGA TACGCTCCTG
CTGGGCGCGG GTCGCGCGAC GGTCCTCGAG GGAACGCGCC CCGAGGACGA GGAAGGCCGG
GCGCTGCTCT CCAACAGGGG CTACGACGCG GTCGATTTCG ACGGCGCGTC GGATTGGGCG
ATCTGTAACG TCCGAATCGA TTCGCCGGCC ACGAACGGGA TCATGCCTGC ACACGCGGAG
AACGTCCGGC TGGAACGAAT CTACGGCGAC CGGATCTACT ACCACCACAT CGACGTCGTG
TCCTCGAAAA ACGTCGGGAT CGACGGCTAC TGGGCGACTC GAGGCGGCGA GGCCGACTCG
GACGCGCCGA TTCAGTTCGA CAACCAGACC ACGGAAATCG CGTCGAACAG CGTCTGGAAC
GGTAACGAGG AGCTACTGGC CGGGAGCGAC GGCACCCCGA CGCGGAACTG CACGCTCGAG
AACTTCGAGA TCGACCCTGC GAACGGTCCG GAGTACGGCG TCCACATGCA CCGGAACGGC
AACGAGTCGA TCACCATCAG GGACGGGTAC ATCACCGGTT GCCTCTATTC GGCGATCCGA
GGCGACACCG GCGACGCGAT CGAGGACCTG ACGATCGACT CCGTCTCGTG TATCGAGAAC
GCGCGGGGGA TCTCGCTCGG ACATATCAAG GGCGGCCGAC GAGAGCTGAC CATCAGCAAC
GTCACGATCA GAACCGACAA CAGGGGGCTG GCCGCCGGCT CGGGACTGTA CGCGGCCGGG
TTCGACGGCG CCGAGATCTC GAACACCGTC GTCGACGGCG AGTTCACGAA CGCGATCCTC
TTCGACGACA TGGACGACCT GAAGCTGAGC ACCGTGACGG CCAAGGGCGC GAGGGATCAG
GCGTTCCGAT TCCGGAATAA CGTCGACGCG ACGCTGACGA CCGCTCGAGC GGCCGAGTGC
GGCGACGCGG GCGTCTACGT GGGAGACGGC AGTAGCATCG CCTACGGCGG CGTCACGTTC
GACGATGTCG GCAGCGAAGT CGACATCCAC GACGATGGGA CGTTACGGGA GTGGACCACC
TCCTCGTCGT GA
 
Protein sequence
MSDRTPRLGL ETFEEGDAWD HTDTVEAVDE HAIVRGPIAD RPDEGEYDDE LYHANDQGIT 
WRWDASSEDW TYFGGKGCSE QPIPGTSHFE AAELVHARTE ETPVWNVEAH GIEGDGETEV
GAAVHDLLAD VAEAGGGIVY FPPGRYLLER TPLIGDDTLL LGAGRATVLE GTRPEDEEGR
ALLSNRGYDA VDFDGASDWA ICNVRIDSPA TNGIMPAHAE NVRLERIYGD RIYYHHIDVV
SSKNVGIDGY WATRGGEADS DAPIQFDNQT TEIASNSVWN GNEELLAGSD GTPTRNCTLE
NFEIDPANGP EYGVHMHRNG NESITIRDGY ITGCLYSAIR GDTGDAIEDL TIDSVSCIEN
ARGISLGHIK GGRRELTISN VTIRTDNRGL AAGSGLYAAG FDGAEISNTV VDGEFTNAIL
FDDMDDLKLS TVTAKGARDQ AFRFRNNVDA TLTTARAAEC GDAGVYVGDG SSIAYGGVTF
DDVGSEVDIH DDGTLREWTT SSS