Gene Htur_3868 details

Gene Information

Locus tag: Htur_3868
Symbol:
ID: 8744496
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013744
Strand:
Start bp: 97101
End bp: 98987
Gene Length: 1887 bp
Protein Length: 628 aa
Translation table: 11
GC content: 70%
IMG OID: 646514453
Product: Heparinase II/III family protein
Protein accession: YP_003405400
Protein GI: 284167122
COG category:
COG ID:
TIGRFAM ID:
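
The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
START_BP = 97101
END_BP = 98987
GENE_LENGTH_BP = 1887
PROTEIN_LENGTH_AA = 628


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for the values listed in the table.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)    # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 98987 - 97101 + 1 = 1887 bp, and 1887 / 3 = 629 codons, of which 628 encode amino acids.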


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.103442
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGACGG CGCACGACCA CGACTATCCG CCGCGCGAAT GGACGGTCGG CGGCCTCCGA 
GACGCCCTCG ACGGCCCCGG GGAGGCGTTC ACGCTCCCGA CGTACGACGA CGAAGCGGCG
TGGACGGCCC TCCGCACGGA CGAACTGACC TGCGAGCCGG TCGAGGCGCT GCTCGACGAC
GCCGAATCGG CTCGCGACGG CGAGATCCCG TCGCTCACGG CCAGTCAGTA CCTCGACTAC
GAGCGCACGG GAGATCGGTC GCGCTACGAA GCCGCCGCAC GCGAACGCCG GCGTCGTCTC
TCCGCGCTCG TCGTCGCCGC GTGCGTCGAA CGCGACGACG ACTTCGATCC GATTTTGGAC
CACGCGTGGG CGCTCTGCGA GCAGGCGACG TGGACGTGGC CCGCACACCT CGGAGACGAA
TCTCGGGAGG GGCTCCCGGG CGCCGTCCCG AGCGAAGAGC GGACGGTCGC GCTCTTCACC
GTCGGCGCGG CGCTCCTCCT CGCGGAGGTC GACGCGATTC TCGGCGACCG TCTCCATCCC
GCGCTCCGTG AGCGCATCCG CGCCGAAGTC GATTGTCGCG TTTTCACTCC TTACGAGGAC
CGCGACGACA TTTGGTGGAC GACGGCAACG AACAACTGGA ACGCGGTCTG TAGCGCGGGC
GTCGCGCTCG CCGCGCTACA CCTCCTCGAC GACGCCGGCC GGCAGGCGCG CATCGTCGAA
CGCGTCGCCG ACGGTCTCGG CCACTACCTC GACGGCTTCG GCGCCGACGG CGGGACGACG
GAAGGAGTCG GCTACTGGAA TTACGGCGTG GGCAACTACG TCGCGCTCGC GGACGCCCTC
GAGAGCGCGA CCGACGGCTC GCACTCGCTG TGCTCGCCCC CGAAACTCGA GCGTCTCGCC
GCGTACCCAC TCGCCGTCGA ACTCAGCCCC GGACGCTTCG TTCCGTTCTC GGACTCGGAC
GAGGAGAGCG TCGTCGCGCC GCGCGCGGCC GCGTGGCTCG GACGCCGCCT AGAGAAGCCG
GGACTGGCGG CTCGCGGCCG GTGGGAGATG GCGCGCCGCA CGGACGCGTT CGCCGGCCCG
AACGTCGCGT CGCTGCCCGA GATCGTCCGC GACCTCCACT GGACGCGGAC GGTACCCGCG
TCGTGGACGC GTTCCACCCC GCCGACCCGT CGATACTTCG GGGGCTGTGA GTGGTGGATT
ACGCGGGCGA GCCCGGCCGA TCCGGACGGT CTCGTCGTCG CCGCGAAAGC CGGCCACAAC
GGCGAGTCGC ACAACCACAA CGACTGCGGC TCGTTCGTCG TTCACGCGAA CGGCGAGTCG
CTCCTCACCG ATCCGGGGCG TCCCGAGTAC GACCGGGACT ACTTCGGTCC GGCCCGCTAC
GAGTACATCA CCGCGCGCTC GCTCGGCCAC TCCGTTCCGT ACGTGAACGG CGTCGAGCAG
ACCGCCGGGG AGGCGTTCGC CGCGTCGGTA CTCGACCGAC GCTCCTCGCC GACGGTCGAC
GCGTTCGAGA TGGAACTCGC CGACTGCTAC CCCGAGGACG CCGGTCTCGA GTCGCTCCGC
CGGACCGTAA CGCTCGACCG AACCGACGGC GTCGTCACGG TCGGCGACGA CGCGGTGTTC
GCGAACGCGG ACAATACGTT CGAGTCCACG CTCGTCTCCG CGTTCCCGAT TCGAAGCGAC
GAGCGAGGAC TCGTCGTCGA CGGCGAACGC GGTCGTACGC GGGTGACGCC GGACGATTCG
GACGCCGAAC GCAGCGTCGA ACGGCTTACG GACGCGATCG AGACGGCCGA CGGGACGCGC
GACGTCTGGC GCGCTCGCAT CGAACGGACC GTCAGTAGCC GCGCGACGTC GCTACAGCTA
CGGATCGAAC CCGAGAGCAG AGAGTAA
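
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before it can be analyzed. The sketch below is illustrative only: the helper names are hypothetical, and only the first line of the sequence is used as sample input. It joins the blocks and computes the GC fraction of the result. Running it on the full sequence would be the way to compare against the GC content listed in the Gene Information table.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace so the 10-base blocks form one continuous string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCCGACGG CGCACGACCA CGACTATCCG CCGCGCGAAT GGACGGTCGG CGGCCTCCGA"

seq = join_blocks(first_line)
print(len(seq))                   # 60 (six blocks of 10 bases)
print(f"{gc_fraction(seq):.2f}")  # GC fraction of this 60-base fragment only
```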
 
Protein sequence
MPTAHDHDYP PREWTVGGLR DALDGPGEAF TLPTYDDEAA WTALRTDELT CEPVEALLDD 
AESARDGEIP SLTASQYLDY ERTGDRSRYE AAARERRRRL SALVVAACVE RDDDFDPILD
HAWALCEQAT WTWPAHLGDE SREGLPGAVP SEERTVALFT VGAALLLAEV DAILGDRLHP
ALRERIRAEV DCRVFTPYED RDDIWWTTAT NNWNAVCSAG VALAALHLLD DAGRQARIVE
RVADGLGHYL DGFGADGGTT EGVGYWNYGV GNYVALADAL ESATDGSHSL CSPPKLERLA
AYPLAVELSP GRFVPFSDSD EESVVAPRAA AWLGRRLEKP GLAARGRWEM ARRTDAFAGP
NVASLPEIVR DLHWTRTVPA SWTRSTPPTR RYFGGCEWWI TRASPADPDG LVVAAKAGHN
GESHNHNDCG SFVVHANGES LLTDPGRPEY DRDYFGPARY EYITARSLGH SVPYVNGVEQ
TAGEAFAASV LDRRSSPTVD AFEMELADCY PEDAGLESLR RTVTLDRTDG VVTVGDDAVF
ANADNTFEST LVSAFPIRSD ERGLVVDGER GRTRVTPDDS DAERSVERLT DAIETADGTR
DVWRARIERT VSSRATSLQL RIEPESRE