Gene Htur_3579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3579 
Symbol 
ID8744199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3681235 
End bp3683559 
Gene Length2325 bp 
Protein Length774 aa 
Translation table11 
GC content71% 
IMG OID646514160 
Producttransglutaminase domain protein 
Protein accessionYP_003405114 
Protein GI284166835 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACGA ACTCCGGTTC CCAGACGGGC GAGCGGACGA TCGAGATCGC GCCGGACGGA 
TCGGTCGGTC CCGGAACATT CCGGCTGCTC GCGCTCTGTT GCGTGCTGGT CCTGACGGCG
TCGTACGTGA GCGTCCTGCG CGACGTGACG CGAGTCGTCG GCGGGACCGA GACGCTGTTG
GCCCTCGTCG TCGCGATGGG AGTCGCGGCG ACGGTCCTCG CGTGGGCGAT CCGCCCGCGG
ACCGCGACGG TCGCCGCGCT CGTCGCGGGC GCGCTCGGCT TCGCCTACTA CCTCGAGTAC
ACCGGCCTCG GCGTCGAGGT GCTGCTCTCG GCGGGCGACG TGTTGCTCTC GGACGCCGCC
GCGCTCGCGA CCGGCCTCCC CCTGCTCCGG ATGGTCGAAG CGGGCGTCTG GACGCTCGCG
TTCGTCCCCG CGCCCGTCTT CCTCTCGTGG TATCTCGCCG TCCGGGGTCG GTACGCGCTC
GGCGTCGTTC CCGGCGGTAT CGCGCTGCTC TTTCTCGTCC TGACCGGCGA CGCCGGGACG
GTACCCACGC TGGTGGGGAC GCTGGCCGGG ATCGGCGCCG TCGGCTGCGG CGAACTCGAG
CGCCGCGGCG GTTCGGTCGC CCAGGCGGAC CTGCTCACGG CCCTGTTCGC GCTGATCGTC
GTCCTCTCGC TGACCGTCAC GGTCGTCCCC GGCGGCTCGG GGACACCGGG CCACTTCGCT
GGGACCGAGA CCGGGACGCT CGAGGGGACC ATCGACCACG CCTCGGATCG CTCGAGCATC
GGCGGCTCGG TCGAGCTCTC GCCGGAGGTC CGGTTTACGG TCGAAGCCGA CCAGGCCTCC
TACTGGCGGA CGGGCGTCTA CGATCGCTTC GCCGGCGACG AGTGGGTCCG GACCGGCGAC
CGCAGCGACT ACGGGGGTCC GATCGGGAAG CCGCCGGGCG AGAGCGAACG GGTGCACCAG
CTCGTCACGG CCGAAACCAC GCTCGGCGTC ATGCCAGCCG CTCCGCAACC GGTCACGGTC
GACGGGGATC TCACCCAGCA CACCGAGGTC TCGAGCCACG GGCAGATCCA TCCGAGCAAC
CCCCTCGTGG AGGGCGACAC GTACGCCGTC GAGAGCGCGG TCATCGATCC CGACGCCGCC
GCCCTCCAGC GGGCGGGAAC GGACTACCCC GAGCCGATCT CCGAGCGGTA CCTCCAGACG
CCCGAGGACA CCTCCAAGGA GTTCGAGGCC CGAACGGCCG AGATCACCGC CGACGCCGAC
ACGCCCTACG AGAAAGCCGT CGCGATCGAG GACTACCTCC GGTCGACGAA GGGCTACTCC
CTCGAGGTCG ACCGACCGAA CGGCAACGTC GCCGAGGCGT TCTTACTCGA GATGGACCAG
GGCTACTGCG TCTACTTCGC GACGACGATG GCCCAGATGC TGCGAGCCGA GGACGTTCCG
ACTCGCTACG TCACCGGCTA CACGAGCGGC CAGCAGGTCG ACGACGACGA GTACGTCGTC
CGCGGCACCG ACGCCCACGC CTGGGTCGAG GTCTACTTCC CCGACCACGG CTGGGTCGCC
TTCGAACCGA CGCCGTCCGG CCCCCGCAAC GCGGCCCACA ACGAACAGGT CGAGCAGGCC
CGCGAGGACG GCGCCGAGGA CGTCGACACC GACTCGAGCG AGGACGTGCC GCTCTCCGAG
GAGGAAGCGG AGAACGAACC CGGCGAGTCC CCGTCGGAGA TCATCGACGA CAATGAGAGC
GAGGCCCGGA ACGGCTCCGA ACCCGACACC GGGTACGGGC CGGACAACGA GACCGACAAC
GACTCGACCT CGCCCCAACC GGAGCCCGAT CCGTCCGATC CGAACGCGGA CAACGACTCG
CTCACCGACG AGGAGGACGA CGACGGCTTG CTCAGGGCGC TCGTCGAGCG GGTATCGATC
TCCCGCGAGG CAGCCACCGT CGCCCTCGTC GCCCTCACCG GGTTAGTCGC GAGCGTCCAT
CACACCGGCG CGGTCGCGCG GCTCCGCCGA ACGGTCGGCC GCTACTGGCA GCGGCGCAGC
GACGACCCCG ACCGCGACGC CGAACGCGCC TATCGGCGGC TCGAGCGCCT GCTGGCCGCC
TCCCACCGCC CCCGCGAGCG CTCAGAGTCC GCGCGCGGCT ACCTCGAGGC GCTGGCCGAG
GAATCCGACG TGGGAGTCGA TCCGCGGGCG AAAACCGTCC TCGAGCGGTA CGAACGGGCG
GTGTACGGCG GCGGCGTCGA TCGCGAGGGG GCCGACGAGG CGATGGCGAT CGTCGACGAA
CTCGCGCGTG AGCACCTGCC CGGCGTCGGT CGACGGCGAA AGTGA
 
Protein sequence
MSTNSGSQTG ERTIEIAPDG SVGPGTFRLL ALCCVLVLTA SYVSVLRDVT RVVGGTETLL 
ALVVAMGVAA TVLAWAIRPR TATVAALVAG ALGFAYYLEY TGLGVEVLLS AGDVLLSDAA
ALATGLPLLR MVEAGVWTLA FVPAPVFLSW YLAVRGRYAL GVVPGGIALL FLVLTGDAGT
VPTLVGTLAG IGAVGCGELE RRGGSVAQAD LLTALFALIV VLSLTVTVVP GGSGTPGHFA
GTETGTLEGT IDHASDRSSI GGSVELSPEV RFTVEADQAS YWRTGVYDRF AGDEWVRTGD
RSDYGGPIGK PPGESERVHQ LVTAETTLGV MPAAPQPVTV DGDLTQHTEV SSHGQIHPSN
PLVEGDTYAV ESAVIDPDAA ALQRAGTDYP EPISERYLQT PEDTSKEFEA RTAEITADAD
TPYEKAVAIE DYLRSTKGYS LEVDRPNGNV AEAFLLEMDQ GYCVYFATTM AQMLRAEDVP
TRYVTGYTSG QQVDDDEYVV RGTDAHAWVE VYFPDHGWVA FEPTPSGPRN AAHNEQVEQA
REDGAEDVDT DSSEDVPLSE EEAENEPGES PSEIIDDNES EARNGSEPDT GYGPDNETDN
DSTSPQPEPD PSDPNADNDS LTDEEDDDGL LRALVERVSI SREAATVALV ALTGLVASVH
HTGAVARLRR TVGRYWQRRS DDPDRDAERA YRRLERLLAA SHRPRERSES ARGYLEALAE
ESDVGVDPRA KTVLERYERA VYGGGVDREG ADEAMAIVDE LAREHLPGVG RRRK