Gene Htur_3550 details

Gene Information

Locus tag: Htur_3550
Symbol:
ID: 8744170
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 3652816
End bp: 3654408
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 73%
IMG OID: 646514131
Product: conserved repeat domain protein
Protein accession: YP_003405085
Protein GI: 284166806
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID: [TIGR01451] conserved repeat domain
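
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3652816, 3654408)
print(length, protein_length_aa(length))  # prints: 1593 530
```

Both values agree with the Gene Length and Protein Length fields of the record.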


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAGAC GGCTGCGCTG GCGCGGCGCG ATCGCGGCGA CGGTCGTGCT CGTCCTCGCC 
GGCCTGCTCG ACGCCAGTCC CGTCCTGTTG CTCAGTGCGA TCGTCCCGCT CGTCTACGTG
GCCTACGGCT CGCTGTCGAC CGTCTCCGTG CCCGAGGGAC TCGCGGCGAC CCGCGAAATC
TCGCCGACGC CGGCGGCGCC CGGCCGACCG GTCACCGTGA CGCTGACGGT GCGAAACGAG
TCCGATCGGA CGGTCACCGA CTGTCGACTC GTCGACGGCG TCCCCGAGGA ACTCGCCGTC
CTCGAGGGAT CGCCGCGGGC CGGCGTGACC CTCGAGCCCG GCGAGGAACG CCGCATCGAG
TACTTGGTCG TCGCCAGACG CGGCGAGTAC AAGTTCGACG CCCCGGAGTG TCGCGTTCGC
GGACTCGGCG CGAGCGCGGT CGCGACGACG CGGCTGTCGA CCACGGGCGC GGAACGGCTG
GTGTGTCGGC TCGACGCCGA CGCGCCGCCG ATCGAGGAGA TCGGACGCGG CCGAATCGGA
CAGTTGACGA CCGACCGACC CGGCGAGGGG CTCTCCTTTC ACTCCGTCCG CGAGCATCGG
CCGGACGATC CGGCCGATCG GATCGACTGG CGCCACTACG CGAAACGCGG GACGCTCGCG
ACGATCGAGT ACGAGCGCCA GGTCGCGGCG ACGGTCGTGC TGGTCGTCGA CGCCCGCCCG
TCCAACGCGG TCGTCGCGGG GCCCGGCCGC CCGACCGCCG TCGAGTTCGC GGCCTACGCG
GCGACCCGGA CGCTTTCGGA CCTGCTCGGA CACGGCCACG ACGTCGGCGT CGCCGTCGTC
GGCCGCGACG GCAACGGGCC CGCCGGCCTC CACTGGATCG AGCCGGCGAA CGGCCGCGAG
CAGCGCACGC GCGCACTCGA GGTCATCCGC TCGGCGACTT CCTCGAAGGC CTCGAGCGGG
CGGTTCTCGG AACCTTCGTC GAGGAATCGG GACCGCGGAC GACTGCCCCG ACAGGTCCGT
AAGGTGCGCG AACTCGCGCC AGCGGGGGCG CAGGTGGCGC TGTTCTCGCC GGTCCTCGAC
GATCAGTCGG TCACGGCGGT CGAGCGCTGG CGCGGGGGCG GTCTCCCCGT CGTCGTGCTC
TCGCCGGACG TCGTCCCGGG CAACACCGTC AGCGGCCAGT ACGCACAGCT TCGGCGGCGG
ACCCGGCTGG CCCGCTGTCA GGCGCTGGGC GCCCGGACCT TCGACTGGCG CCGCGGGACG
CCGCTGCCAG TCCTCATCGA ACACGCCTTC ACCGCCGACG CGCGGCTCTC GAGCGCCCGG
CTCTCGGGCG GATCTGGCGG CGGTCGCGGC CGGAGCGGCG GCGAAACCGA CAGCGAGACC
GGATTCGGTG GCGAAGCCGG ATCCGGGACC GGTGGCGTCG ATTCGACGGC GTCGGCGACG
CTCGAGTCGG AATCGGAGGC GACCGAGTCG GAGTCGACGA CGGCCGAGTA CACGTGGCGG
TCGCCGGCCG ATGGCTCCGA GGGGACCGAA CGCGAGAGAT CGGTCTCGAC CGACGCCGAC
GGCGCCTCGG CGAAACGAGG AGGTGATCGG TAG
 
Protein sequence
MSRRLRWRGA IAATVVLVLA GLLDASPVLL LSAIVPLVYV AYGSLSTVSV PEGLAATREI 
SPTPAAPGRP VTVTLTVRNE SDRTVTDCRL VDGVPEELAV LEGSPRAGVT LEPGEERRIE
YLVVARRGEY KFDAPECRVR GLGASAVATT RLSTTGAERL VCRLDADAPP IEEIGRGRIG
QLTTDRPGEG LSFHSVREHR PDDPADRIDW RHYAKRGTLA TIEYERQVAA TVVLVVDARP
SNAVVAGPGR PTAVEFAAYA ATRTLSDLLG HGHDVGVAVV GRDGNGPAGL HWIEPANGRE
QRTRALEVIR SATSSKASSG RFSEPSSRNR DRGRLPRQVR KVRELAPAGA QVALFSPVLD
DQSVTAVERW RGGGLPVVVL SPDVVPGNTV SGQYAQLRRR TRLARCQALG ARTFDWRRGT
PLPVLIEHAF TADARLSSAR LSGGSGGGRG RSGGETDSET GFGGEAGSGT GGVDSTASAT
LESESEATES ESTTAEYTWR SPADGSEGTE RERSVSTDAD GASAKRGGDR
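
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the method used by the database: it assumes the sequence is read in frame from its first base, uses the codon-to-amino-acid assignments shared by NCBI translation tables 1 and 11, and ignores the alternative start codons that table 11 permits (this gene begins with ATG, so that does not matter here). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base outermost, third base innermost).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCAGAC GGCTGCGCTG GCGCGGCGCG"))  # prints: MSRRLRWRGA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` can be compared with the GC content field.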