Gene Htur_3068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3068 
Symbol 
ID8743688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3146847 
End bp3148265 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content70% 
IMG OID646513652 
ProductLVIVD repeat protein 
Protein accessionYP_003404606 
Protein GI284166327 
COG category[S] Function unknown 
COG ID[COG5276] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCGGA GAGCATTCCT CCGAGCGGGT GGGGCCGCCG GCGCCGTCCT GGCCGTCTCG 
GGAGCGGCGG TCGGGACGTC CACAGCGACG CCGCGCGCCA GCCAGGAGGT CCCCGACTCG
TTCGAACCGC TCGGGCAGCT CGAGCTACCC GACAGCGACC CCGCCGAAGT GGTCATCGAC
GACCACGGCG AGACGGCGTA CCTCGCCACG ACGTACGGGT TCGCGACCGT CGACCTCGGC
GACCCGACCG CGCCGGAGCT GCTCGCCGAA CGAACTCGGC TCGAGGCCGA CGGCCAGGAG
TTCACGCGGA TTTTCGACGT CAAAGTCGAC GGCGACCGAC TCGCGGTCGT CGGGCCCGCA
GACGAGGGGT TCGGCGAGTT CAACGGGTTC GAACTCTACG ACGTCAGCGA CCCCGCGGAC
CCCGCGGTCG TCGACCGCTA CGAGACCGGC TTTCACATCC ACAACTGCTT TTTCGCGGAC
GAGCTGCTGT ACGTCGTCAA CAACGGCCCC GACGATACCG CGCTCGTCAT CTACGACACG
AGCGACGACG ACACCGAGGC GGTCGGCCGC TGGTCGCTAC TCGACCACGA CCCCGAGTGG
GAGGACGTCT ACTGGTACGT CCATTATCTC CACGACGTCA CCGTCCACGG CGACCTCGCC
GTCTTCCCGT TCTGGGACGC CGGCACCTAT CTGGTCGACG TCAGCGACCC GAGCGATCCA
AAGTATGTCT CACACGTGCG CGACCCCGAC GTCAGCAGGG ACCGAAGCTA CGGGGAGAGG
GAGGCCGTCT ACGGCCTGCC GGGCAACGAC CACTACGCGA CGGTCGACGA CGCCGGCGAG
CTCCTCGCGG TCGGTCGCGA GGCCTGGACG ACCGGCGGCT CGGCGCCGGA CGGGCCGGGC
GGGATCGACC TCTACGACGT CACCGATCCG TCGGCGCCGG AGCCGCTGGC GTCGATCGAG
CCGCCCGAAA GCGACGACGC GTCCCGCAGG GGCGGCGAGT GGACGACCGC CCACAACTTC
GAGTTACGCG ACGGGCGCCT CTACTCGGCG TGGTACCAGG GCGGCATCAA AATACACGAT
GTGAGCGACC CCGCCGCTCC CGAGGGACTC GCCCACTGGC GGGCGACCGA CGACGCCGCG
CTCTGGACGG CACGCGTCGC CAACGACGGC GCGACGGTCG TCGCGAGCAG CACGTCGCGG
CTCCCTGCCA CGGACATCGA CGGCGCGCTG TACACCTTTC CGACCGGACT CGAGAGTGAC
GGATTCGAGA CGGGCGGCGA CGACAACGGC TCCGATGGCG ACGGGAACGA TTCCCTCAGC
GACCGGGTTC CCGGCTTCGG AGGACTCAGC ACTGGGATCG GACTCGCCGG CAGTGCGGCC
GCCCTCGAGT GGGTCCGTCG ACGCGGCGAC GATCGGTGA
 
Protein sequence
MQRRAFLRAG GAAGAVLAVS GAAVGTSTAT PRASQEVPDS FEPLGQLELP DSDPAEVVID 
DHGETAYLAT TYGFATVDLG DPTAPELLAE RTRLEADGQE FTRIFDVKVD GDRLAVVGPA
DEGFGEFNGF ELYDVSDPAD PAVVDRYETG FHIHNCFFAD ELLYVVNNGP DDTALVIYDT
SDDDTEAVGR WSLLDHDPEW EDVYWYVHYL HDVTVHGDLA VFPFWDAGTY LVDVSDPSDP
KYVSHVRDPD VSRDRSYGER EAVYGLPGND HYATVDDAGE LLAVGREAWT TGGSAPDGPG
GIDLYDVTDP SAPEPLASIE PPESDDASRR GGEWTTAHNF ELRDGRLYSA WYQGGIKIHD
VSDPAAPEGL AHWRATDDAA LWTARVANDG ATVVASSTSR LPATDIDGAL YTFPTGLESD
GFETGGDDNG SDGDGNDSLS DRVPGFGGLS TGIGLAGSAA ALEWVRRRGD DR