Gene Htur_1414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1414 
Symbol 
ID8742005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1468262 
End bp1470172 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content67% 
IMG OID646511992 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003402975 
Protein GI284164696 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACCA AAGAAAATCA TGATGGATCG GCCACCGAGG GAGGTGACCG ACTCACGCGA 
CGCGGCTACG TCGCCACGGC CGCCGCGGCG CTCGGGACGA GCGTGCTCGC GGGCTGTAGC
GGCAGTCGCG GCACCAGCCT CGAGCCGGAC GCGCCCGATG GGGTACCGGA AACGGTCGAG
ACGCAGTACT GGCGCGAGTG GGAGACGATC GACGCCGACT CGCCGCCACT GGAGTACAGC
GCGACGGCCG GTGCGGTGCT CGATCGGTTC CCCGTCGAGT TCTCGAGCGA GGACGACCCG
TGGATGCGCG AACACGCGTT GATGGTCAAG CGGGGGCTCG GCGATTTGGG TATCGCCGTC
GAACTCAACG ACCGTCCGCT GAATCAGCTG TACGCCCAGA GCTGGGAAAC GCGCGGACTC
GAGGCGATCG TCTCGATGAG TACGCACGGA CCGGACCCGC AGCGGGGGCT CGATCCGAAC
CCGCTGTTGA TGCGGCGGAC CGAGGGCTCG CTCTCGAACT ACGATAACTA CTACCATCCG
GAACTCCAGG AGGTGCTCAC CGAGCAGGCC CAGACGACCG ATCGGGCCGA ACGCGAGGAG
CTCGTCGACC GGGCACAGGA ACTCTTCGCC GAGGACGTCG GGGCGCTCAT CACGCTCTTC
CCGGAGATCA TCACGGCGGT GAACACGGAC CGATGGACCG GCTACGTGGA GACGCCGGGG
AACGGCCCGA CGATGGACTC GTTCGTCTGG ACGGAGGTCA ACCTCCAGCC CGAGACGGAC
AACCGGACCT ACGTCAAGGG CGTCACCACG TCGATGAACT CGCTGAACCT GCCGTGGGCC
GCCGGCGGCG CGGAGGCCAA TCGACTTACG TTCATCTACG ACGGGTTATT CGACGCGACG
CCCGATTTGG ACGTGGCCCC CGCACTGGCG ACCGGCGGCG ATTTCGTCGA CGACACGACC
GTCGAACTCA CGCTGCGCGA GGGCGTCGAG TGGCACGACG GCGAGGCGTT CACCGCCGAG
GACGTGAAGT TCACCGTCGA ACTGTACAAG GAGTACTCCT CCACGAGTCA GGTCCCGTTC
TACGAGCCGA TCGAGTCCGT CGAGGTACTC GGGGACCACG AGGTCCGGTT CGAGCTGTCG
AACCCCGACG CCTCGTTCAT GACCCAACGG GTCGTCCGGA GCGTCATCCT CCCGAAACAC
CGGTGGGAGG ACGTCGACAA CCCGTCGCAG CACAACCCGG ACGCCCCCGT TGGCACCGGC
CCCTTCCAGT TCGAGAACTG GGAGCAGGGG ACCCGATTCG AGGCCACGCG CAACGACGAC
CACTGGATGT TCGACGACGA CTGGCGGGCC GACGCCCTCG GCGAGCAGGC CGAGCGCGGC
CCCGGCATCG AGAGCGTCAT CTGGATCAAC GTGAGCAACG TCGACGCGCT GATCGGCTCG
CTCCAGAGCG GATCGATCGA CGCCATCGGG ACGACCCTCT CGACCCTGCA GGCCGACCGG
GCGGCCAACA CGGACGGGAT CGAGAAGCTG TCGACCGGGA GCTACGCGCC GCTCGACACG
AAGCTCATGT TCTCCTGTCC GCCGATCAGG GACAAGGAGT TCCGCGTCGC ACTGGCGAAA
GCGGTCGACT CGCAGGGGTT CGTCGACGAC TTCCTCGACG GGCAGGCGAC GGTGCCGGCC
GGCGAGAACC CGATCTCGTC GCTCACCCAG TGGCACAACG CCGACACGAC CGACTACAGT
TACGACGTCG AGGAAGCCCG GAACGTCCTC GAGCGCGCGG GCTACACCTG GGACGACGAC
GGCAACCTGC GGTTCCCCAA CGGCGAGGCG TGGGGCGCGT TCGTCGACCG CATTCAGCCC
GAGAACACCC ACAAACGCCG CTCGGAGCTC GGCCAGCCCG ACTTCTCATG A
 
Protein sequence
MTTKENHDGS ATEGGDRLTR RGYVATAAAA LGTSVLAGCS GSRGTSLEPD APDGVPETVE 
TQYWREWETI DADSPPLEYS ATAGAVLDRF PVEFSSEDDP WMREHALMVK RGLGDLGIAV
ELNDRPLNQL YAQSWETRGL EAIVSMSTHG PDPQRGLDPN PLLMRRTEGS LSNYDNYYHP
ELQEVLTEQA QTTDRAEREE LVDRAQELFA EDVGALITLF PEIITAVNTD RWTGYVETPG
NGPTMDSFVW TEVNLQPETD NRTYVKGVTT SMNSLNLPWA AGGAEANRLT FIYDGLFDAT
PDLDVAPALA TGGDFVDDTT VELTLREGVE WHDGEAFTAE DVKFTVELYK EYSSTSQVPF
YEPIESVEVL GDHEVRFELS NPDASFMTQR VVRSVILPKH RWEDVDNPSQ HNPDAPVGTG
PFQFENWEQG TRFEATRNDD HWMFDDDWRA DALGEQAERG PGIESVIWIN VSNVDALIGS
LQSGSIDAIG TTLSTLQADR AANTDGIEKL STGSYAPLDT KLMFSCPPIR DKEFRVALAK
AVDSQGFVDD FLDGQATVPA GENPISSLTQ WHNADTTDYS YDVEEARNVL ERAGYTWDDD
GNLRFPNGEA WGAFVDRIQP ENTHKRRSEL GQPDFS