Gene Htur_2894 details


Gene Information

Locus tag: Htur_2894
Symbol:
ID: 8743511
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 2965855
End bp: 2967120
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 62%
IMG OID: 646513479
Product: Extracellular ligand-binding receptor
Protein accession: YP_003404436
Protein GI: 284166157
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
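
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2965855
end_bp = 2967120
reported_gene_length = 1266      # bp
reported_protein_length = 421    # aa

# Assumption: Start bp and End bp are 1-based and both inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length == reported_gene_length)        # True
print(remainder == 0)                             # True: whole number of codons
print(protein_length == reported_protein_length)  # True
```

Under these assumptions the reported gene length, protein length, and coordinates agree with one another.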


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTCGAG ATAACAGGCC GCGTGGCCGG AGGCGTGATT TACTAAAGGC AATTACCGCT 
GGCGGTACGA TCGGGATATC CGGATTGGCC GGCTGCGTCG GCGATCCGGA CGAGTTGGGT
GGCGGAGACG ACGAGAACTT CGACACGGTC CAGTTCGGCG TTCTCGAGCC GAGGACAGGC
GAGTTTAGCG CTCTCGCGAA GGAACACTAT CAGGGAACCG AACTGGCGAT CCAGCAGATC
AACGACAGCG ATGAGTACGA CTTCACGATC GAACACGAGG AGTACGATAC GCAACTCGAC
CCGGCGACGG CGACCCAGCA GGCCCAGCAG GCGGTCCAAT CCGACGGCGC ACAGTTCATC
AGCGGCTGTA TCTCGAGTTC GGCCGCGCTC GCGATCAACA GTTTCGTCGC CGATAACGAA
GTCGTCTACA CGCCGGGAGC GGCGGATATC TCGATCACCG GCGAGAACTG CAACGAGTAC
GTGTTTCGGT TCGAGACGAG CACCGCACAG ATCGCGGAGG TGATGGCCCA GTGGACCGCC
GACGAACTCG GCGATCAGAT CGTCTATCAC ATCGCGGACT ACGCGTACGG CGAGTCGGTA
CTGAACGAGG TCGAGACGCG AATGGAGTCC ACTAGCGACT CCTACGAGCG GGTCAACGTA
ACCAGGTCGG ATCAGGGCTC GACGAACTTC GAGGCGTTCA TCAGTCAGAT TTCGGACGTC
AGTGACGAGG CCGACGCGCT CGTCGTGGGG ATGACCGGTG CCGACCTCGC GATTTTCCTC
TCGCAGGCCA GTTCGCGCGG CCTGCCGGAC GAGATCCCCA TCGTGACGAC GACCGGTTCG
TTCCGAGCCG TACGGGCGGG CGGCGGAGAG GGTGTGTACA ACACCTACAG CGGTGTTCGA
TACGTTCCGG AGATCGAAAC CGGAGACAAC CAGGAGTTCG TCCAGGCCTA CGAAAGCGAG
TACGACGCCC CGCCGGACAA CTTCTCGCGC GTCGGCTACG AATCGATCCG CATGGTCGCC
AACGGCATCC GCGAGGCGGG GTCGCGCGAT CCCACGACGG TCAGGGAGTC GCTCTCGGGC
ATGGAACACG ACACGATCTT CGGTCCCAAC CGGTTCCGGA AGTGCGACCA GCAGGCGATG
AATCCGGTCT GGATGGGCGA GTGCGTCGAA CCGGACTCCG GGGAGCTCGC CGACGTCGAA
CTCCTGACCC AACTCTCCGG CGAAGAGGCC GCACCCGACT GCGAGGAAAC CGGCTGTGAA
CTGTAA
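
A GC content figure such as the one reported in the Gene Information section can be recomputed from a sequence laid out in space-separated blocks of ten, as above. The sketch below is one possible approach, not the method used to produce the record; `gc_fraction` is a hypothetical helper name, and only the first row of the gene sequence is used as sample input, so its result need not match the whole-gene value.

```python
def gc_fraction(seq_text: str) -> float:
    """Return the fraction of G and C bases, ignoring all whitespace."""
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First row of the gene sequence above (sample input only).
first_row = "ATGGCTCGAG ATAACAGGCC GCGTGGCCGG AGGCGTGATT TACTAAAGGC AATTACCGCT"
print(f"GC fraction of first row: {gc_fraction(first_row):.2f}")
```

Passing the full gene sequence text instead of `first_row` would give the value for the whole gene.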
 
Protein sequence
MARDNRPRGR RRDLLKAITA GGTIGISGLA GCVGDPDELG GGDDENFDTV QFGVLEPRTG 
EFSALAKEHY QGTELAIQQI NDSDEYDFTI EHEEYDTQLD PATATQQAQQ AVQSDGAQFI
SGCISSSAAL AINSFVADNE VVYTPGAADI SITGENCNEY VFRFETSTAQ IAEVMAQWTA
DELGDQIVYH IADYAYGESV LNEVETRMES TSDSYERVNV TRSDQGSTNF EAFISQISDV
SDEADALVVG MTGADLAIFL SQASSRGLPD EIPIVTTTGS FRAVRAGGGE GVYNTYSGVR
YVPEIETGDN QEFVQAYESE YDAPPDNFSR VGYESIRMVA NGIREAGSRD PTTVRESLSG
MEHDTIFGPN RFRKCDQQAM NPVWMGECVE PDSGELADVE LLTQLSGEEA APDCEETGCE
L
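
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. This is a minimal sketch with hypothetical names (`CODON_TABLE`, `translate`). It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which codons may act as start codons; start-codon handling is not modelled here.

```python
# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(seq_text: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    bases = "".join(seq_text.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence above.
first_row = "ATGGCTCGAG ATAACAGGCC GCGTGGCCGG AGGCGTGATT TACTAAAGGC AATTACCGCT"
last_row = "CTGTAA"

print(translate(first_row))  # MARDNRPRGRRRDLLKAITA
print(translate(last_row))   # L
```

The first row yields the first twenty residues of the protein sequence, and the final row yields the terminal L followed by the stop codon.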