Gene Htur_2478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_2478 
Symbol 
ID8743087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp2543279 
End bp2544253 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content62% 
IMG OID646513064 
Productphosphate binding protein 
Protein accessionYP_003404029 
Protein GI284165750 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR02136] phosphate binding protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAGAG AAACCACGGC GTCCCTGCCG ACCGGCTTCG GTCGGCGTGA TTTTCTCGCC 
GCGGCTGGCG TCGCTCTTTC CGGTGGGTTA GCCGGCTGTA GCGGCGTCTT CGCCGCGGAA
GGGAAACAAG TGAACGTCGC GGGGAGCAGC ACGGTGTTTC CCGTGACTGA GGCGATCGCC
TCTGCGTTCT CGGAGAAAAA CCCGACCGTC AATATCTCGC TCAGCAAGAC CGGGACCGGC
GGCGGGTTCG GGAACTTCTT CTGTGCGGGG CGAACCGACA TTAACAACGC GAGTCGGGAG
ATCGCCGATG CCGAAATCGA ACAGTGCGGA AACAACGATG TCACACCGAT CGAGTTCCAG
ATCGCGACCG ACGCGCTAAC GGTGGTCGTC AATCCGGAAG CCGACTGGGT CGACTGTCTC
ACAGTCGAAC AACTCCAGGA GATCTGGCGG ACCGACGGCG CCCAGCGGTG GAGCGATCTC
AACGACGAGT GGCCCGACGA GGAGATCGAA CTGTACGGCG CCGCGACCAC CTCGGGCACG
TTCGACTACT TCAACGAGGC GGTCCTCGGC GAGGAACTGA ACCACCGTAG CGACTACTAC
GCGACCGAAC GGGATCGAAC GATCGTTCAG GGAGTCCGCG GTTCGGAAAC TGCTATGGGG
TACTTCGGCT TCTCGTACTA CAGCGAGAAC CCGGATTCGA TCAAAGCCGT CTCGATCGAC
AGCGGTGACG GTTGTGTCGA GCCGTCGATC GAGACGGCCA TGTCCGGCAA GTACACGCCG
CTTTCGCGAC CGCTGTTCAT CTACGTCGCG AAGGAATCGC TCACGAAACC GGCGGTCCGT
AACTTCGTTC GGTTCTACAT GGAACAGGCC GCGACGGATC TCGTCTCCGA CGTCGGCTAC
GTGCCGATCA CCGAGGACAA GCGCGACGAG AACCTCGAGA AACTCGACGC CGCGGTCGCG
GAGGTGACCG AATGA
 
Protein sequence
MSRETTASLP TGFGRRDFLA AAGVALSGGL AGCSGVFAAE GKQVNVAGSS TVFPVTEAIA 
SAFSEKNPTV NISLSKTGTG GGFGNFFCAG RTDINNASRE IADAEIEQCG NNDVTPIEFQ
IATDALTVVV NPEADWVDCL TVEQLQEIWR TDGAQRWSDL NDEWPDEEIE LYGAATTSGT
FDYFNEAVLG EELNHRSDYY ATERDRTIVQ GVRGSETAMG YFGFSYYSEN PDSIKAVSID
SGDGCVEPSI ETAMSGKYTP LSRPLFIYVA KESLTKPAVR NFVRFYMEQA ATDLVSDVGY
VPITEDKRDE NLEKLDAAVA EVTE