Gene Htur_1036 details


Gene Information

Locus tag: Htur_1036
Symbol:
ID: 8741623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 1070364
End bp: 1071665
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 66%
IMG OID: 646511614
Product: major facilitator superfamily MFS_1
Protein accession: YP_003402601
Protein GI: 284164322
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID:
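
The start, end, gene length and protein length values in this record are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS length includes the stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 1070364
END_BP = 1071665


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints: 1302 433
```

Both values agree with the Gene Length and Protein Length fields of the record.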


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCCGAA CCGAAGCCAA CAGAAACCGC TGGTTGATCG CGCTCTCGGC GATCGCGATC 
CACCTCTCGA TCGGATCGAT TTACGCGTAC AGTGTCTATC AGAACCCGCT TCGCGACGAG
TTGGGATGGG CGATATCCGA CGTCTCGCTG GCCTTTACCG TCGCGATCGT CTTTCTCGCG
CTCTCGGCGG CGTTCCTCGG GGGCTTCGTC GAGAACCGCG GACCGCGGAC CTCGGGACTG
ATCGCCGCCG GGACCTTCGG GCTCGGGATC ATCGGCGCCG GCCTCAGCGT CCAGCTCGAG
ACGTACGCCG GCTTCATCCT CACGTTCGGC GTGATCAGCG GGATCGGCAT CGGGCTCGGT
TACATTACCC CGATCTCGAC GCTCGTCCAG TGGTTCCCCG ACCGCCGCGG GATGGCCACC
GGGATGGCCG TCATGGGCTT CGGCGCCGGG GCGCTCGTGA CCGGCCCGGT CGCGAACTAC
ATCATCGAGG CCGCGAGCAT TCCGGTCGCC TTCTACGCGC TCGGTGTCGC CTACTTCCTG
CTGATGGCCG CCGGCGCGAG CTACCTGAAG AAGCCGCCGA CTGACTGGGT CCCGGCGGGA
ATCGACGAGA GCGAGATCGA CACCGCGGAC AACGAGAAGG GGGTCTCCGT CAACACGGAC
CTCGCGGAGC TCACCGGTAG CGAGGCGCTG CGGACGCCGC GGTTCTACCT CGTCTGGCTG
ATCATGTTCA TCAACATCTC GGCGGGGATC ATGCTGCTGT CGGTCGCCTC GCCGATGACC
CAGGCCATCA CGGGAGTCGA AGCGGCGACG GCGGCGTCGG TCGTCGGCCT CATCGGCATC
TTCAACGGCG GCGGTCGGAT CTTCTGGGCG ACCACCTCCG ACTACATCGG GCGGACGACC
ACGTACGGCG TGTTCTTCGG CCTGCAGATC GTCGCCTTCC TGCTGATGCC CCAGCTCAGC
AACCTCTGGC TGTTCTCGAG CCTGATGTTC CTGATCATCA CCGCCTACGG CGGCGGGTTC
GCCTGCCTGC CGGCGTACCT GGGCGACCTG TTCGGAACGA AAGAACTCAG CGCCATCCAC
GGCTACACCC TGACGGCGTG GGGCGCCGCC GGCGTCGCGG GGCCGATGCT CATCTCGGAG
ATCGTCGAGC GGACCGGTAG CTACGTGATG GCGTTCTACA TCGTCACCGG AGCGTTGGTC
GTCGGACTGG CTTCCGTGGC CGTCCTCTAC GTCCGAATCA AATCCGTTCG CGACGCGCGC
GGCGGTCCGA GTCGCCGGCC GTCCGAGCAG GCGACCGACT GA
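
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation. As a minimal sketch, the helpers below strip that layout and compute a GC percentage. The function names are hypothetical, and only the first line of the sequence is used as sample input, so the printed value describes that 60-base fragment and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from this record (sample input only).
FIRST_LINE = (
    "ATGGTCCGAA CCGAAGCCAA CAGAAACCGC TGGTTGATCG CGCTCTCGGC GATCGCGATC"
)

if __name__ == "__main__":
    fragment = clean_sequence(FIRST_LINE)
    print(len(fragment), gc_percent(fragment))  # prints: 60 60.0
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` gives the figure to compare with the GC content field of the record.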
 
Protein sequence
MVRTEANRNR WLIALSAIAI HLSIGSIYAY SVYQNPLRDE LGWAISDVSL AFTVAIVFLA 
LSAAFLGGFV ENRGPRTSGL IAAGTFGLGI IGAGLSVQLE TYAGFILTFG VISGIGIGLG
YITPISTLVQ WFPDRRGMAT GMAVMGFGAG ALVTGPVANY IIEAASIPVA FYALGVAYFL
LMAAGASYLK KPPTDWVPAG IDESEIDTAD NEKGVSVNTD LAELTGSEAL RTPRFYLVWL
IMFINISAGI MLLSVASPMT QAITGVEAAT AASVVGLIGI FNGGGRIFWA TTSDYIGRTT
TYGVFFGLQI VAFLLMPQLS NLWLFSSLMF LIITAYGGGF ACLPAYLGDL FGTKELSAIH
GYTLTAWGAA GVAGPMLISE IVERTGSYVM AFYIVTGALV VGLASVAVLY VRIKSVRDAR
GGPSRRPSEQ ATD
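
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a plain-Python illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in TCAG order for each codon position ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(text.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record (sample input only).
FIRST_LINE = (
    "ATGGTCCGAA CCGAAGCCAA CAGAAACCGC TGGTTGATCG CGCTCTCGGC GATCGCGATC"
)

if __name__ == "__main__":
    print(translate(clean_sequence(FIRST_LINE)))  # prints: MVRTEANRNRWLIALSAIAI
```

The output matches the first 20 residues of the protein sequence listed in this record.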