Gene Htur_1820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1820 
Symbol 
ID8742414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1895180 
End bp1896919 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content69% 
IMG OID646512398 
Productmembrane-flanked domain protein 
Protein accessionYP_003403378 
Protein GI284165099 
COG category[S] Function unknown 
COG ID[COG3428] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCGTC TGCACCCACT CAGCGCGGCC GCCTACGCCC TCCAGTACGG GTTCCTCTGG 
CTGTCGGCCG CGACGATCCT CACGCTCGTC CTCGGCGGGA TCTTCGGTCC CATCGATTCG
GCCTGGGTTC CGATCGCCGC ACCGGTGGGA CTCGTCGCCG GCGCGGCCTA CGGGATCGCC
TACTTCTACC GGTTCGAGTA CGGCATCACT CCGGACACGT TCGACGTCTC GTCGGGCGTG
TTCGCTCGCC GATCGCGCGA AATCCCGTAC GAACGCATCC AGAACGTCGA CGTCCGGCAG
GGAGTGGTCC AGCGACTGCT GGGCCTCGCC GTCGTCTCGA TCGAGACCGC CGGCGGCGGC
AGCACCGAGG CGGCGCTGAA CTTCGTCAGC GAATCGGAAG CCACCCGACT GCAACACCAG
ATCCGAACGC GGACAGCCGA TGTGAGGGAT CGCCGGCACG AGCGCGGGCG GCGCGACGCG
TCGGCGACGA CCGACGAACG GACGATCGAC GAGGCCTCGA GCGACACCGA GACCGAGCGG
ACGAACGCCA CGACCGACCT TGACGAACCG GTACCCGACG CCGGGGAATC AGCGGCGGAA
CCCGACTCGA CGGACGGTAC GACAACGAGC GACTCGGGAC CGGTCTCGGA CAGTCGCGAT
CGCGTCGCCG GACCCGACTC GCGGGGACCG CGTCGACAGC ACCTGTTCGC ACTCGAGGCG
CGGGAACTCC TGCTCTACTC GTTTACGTCG TTCCGTCCCG CCGCCGCCGC GGCCCTCCTG
GGGCTGTTCT TCTTCGCGAC CGACCTCGCT ATCAGCCTGC TCGTGAGCGC CGCGCGGCCG
TTCGGCGGTC CCGCGAATCT GGGCGAGGGA TCGCCGACCA GTTACGGCAT CCTCACGGTC
GTGTCGGTCG TCAACGGCGT CGTGACCGCG TACGTGCTGA GCGTCGTCTA CACGTTCGCC
GCCTACTACG ACTTCCGGCT CGGTCGGGCC GGCGGAGACT TCGTCTACGA GCGCGGGCTG
CTCCAGCGCT ACAGCGGGTC GATTCCCGTC GAGAAGGTCC AGTCGGTGAC GGTGAGTGCC
AACCCCCTCC AACGGCTGCT CGGCTACGCC GGGCTGTGGG TCGAGACGGC CGGCTACGGC
CCCGACAGCG ACAGCGGTGG CAGCCAGTCC GCCGTGCCCC TGGCGGAACG GGGTCGCGTC
CACCGATTCA CCGAGACGCT TACCGGCGTC GAATCGCCGC GCTTTCGGAG CCCACCGACG
ACGGCGCTCC GACGGTATCT CGTCCGGTAC GCCATCGTCG CAACGGTCGT CGTCGCGGCC
GCGTTCGCCG TCACGCGGGT AACGGTCCTC GAACGCTGGT ACGTCGCCGC CGTCGTCTTC
GTCGCAGTCC CGCCCGCGGC CTACCTGAAG TACGTCCACC TGGGCTACTA CGTCGGCGAG
GATCACCTCG TCGTCCGCCG CGGGTTCTGG AAGCAACGGA CGACCGTGAT CCCCTACTAC
CGCATCCAGA CGGTCTCGAC CCGGCGATCG ATCTTCCAGC GTCGCCTCGG ACTCGCGTCG
CTGGTCGTCG ACACCGCGAG CTCGCGCAGT TTCTCCCGGG CGTCGCCGAC GATCTACGAC
GTCAATCTCG AGGACGCGCG AGACGTCCAC GGCACCGGTC GAAAACGCCT GCAGACGGCC
CTTCGCGAGC GCGCTCGGGC CGACGACGGC GGTCCCGGAC TCACCGTTGA CTTCACCTGA
 
Protein sequence
MNRLHPLSAA AYALQYGFLW LSAATILTLV LGGIFGPIDS AWVPIAAPVG LVAGAAYGIA 
YFYRFEYGIT PDTFDVSSGV FARRSREIPY ERIQNVDVRQ GVVQRLLGLA VVSIETAGGG
STEAALNFVS ESEATRLQHQ IRTRTADVRD RRHERGRRDA SATTDERTID EASSDTETER
TNATTDLDEP VPDAGESAAE PDSTDGTTTS DSGPVSDSRD RVAGPDSRGP RRQHLFALEA
RELLLYSFTS FRPAAAAALL GLFFFATDLA ISLLVSAARP FGGPANLGEG SPTSYGILTV
VSVVNGVVTA YVLSVVYTFA AYYDFRLGRA GGDFVYERGL LQRYSGSIPV EKVQSVTVSA
NPLQRLLGYA GLWVETAGYG PDSDSGGSQS AVPLAERGRV HRFTETLTGV ESPRFRSPPT
TALRRYLVRY AIVATVVVAA AFAVTRVTVL ERWYVAAVVF VAVPPAAYLK YVHLGYYVGE
DHLVVRRGFW KQRTTVIPYY RIQTVSTRRS IFQRRLGLAS LVVDTASSRS FSRASPTIYD
VNLEDARDVH GTGRKRLQTA LRERARADDG GPGLTVDFT