Gene Htur_4780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4780 
Symbol 
ID8745370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013745 
Strand
Start bp394304 
End bp395635 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content65% 
IMG OID646515278 
Productprotein of unknown function DUF21 
Protein accessionYP_003406225 
Protein GI284172843 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000717269 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGTCG GCGCAGTCGC CGGCGGCCTG CTCCCATCGG ATCTCGTGGC TCCCGCCGGC 
GTCGCGGCGC TGCTGGTGTT GCTGGTCCTG TCAGGGTTCT TCTCCTCGGC GGAGATCGCG
ATGTTCTCGC TGGCCCAGCA CCGCATCGAG GCGCTCGTCG AGGACGGCAC TCCCGGCGCC
GAGACCGTCC AGGCGCTCAA GGACGATCCC CATCGGCTGC TGGTGACGAT CCTCGTCGGG
AACAACCTCG TCAACATCGC GATGTCGTCG ATCGCGACCG GACTGTTCGC GATGTACATG
AGCCAGGGCC GAGCGGTGCT GGCGGCGACG TTCGGCGTGA CGGCCGTCGT CCTGCTGTTC
GGCGAGAGTG CTCCCAAGTC CTACGCCATC GAGAACACCG AATCGTGGGC GCTGTCGGTC
GCTCGTCCCC TCAAAATCTC GGAGTACGCG CTGTTTCCGC TCGTGATCAC GTTCGACTGG
CTGACCCGCG TAATCAATCG GCTGACCGGC GGCGGTACGG CCGTCGAAGA GTCGTACGTG
ACCCGCGAGG AACTCCGGAA CCTGATCCGG ACCGGCGAGA GCGAGGGGAT CATCGAGACC
GACGAGCGCG AGATGCTCCA GCGCGTGTTC CGGTTCACCG ACACCATCGC CAAAGAGGTG
ATGACGCCGC GACTCGACGT CACCGCGGTC GCTCGAGAGG CCAGCGTCGA CGAGGCCGTC
GCGAAATGCG TCGAGAGCGG CCACACCCGC CTGCCGGTCT ACGACGGCGA CCTCGACACC
GTCGTCGGCG TCGTCGAACT CGGCGACCTC GTCCGCGACC GTCAGTACGG CGAGACCGAA
GACGAAACCC TCGAACTGTA CCTCGAGGAG ACGCTGCACG TACCGGAGAG CAAGCAGGTC
GACGAACTGT TCCGCGAGAT GCGCCAGCAG CGCGTCGAGC AGGTCGTCGT CATCGACGAG
TTCGGAACGA CGGAAGGGAT CGTCACGACC GAGGACATCG TCGAGGCGAT CGTCGGCGAG
ATCCTGGAGA CGCAGGAGGA CGAACCAATC GAGGTCGTCG ACGACCGAAC CGTTCGCGTC
AACGGCGAGG TCAACATCGA GGACGTCAAC GACGCCCTCG AGATCGACCT GCCGGAGGGC
GAGGAGTTCG AGACGATCGC CGGCTTCGTC TTCAATCTCG CCGGCCGACT GGTCGAACCC
GGCGAGACGT TCACGTACGA CGGTGTCGAT CTCACGGTCG AAACCGTCGA TACGACGCGC
ATCAAACGCG TTCGGATCGT CGAACCGGAA CCGTCGGCGA CTGATGACTC CGGTATCTCC
GCGACGAGTT GA
 
Protein sequence
MLVGAVAGGL LPSDLVAPAG VAALLVLLVL SGFFSSAEIA MFSLAQHRIE ALVEDGTPGA 
ETVQALKDDP HRLLVTILVG NNLVNIAMSS IATGLFAMYM SQGRAVLAAT FGVTAVVLLF
GESAPKSYAI ENTESWALSV ARPLKISEYA LFPLVITFDW LTRVINRLTG GGTAVEESYV
TREELRNLIR TGESEGIIET DEREMLQRVF RFTDTIAKEV MTPRLDVTAV AREASVDEAV
AKCVESGHTR LPVYDGDLDT VVGVVELGDL VRDRQYGETE DETLELYLEE TLHVPESKQV
DELFREMRQQ RVEQVVVIDE FGTTEGIVTT EDIVEAIVGE ILETQEDEPI EVVDDRTVRV
NGEVNIEDVN DALEIDLPEG EEFETIAGFV FNLAGRLVEP GETFTYDGVD LTVETVDTTR
IKRVRIVEPE PSATDDSGIS ATS