Gene Htur_5224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_5224 
Symbol 
ID8745772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013747 
Strand
Start bp120273 
End bp121388 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content68% 
IMG OID646515581 
Producthypothetical protein 
Protein accessionYP_003406528 
Protein GI284176251 
COG category 
COG ID 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0997933 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCGTC GGGCCTTTCT CCGTGGTACG GCCGTCGCCG GCACCGCCGC TATCGCCGGC 
TGTCTCGAGC GTCTGGGTTT CGAAGAGGAG TCGGCCTGGG ACAACCCGCC GCTCGTGGAG
GATCGCCCCG ACGCGGTCTA TCTGCCCGCG GGCAAAGAGG AGATGGGCCA CTACGGTCGC
GCGAGCGACG GCGAGTACGC CGTCGAACTC TCCTATACGA TCCCCCACCG GTTCTGGACT
GTCTCCGGCG ACACCCAGCG GGTCGACGTG GACACCGACG ACAGCATGCA CCTCATGCTG
ACCGTCTGGG ACGAGGAAAC GGACACCATC CTTCCGGTGA ATACCGACCT CGAACTCCAG
CGTGAGGACG GCGAGGTCGT CGAGCAGCTG ACGCCGTGGT CGATGCTCTC CCAGCGGATG
GGGACCCACT ACGGCGACAA CGTCACGCTC CCCGAAGAAG GCGCCTACAC CGCCCGCGTC
CGGGTCGGTC CGGTCACGAC CGACCGAACC GGCGCGTTCG AGGGTCGGTT CGAGGAGACG
AGCACGCTCG AGGTCGAGTT CGAGTTCGAG CGCTCGGACA TCCACGACCT CGAGTTCAAC
ATGGTCGACG AGGAGCGGCG GGGCGCCCGC GAGGCCCACA CACTGATGGA CCCCAGTGGA
CACGACGGGC ACGGGGATGG CGGGCACGGC GACGGCGAAC CCGGACACGC CCCGACATCC
GACGGGCCGC CGGTCGCGGA GCTTCCCGGC GACCGGCTCG GAACCGAACG CAGCGCGGAC
GCGAAGATCA CCGCGATCCG GGCGAGCGCC GAACGGGTGG CCGGCGACGG CGACTATCTC
GTCGTCTGTC CCCGAACGCC GTACAACGAC GTGAGCCTCC CGTCCGCGAC GCTGCGCGCT
ACGGTCGAGC GCGACGGAAC GACCGTCCTC GAGGGTGAGT CGCTCGCAGA GACGATCGAT
CCCGAGTTTG GCCACCACTA CGGACTCGAT CTCGAGGCCC TCGAGAGCGG TGACGAACTC
ACCGTCGCCG TCGACCGACC GCCGCAGGTG GCGCGCCACG ACGGCTACGA AACCGCGTTT
TTCGACTTCG ACGACGTGCG GTATACCGTG TCCTGA
 
Protein sequence
MNRRAFLRGT AVAGTAAIAG CLERLGFEEE SAWDNPPLVE DRPDAVYLPA GKEEMGHYGR 
ASDGEYAVEL SYTIPHRFWT VSGDTQRVDV DTDDSMHLML TVWDEETDTI LPVNTDLELQ
REDGEVVEQL TPWSMLSQRM GTHYGDNVTL PEEGAYTARV RVGPVTTDRT GAFEGRFEET
STLEVEFEFE RSDIHDLEFN MVDEERRGAR EAHTLMDPSG HDGHGDGGHG DGEPGHAPTS
DGPPVAELPG DRLGTERSAD AKITAIRASA ERVAGDGDYL VVCPRTPYND VSLPSATLRA
TVERDGTTVL EGESLAETID PEFGHHYGLD LEALESGDEL TVAVDRPPQV ARHDGYETAF
FDFDDVRYTV S