Gene Huta_1548 details

Gene Information

Locus tag: Huta_1548
Symbol:
ID: 8383827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 1522100
End bp: 1523506
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 60%
IMG OID: 644972610
Product: protein of unknown function DUF21
Protein accession: YP_003130456
Protein GI: 257052623
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
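
The start and end coordinates, gene length, and protein length in this record are mutually consistent. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 1522100
end_bp = 1523506

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon,
# which does not contribute an amino acid to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # matches the listed Gene Length
print(protein_length, "aa")   # matches the listed Protein Length
```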


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGATAT CTGTGCTTAC AGCAGTGGCA GAGGCCAGTG CGTACCTGTA TACCGTACCG 
ATCGTCGGCA TCGAGCTTTC AAAAACAGGT GTCACGGCGA TCGGCATCCT CATGATCCTG
TTTCTCATCG TCGGATCGGG CTTTTTCTCC TCCTCCGAGA TCGCGATGTT CTCGCTGGGG
ACCCACCGGA TCGACCCGAT GGTCGAACAG GGGCTCCGTG GGGCGAAAGC GATCAAGTCA
CTCAAGGAGG ACCCCCACCG GTTGCTCGTG ACGATCCTGG TCGGGAACAA CATGGTCAAC
ATCACGATGT CCTCGATCTC GACGACCATC GTGGGTTTCT ACTTCGATCC GGGGACGGCA
GTCCTCGTCT CGTCGTTCGG GATCACGTCA CTGGTGTTGA TATTCGGTGA GACGGCACCC
AAATCCTACG CCGTCGACAA CACCGAGTTA CATGCACGCC GCGTGGCTCC AGTACTGCAG
TTCGTCGAGA AACTGCTGTG GCCGCTGATC ACCCTCTTTC ACTACGTGAC CCAGTTCGTC
AACAAACTCA CGGGCGGCGG GCCGGCCATC GAGTCGTCGT ACCTCAGCCG GTCGGAGATC
CGGGAGATGA TCCAGACCGG CGAGCGCGAG GGAGTCCTCG ACGAGGAAGA GCGACAGATG
CTCCAGCGGA CCCTCCGGTT CAACCGGACG ATCGCCAAGG AGGTCATGAC GCCGCGCCTG
GACATGGACG CCATCTCGGC CGACTCGTCG GTCGAAGAGG CGATCGCGGA GTGTGTCCAC
AGCGGCCACA CCCGGCTGCC GGTCTACGAG GGTGGTCTCG ACAACGTCAT CGGGGTCGTC
AACATCCGTG ATCTCGTCCG TGACGCCCAG TACGGCGGGA CAGACGATGT CGAGCTTCAA
GACCTCATCG AGCCGACGCT GCACGTCCCC GAAAGCAAGA ACGTCGACGA TCTCCTGACG
GAGATGCGGA GCGAACGCCT CCACATGGTG ATCGTCATCG ACGAGTTCGG CACCACAGAG
GGACTCGTCA CCATGGAGGA CCTCACCGAG GAGATCGTCG GCGAGATCCT CGAAGGCGAA
GAGGAACACC CGATCGAATT CGTCAACGAC GACACCGTGA CGGTCAAAGG GGAAGTCAAC
ATCGAGGAAG TCAACGAAGC GCTGTCGATC GACCTCCCGG AGGGCGAGGA GTTCGAGACC
ATCGCCGGGT TCATCTTCAA CCGAGCGGGT CGGCTCGTCG AGGAAGGCGA ATCCATCGAG
TACGAGGGGA TTCAGATCCG TGTCGAGCAA GTCGAGAACA CCCGGATCAT GAAAGCCCGG
ATCACACGAC CGGAAGAGGG AGCGACACTC GAATCGGAAG CCGAAGGCGG GGACGACGAC
CACGAGAGTG ACACGAACGA CGCCTGA
 
Protein sequence
MGISVLTAVA EASAYLYTVP IVGIELSKTG VTAIGILMIL FLIVGSGFFS SSEIAMFSLG 
THRIDPMVEQ GLRGAKAIKS LKEDPHRLLV TILVGNNMVN ITMSSISTTI VGFYFDPGTA
VLVSSFGITS LVLIFGETAP KSYAVDNTEL HARRVAPVLQ FVEKLLWPLI TLFHYVTQFV
NKLTGGGPAI ESSYLSRSEI REMIQTGERE GVLDEEERQM LQRTLRFNRT IAKEVMTPRL
DMDAISADSS VEEAIAECVH SGHTRLPVYE GGLDNVIGVV NIRDLVRDAQ YGGTDDVELQ
DLIEPTLHVP ESKNVDDLLT EMRSERLHMV IVIDEFGTTE GLVTMEDLTE EIVGEILEGE
EEHPIEFVND DTVTVKGEVN IEEVNEALSI DLPEGEEFET IAGFIFNRAG RLVEEGESIE
YEGIQIRVEQ VENTRIMKAR ITRPEEGATL ESEAEGGDDD HESDTNDA
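
The protein sequence can be derived from the gene sequence, and the GC content recomputed, with a short script. The sketch below is illustrative only: the helper names `clean`, `translate`, and `gc_content` are hypothetical. It assumes the gene sequence is given 5' to 3' on the coding strand, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the two tables differ in which start codons they allow).

```python
# Codon table laid out in the conventional T, C, A, G order.
# Assumption: amino-acid assignments of translation table 11 equal the
# standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# The first 21 bases of the gene sequence give the first 7 residues.
print(translate("ATGGGGATAT CTGTGCTTAC A"))  # MGISVLT
```

Passing the full gene sequence to `translate` and `gc_content` allows the listed protein sequence and GC content to be checked against the record.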