Gene Huta_0940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0940 
Symbol 
ID8383213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp905886 
End bp907112 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content65% 
IMG OID644972004 
ProductPBS lyase HEAT domain protein repeat-containing protein 
Protein accessionYP_003129856 
Protein GI257052023 
COG category[C] Energy production and conversion 
COG ID[COG1413] FOG: HEAT repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCTGT TCGAACTCCA GCGCGAGGGC GACGTCCAGG AACTCATCAG GCTCCTCCGG 
GAGAGCGACA ACGAGGAGGT TCGGATGCGC ACCGCGTCGA TGCTCGGGGA GTTCGACGAG
CATGCGGATC GGCGGGATGT CGTGAGTGCA CTCGTCGAGG CTGCTGAAAC TGACGATTCG
GGCATGGTGA CTGCCGCGGC AGTCGACTCC CTCGACGAAC TCGGCCAAGA TGCCATCGAG
GCACTCATCG AGTCGATGGC GGGGGTCGAC TTCGAGGGTG GCGAAGCGGA CTGGGTGCGC
GCGAAGGCCT TCGTCAACGC ACTCGAAGCC GACGTGCCCG AACTCCGGAT GGCCGCCGCC
AACGGCCTCG GGCAGTTCGG TGACACCGAT GCGATCGAAC CACTTGTCGG GCGGTTTACC
GATCCTGATC CCCGAGTGCG TGCGCGGGCT GCCAGGGCGT GCGGGTCGAT CGGCGACCCG
CGGGCGACGG AGCCGCTTGA ATCCCTGTTG ACGGACGACG CCGGCGTCGT CCGGCGGGAA
GCCGCCGAGG CACTCGGACA GATCGGGAAC CGACAGGCAC TCCAGGCCCT GCTCGACCTC
TACGACGATC CAGCGGAGCG TGTCCGCCGG ATCGCGGTCA ACGCTTTCGG TAACTTCGAT
AACGCCGCGC CCGTCGATGC GCTGGTCAAT GCACTCGGTG ACGATGCGGC GACCGTTCGG
CGGACGGCCG TCTATTCGAT CATTCAATTG CTCGCGAACG TCCCCACACA GAAGAGTCAC
GAGATCCGCG AGACGATCGT CGACCGGTTG AGCGAGACTG ACGACGACAG CGTCGTAGCT
CCACTGGTCG AGATTTTGGA GGAGGGGACC CAGGTCGCAC AGCGCCGGAA CACGGCGTGG
CTACTCGGCC GGGTCGTCAC CGAACCTGAC GATCGGGTCC TCGACGCGAT GATCGCGGCG
CTCGACGACG ACGATCAGAT GACCTCACAG TTCGCGGCGA CCAGTCTGAC TGAACTTGAA
GACGTCAGCG TCGAGCGCCG TCTTCTCGAC GTCGTCACGG ACGATGAGCG CCCGACACAG
GCCCGGACCC AGGCGATCTT TGCATTGGGG AAGGTCGGTG GCGACCGGTC GCGAGAGACG
CTGGACACGC TGATCGACGA GACCGACAAC GAAGAGATAC GCAAACGCGC GTTCTCGGCC
ATCTCAAAGC TCGGAGGGCG ACTGTGA
 
Protein sequence
MSLFELQREG DVQELIRLLR ESDNEEVRMR TASMLGEFDE HADRRDVVSA LVEAAETDDS 
GMVTAAAVDS LDELGQDAIE ALIESMAGVD FEGGEADWVR AKAFVNALEA DVPELRMAAA
NGLGQFGDTD AIEPLVGRFT DPDPRVRARA ARACGSIGDP RATEPLESLL TDDAGVVRRE
AAEALGQIGN RQALQALLDL YDDPAERVRR IAVNAFGNFD NAAPVDALVN ALGDDAATVR
RTAVYSIIQL LANVPTQKSH EIRETIVDRL SETDDDSVVA PLVEILEEGT QVAQRRNTAW
LLGRVVTEPD DRVLDAMIAA LDDDDQMTSQ FAATSLTELE DVSVERRLLD VVTDDERPTQ
ARTQAIFALG KVGGDRSRET LDTLIDETDN EEIRKRAFSA ISKLGGRL