Gene Huta_1790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1790 
Symbol 
ID8384077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1796164 
End bp1797435 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content69% 
IMG OID644972857 
Productdomain of unknown function DUF1743 
Protein accessionYP_003130695 
Protein GI257052862 
COG category[R] General function prediction only 
COG ID[COG1571] Predicted DNA-binding protein containing a Zn-ribbon domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACAGTCA TCGGCATCGA CGACACGGAC TCCCGGACGG CGGGCATGTG TACGACCTAC 
CTCGCCGCCC GGATCGCCGA CCGGCTGGCG GCGACGGATG ACTGGGTCGT CGACCGGCGG
CTGCTCGTCC GACTGAACCC TGCCGTCGAG TACAAAACCC GGGGCAACGC CGCGCTGGCG
ATCCACACCG ACGCCCCCGT CGATGCCGTC CGAGAAATCG TCAGCGAGGA GCTCCCGATC
GCCGAGACTG ACGATCCACG CACGAACCCG GGCGTTGTCC TCGCGAGTGA GTCGGCGGCT
GAGATCCCGG AAGCCGTTGG CGACTTCGCT CGCGATGCCG TCCGGGATTT CCACGACGTC
GCCGACGCCC GCGCCCTGAT CGACCGGCTT GAGTACGACA CCCTGGAGGC GGGCAACGGT
CGCGGGTTGA TCGGCGCGCT CGCGGCGCTC GGGGCCTGGC GGGCGTTCGA GGACTGGACG
TACGAGTACA TCTCCTATCG CGAGCCACCG CGCCGCGGGA CCCCTCGCGA GGTCGGCCCC
GAGTCTGTCT TCCGGGCCGC CGATGCCGGC TACCCGGACG CTTGGGACAC TGTCGATCAC
GTCGAAGACG AACTTGTCTG CGTTCCCCAC GCGCCGGGGC CGATCCTCCA CGGTATCCGC
GGCGACGACC CCGATGTCGT CAGAGGAGTC GCTGCCGACA TCGAGAGCGA ACCGATCGAA
CGAACCGCCC TGTTCGTCAC CAACCAGGGG ACTGACGCCC ACCTACGGCA GGGGACGATC
GGGACACTCC GGGACGGCCG GGCCTACCGG GTGACCGGCG TCGTCGACGC CTCGCCCGAA
ACCCGCGAAG GCGGCCACGT CTTTCTCACC ATCGAGGGAG ACGATGGCCA TGCGTTGCCC
TGTGCCGCGT TCGAGCCGAC CAAGCGCTTC CGCGACCGCG TTCGTTCGCT CCGGGTGGGT
GATCGCGTCA CCGTCTGTGG CGAGGTCAGC GATGGCACGC TCAAACTCGA GAAGTTCGCC
GTCCGGGATC TCGTTCGGAC CGAGCGTGTC ACGCCGACCT GTCCGGCCTG CGGTCGGACG
ATGGAAAGCG CCGGACGGGG CCAGGGGTAT CGCTGCCGGG ACTGTTCGAC GGCTGTCGAC
GGGAAAGCCG AGCGGGAGAT CGACCGCGAT CTAGAGGTAG GGTGGTACGA GGTCCCGCCG
TGTGCCCGCC GACACATCGC CAAACCGCTC GTGCGGGGCG GATTCGACGC GCCGGTCCAC
CCCGAGCGGT GA
 
Protein sequence
MTVIGIDDTD SRTAGMCTTY LAARIADRLA ATDDWVVDRR LLVRLNPAVE YKTRGNAALA 
IHTDAPVDAV REIVSEELPI AETDDPRTNP GVVLASESAA EIPEAVGDFA RDAVRDFHDV
ADARALIDRL EYDTLEAGNG RGLIGALAAL GAWRAFEDWT YEYISYREPP RRGTPREVGP
ESVFRAADAG YPDAWDTVDH VEDELVCVPH APGPILHGIR GDDPDVVRGV AADIESEPIE
RTALFVTNQG TDAHLRQGTI GTLRDGRAYR VTGVVDASPE TREGGHVFLT IEGDDGHALP
CAAFEPTKRF RDRVRSLRVG DRVTVCGEVS DGTLKLEKFA VRDLVRTERV TPTCPACGRT
MESAGRGQGY RCRDCSTAVD GKAEREIDRD LEVGWYEVPP CARRHIAKPL VRGGFDAPVH
PER