Gene Huta_1972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1972 
Symbol 
ID8384266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1994926 
End bp1996206 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content67% 
IMG OID644973042 
ProductProtein of unknown function DUF650 
Protein accessionYP_003130873 
Protein GI257053040 
COG category[S] Function unknown 
COG ID[COG1602] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.177811 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGCTTG AGGACTACAT CGAGGGATTC GAGCGCGACG AGGCCGCCGA GAAGCGACGC 
CTCGCCGAGG AGAAGTCCTA CGCGATCACC GATCACCTCG AAGACGTCGA GCGCCAGCTC
GAAGAAACCC TGCAGGGTGA CGCGCTCTTT GGCTCGACCG CGCCCGAGAT CTTCGTCGGG
CGGTCGGGCT ACCCGAACGT CTCCTCCGGC GTGCTCTCGC CGGTCGCCGA CGAGGGCGAC
CCCACGGACT TTGCGACCAG CGGCCAGTGG TACGCCAACG GCCTGGGGAT CGAGGACGTC
CTCCAGCGTC GGACGGGCCT GCTCAACTCC CAGCGCTCGG CGAAGGTGGA CGTGAATGAC
GTCTGGGATG GGTTCGTCGG CACCCAGCGT GAAGTCGCCA TCGCCGACCG GCCCGTCGAC
GTCGAGATCG GGCTGGACGG GACGCCCGAT TTCGACCTGA CCACTGACGA CATCTCCACG
CCCACGGGCC CGCGGGCACG GGCGACCGAA GCCACACTCG CCGAGAATCC CCACGTCCCC
CGTCCCGTCG AGAAGACCCT CGAAGACGAC GACTGGCGCG CCGAGGGCGC GATGACCTAT
CTCTACCGGA AGGGCTTCGA CGTCTACGAC GTCAACACCA TCCTCTCGGC GGGCGCGCTG
GGGCAAGGAG CCAACCGGCG ACTCGTCCCG ACACGGTGGT CGATCACCGC CGTCGACGAC
ACGGTCGGGC AGTATCTGCA TGGCCAAATC CGCAACGCGA ACACCATCGA CGAGACCCAG
GTCTGGTACA ACGAGTACAT GGGCAACCGC TACTGGATCA TCCTCACGCC CGGCGACTGG
GAGTTCGAGC TCGTCGAGAT GAAAGCCCCC GAGAGCGTCT GGAATCCCCT CGGGGAGACC
CACTACCTCG CCAGCGCCCA CGAGGGCTAC GAAGGGCGGA CGAGCTACGT CGAGGAGACC
GCCGGGGCCT ATTACGCATC CCGACTCGGC GTCCTCGAAC ACCTCGTCGA CATCGATCGA
CAGGCCAAGT GTCTCGTGCT CCGGGAGGTG ACCGATGACT ACTGGGCCCC GGTCGGCGTC
TGGCAGGTCC GGGAAGGAGT CCGCAACGCC TTCGAGGATC CGGAGGGTCT GCCCGACGCG
CTTTCGGGCC GATACGGCGA GGCCGGGAGT TTCCGCGATG CAGTGACCAG CGTGACCGAG
CAACTGCCGG TGTCGCTGAC TGCGCTCCGT CGGAAGTCCG AGATGGTCGC CGGCCTCCAG
GCGACGCTGT CGGACTTCTG A
 
Protein sequence
MRLEDYIEGF ERDEAAEKRR LAEEKSYAIT DHLEDVERQL EETLQGDALF GSTAPEIFVG 
RSGYPNVSSG VLSPVADEGD PTDFATSGQW YANGLGIEDV LQRRTGLLNS QRSAKVDVND
VWDGFVGTQR EVAIADRPVD VEIGLDGTPD FDLTTDDIST PTGPRARATE ATLAENPHVP
RPVEKTLEDD DWRAEGAMTY LYRKGFDVYD VNTILSAGAL GQGANRRLVP TRWSITAVDD
TVGQYLHGQI RNANTIDETQ VWYNEYMGNR YWIILTPGDW EFELVEMKAP ESVWNPLGET
HYLASAHEGY EGRTSYVEET AGAYYASRLG VLEHLVDIDR QAKCLVLREV TDDYWAPVGV
WQVREGVRNA FEDPEGLPDA LSGRYGEAGS FRDAVTSVTE QLPVSLTALR RKSEMVAGLQ
ATLSDF