Gene Huta_2154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2154 
Symbol 
ID8384448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2206397 
End bp2207845 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content66% 
IMG OID644973223 
Productprotein of unknown function DUF58 
Protein accessionYP_003131054 
Protein GI257053221 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.690685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGGTGA CGCGACGGTT CTGGGCCGCG GTCGGGGCCG GTGGAGTCCT CTCCGTTCTC 
GGACTGGTCT TTGCCCGGCC GATCCTGGTT GTTGGTGCGG GGGGGATCTG GGGGCTCGTT
GTCGGCGCGC AACTCGTCTT TGTGCTTCGG CTCCTCGCAC TCGATCGATC GATGACGATC
ACCCAGACGC TCGACAGCCC GTTCGTCACC ACCCGCGGGA CCGTCCGGTG GTCACTCGAA
GCGACCCTCG CTCAACCAAC TCCACTCGAG GTGTCCATCG TGCCGACGTT TCCGGTGACT
CTTGACGTCT CGAAACAGCC ACGTGTGACG ATCCCACCGG GAGAGACGGG GGGGTGGGCG
GACGCCACGG TGACCGCGAC GGTTGCCGGG ACGACGACGA TCCCACGGCC GACCGTTGCA
GTCTCGGGCA CCTGGGGGCT GTTCGGCGAA CAGTTTCGCC GCGGACCGAC GACGGACCTC
ACCGTCGAAC CTCGACAGGT CGGTGACGTT CACGTCGGGC AGGGTGGCGA GTCGGTGATC
GCGACGCCGG GTGGCAGGCA TCGGACTGGC GAAATCGGAT CCGGTATCAG CCCGGCGGAA
GTCCGCGAGT ACGTTCCAGG GGACACTGTC AGTGACATCG ACTGGAAGGC AACGGCACGC
CTGGCGTCCC CGCATGTCCG GGAGTTCGAA GTCGAGACCG ACCGGCAGAC AGTCCTGCTT
TTCGACCGCC GGAGTCGGCT GGAAAGCGGC CCTGGCGGCG AATCGATGCT CGCATATCTC
CGGGAGGTGG CACTCAGGTT TGTCTCGGCC GCGGCGAACC TCGACGATCC GCTCGGGCTC
TATGCGATCG GTGACGGCGG TGTGACGGAC GAAGTCATGC CGCGGGCGGA CGAACGGACC
TACGAACACA TCCGGTCACG ACTCCAGACC GCGACGCCGA CCGGTGGGGC CGAGACGGCG
GATGCTTCGA CAGCGATGGA GGCCGATCTG ATCGGTCCCG GAACCGCCCG GCGGAACGCG
ACACGGCTTC GCGAGGCGGC GTCCCCGTAC GCCCGGTCTC TCCACCCGTT CTTCGCGGAC
GGCACCCGGT ACGTCCGGCA GATCGCCGAT CGACCGCTGT TTGGGGCTGG GAAAGCCTAC
CTGCCCCGGA TCGACGGCGA GATGTGGGTC GTTATATTCA CTGACGACCG TGACCGAACC
GAGGTGCGAG AGACAGTCAA ACTGGCCAGA GAGCACGGGA GACGGGTCGT GGTATTTCTT
GCGCCGGGGG CGCTGTTCGA GACGGAGCTG GTCGGTGATC TCGATGCTGC GTATACCTCC
TACCGTGGGT TCGAGGAATT CAGGCAGACA CTCGCTGGAC TCGATCGGGT CGAGGCGTAC
GAAGTTGGGC CAGGTGATCG CGTGGAAGCA CTGCTTTCAA CACGCCGCGA GCAAGGGGGA
CGACAATGA
 
Protein sequence
MEVTRRFWAA VGAGGVLSVL GLVFARPILV VGAGGIWGLV VGAQLVFVLR LLALDRSMTI 
TQTLDSPFVT TRGTVRWSLE ATLAQPTPLE VSIVPTFPVT LDVSKQPRVT IPPGETGGWA
DATVTATVAG TTTIPRPTVA VSGTWGLFGE QFRRGPTTDL TVEPRQVGDV HVGQGGESVI
ATPGGRHRTG EIGSGISPAE VREYVPGDTV SDIDWKATAR LASPHVREFE VETDRQTVLL
FDRRSRLESG PGGESMLAYL REVALRFVSA AANLDDPLGL YAIGDGGVTD EVMPRADERT
YEHIRSRLQT ATPTGGAETA DASTAMEADL IGPGTARRNA TRLREAASPY ARSLHPFFAD
GTRYVRQIAD RPLFGAGKAY LPRIDGEMWV VIFTDDRDRT EVRETVKLAR EHGRRVVVFL
APGALFETEL VGDLDAAYTS YRGFEEFRQT LAGLDRVEAY EVGPGDRVEA LLSTRREQGG
RQ