Gene Huta_2022 details


Gene Information

Locus tag: Huta_2022
Symbol:
ID: 8384316
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 2041717
End bp: 2042655
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 66%
IMG OID: 644973092
Product: protein of unknown function DUF6 transmembrane
Protein accession: YP_003130923
Protein GI: 257053090
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
              [R] General function prediction only
COG ID: [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily
TIGRFAM ID:
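
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the record above; the coordinate convention
# (1-based, inclusive) is an assumption.
start_bp = 2041717
end_bp = 2042655

# Inclusive coordinates: both endpoints belong to the gene.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop
# codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 939 bp
print(protein_length, "aa")  # 312 aa
```

Both values agree with the Gene Length and Protein Length fields listed in the record.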


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCCCAGG ATCGAACGCC GACTGTGCGC TATCGAAACG CCACCGCGTT TCTTGCTCTC 
GGCGCGATCT GGGGGAGCGC GTTCGTCGCG ATCAAGGCCG GCCTGTCGGC GTTCCCGCCG
GTTCTGTTCG CCGCACTCCG CTACGATGTC GCCGGCGTCA TCGTGCTAGG CTACGCCGCG
GTCGTCACTG ACCCGCTGCC CGAGAGCCGT CGCGACCTGG CGGCGATCAT CGTCGGGTCG
ACTCTCCTCA TCGCGGGATA TCACGCCCTG CTGTTCGTCG GCGAACTCGA AACCACGAGC
GCGACCGCAG CGGTCATCGT GAGTCTCTCG CCGGTACTGA CGGCCGGGTT CGCCCGGCTT
GCCCTCCCGG GGGACCGTCT TTCGGTTGCC GGCGTCGCCG GACTCGCCCT GGGGTTCGCT
GGCGTCGTCG TCATCGCCCA GCCTGATCCC GCTCGACTCC TCTCCAGTGA CGTCATCGGG
CCGCTGCTCG TCTTTGGCGC TGCGTGCGCC TTTGCCCTGG GAAGTGTGCT CACCCGCTGG
CTCGACGCTG AACTGTCGAT CGAAGCCATG GAAGGGTGGT CGATGGTCGG CGGAGCCGTG
CTGATGCACG TCCTCAGTCT CGCGCTCGGG GAGTCACCGG CCGCAGTCGA GTGGACGCCG
ACTGCCCTGC TTTCGCTCGG CTATCTCTCG CTGGTCGCGA GTGCGCTGGG CTTTCTTCTC
TATTTCGCCC TGCTGGATCG ACTCGGCCCG GTCGAGATCA ACCTCGTCTC CTACGTTGCG
CCCGTCTTCG CCGCGCTGAC TGGCTTTCTC CTGCTGGGGG AACGCATCGA CGTCGCGACG
GCTTCCGGGT TCGTCGTCAT TCTGGTTGGA TTTGTCCTGC TAAAACGGGA TGCGATCCGT
GAGACGTATG TCGGTTGGCT GGCGGAAGCG CAACCGTAG
 
Protein sequence
MPQDRTPTVR YRNATAFLAL GAIWGSAFVA IKAGLSAFPP VLFAALRYDV AGVIVLGYAA 
VVTDPLPESR RDLAAIIVGS TLLIAGYHAL LFVGELETTS ATAAVIVSLS PVLTAGFARL
ALPGDRLSVA GVAGLALGFA GVVVIAQPDP ARLLSSDVIG PLLVFGAACA FALGSVLTRW
LDAELSIEAM EGWSMVGGAV LMHVLSLALG ESPAAVEWTP TALLSLGYLS LVASALGFLL
YFALLDRLGP VEINLVSYVA PVFAALTGFL LLGERIDVAT ASGFVVILVG FVLLKRDAIR
ETYVGWLAEA QP
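
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes NCBI translation table 11 (the value given in the record), whose codon-to-amino-acid assignments match the standard code, and it treats an alternative start codon such as GTG as methionine when it is the first codon. The function name `translate` and its parameter are hypothetical helpers written for this example, and only the first 30 and last 9 bases of the gene sequence are used.

```python
from itertools import product

# Codon table in the NCBI ordering (bases T, C, A, G; third base varies
# fastest). Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds, first_codon_is_start=True):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.replace(" ", "").upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and first_codon_is_start and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break     # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence listed above.
five_prime = "GTGCCCCAGG ATCGAACGCC GACTGTGCGC"
three_prime = "CAACCGTAG"

print(translate(five_prime))                               # MPQDRTPTVR
print(translate(three_prime, first_codon_is_start=False))  # QP
```

The two outputs correspond to the first ten and last two residues of the protein sequence in the record, with the initial GTG read as M and the internal GTG read as V.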