Gene Huta_0140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0140 
Symbol 
ID8382402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp136861 
End bp138270 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content62% 
IMG OID644971198 
Producthypothetical protein 
Protein accessionYP_003129061 
Protein GI257051228 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGGGG CGGGATTTAC GGCGGTCGGG GGCAGTCTCG CGGGCTGTAC AGACGATTCG 
ACACCGACAG AAGACACTGA GACGGCTGAG ACCACTGATA CCGAATCGAC GACGGAGAGT
CCCGAGACTG ACGAGCCACC GTCGACGACC GAGAGCGAAA CAACGACGAC GGAAGCGGAC
GAAACAACTA CCGAAAGCGA TGAGGAGGGT TTGCCGGCGT CAGTGGGCGT CGAGCAAGTC
GCGGGAGGAT TCACGGCACC TGTAAGCGTC ACGTTTCCAC CAGAGGACGG CGTGGTTCTC
GTTGCCGATC AGGTGGGAAC GATCCACGTC GTTTCGGACG GGAGCGTCCG GGACGAACCA
CTGATCGATA TCAGGGATCG GATGATCGAC GTCTCGGGCT ACGACGAGCG AGGGTTGCTC
GGCTTTGCGC TCCATCCGGA CTATCCCGCG GACGATCGCC TGTTCGTTCG GTACAGTGCG
CCACCGGGTG AGGCGACACC GGAGGACTAC TCTCATACGT TCGCGCTCTC CTCGTTTTCG
ATCGAGACGG ACACGCTCGC TGCTGACACT GACACCGAAC AGCGGATACT CGAATTTCCG
GAACCCCAGA CCAATCACAA CGCAGGCGCA CTCGAATTCG GGCCCGATGG ATATCTCTAC
ATCGCCGTCG GTGACGGCGG TGGGGCCGAC GACACTGGCA CTGGGCACGT TTCCGATTGG
TTTGCTGCAA ATTCCGGCGG GAATGGACAG GACGTCACCG AGAATCTTCT GGGCGGTGTG
CTCCGGATCG ACGTCACCGA AACCGGCGAG GAACCCTATG CGATCCCCGA GGACAATCCG
CTCGTGGGGA CGGATGGGCT CGACGAGTAT TACGCGTGGG GATTACGCAA CCCCTGGCGG
ATGGCGTTTC ACGACGGCGA GTTGTACGCG GCGGACGTCG GCCAGGGTCG ATTTGAGGAG
GTCAACCGCG TCACGAACGG GGGGAACTAC GGCTGGAACG TCCGGGAAGG GACACACTGT
TTCTCGCCCG GGTCGTCGAA TGGGTCTTGC CCGATCGAGA CACCGGATGG CGAACCCTTG
CTCGACCCGG TGATCGAGTA TCCTCACAGC GGCCAGCCGG TTAGCGGCGT CGCGGTGATC
GGGGGACAGT TCTATACGGG CGAGTCGATC CCTGGGCTCC GTGATCGGTA CGTCTTCGCC
GACTGGCAGG CCAACGGGAC ACTCTTTGTC GGCACCCCAA CGGAGGACGG GCTCTGGGAG
ACCACGACGA TTTCGGTGGA TGACAGCGAA TTTGCCCCGA TGATCCTGGC GTTCGGCCGC
GATCAGGCTG GCGAGCTTTA CGTCTGTGCC AGCGAGCGCG GACAGTTGGT CGGCTCGACG
GGTGCTGTCT ACCGACTGAC ATCGGCGTAA
 
Protein sequence
MLGAGFTAVG GSLAGCTDDS TPTEDTETAE TTDTESTTES PETDEPPSTT ESETTTTEAD 
ETTTESDEEG LPASVGVEQV AGGFTAPVSV TFPPEDGVVL VADQVGTIHV VSDGSVRDEP
LIDIRDRMID VSGYDERGLL GFALHPDYPA DDRLFVRYSA PPGEATPEDY SHTFALSSFS
IETDTLAADT DTEQRILEFP EPQTNHNAGA LEFGPDGYLY IAVGDGGGAD DTGTGHVSDW
FAANSGGNGQ DVTENLLGGV LRIDVTETGE EPYAIPEDNP LVGTDGLDEY YAWGLRNPWR
MAFHDGELYA ADVGQGRFEE VNRVTNGGNY GWNVREGTHC FSPGSSNGSC PIETPDGEPL
LDPVIEYPHS GQPVSGVAVI GGQFYTGESI PGLRDRYVFA DWQANGTLFV GTPTEDGLWE
TTTISVDDSE FAPMILAFGR DQAGELYVCA SERGQLVGST GAVYRLTSA