Gene Huta_1400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1400 
Symbol 
ID8383679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1371611 
End bp1373164 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content68% 
IMG OID644972463 
Productglycoside hydrolase family 43 
Protein accessionYP_003130309 
Protein GI257052476 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.587173 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACG GCCGCGAGCC CTCGGGTATG CAGTATTCGA ATCCAGTGCT TCCGGGCATG 
CATCCCGATC CGACGATCGC GCGCGCCGGC GAGGACTTCT ACCTCGCCAT CTCCTCGTTC
GAGTACTTCC CGGGCGTCCC GCTGTTTCAC AGCACGGACC TGGTCTCCTG GGAGCGGGTC
GGCCACTGTC TCACCCGCGA CAGCCAGCTT GACCTCCGCG GTCGCGAGGC TTCCGACGGG
ATCTACGCCC CGACCTTGCG CCACCACGAC GGCACCTTCT ACATGGTCAC GACCGACGTC
GGCGGCGACA ACGGTGGCCA CTTCATCGTG ACCGCCGACG ATCCCGCCGG CGAGTGGTCC
GATCCGCTGT ACGTCGACGC GGGCGGGATC GATCCCGATC TCTTCTTCGA CGACGATGGG
ACGGCCTACT TCCAGTACAC CGACGGCGAG TCCTTGCCGG AGTACCGGGT CCGACAGGCC
GAGATCGACC TCGACACCGG TGAACTCGGC GACGTCCGCC AGCTCTGGCG GGGCATCGAA
GGCGGGTTCG CCGAAGCGCC CCACATCTAC GAACGCGACG GGACCTACTA CCTCATCACC
GCCGAGGGCG GGACCCACAC GGATCACATG GTCACCGTCG GCCGGAGCGA CGACCCGACG
GGCCCGTTCG AACCCCATCC CGACAACCCC GTGCTCTCCC ATCGCGGTCG GCCGATGCAC
CCGCTCAGCG CGATGGGCCA CGCCGACATG GTCCAGGCCC CGGACGGCTC GTGGTGGATG
GTGTTCCTCG GGATCCGCCA GTACGGCCCG AACCCTGGCG TCCACCACCT GGGCCGGGAG
ACGTTCCTCG CACCCGTCAC CTGGGAGGAC GGCTGGCCGA TCGTCAACGA CGGCGAGCCG
ATCGACCCCG AGATGACCGT CGAGTCGCTG CCCGGCGATT CACCCGGCGG GCTGTCGAGT
CCGGCCCGAC CGTTCGAGAC GACCTTCGAC GGCGAACTCG ACGACAGCTG GCAGTTCCGA
CGGAACCCGG ACCCGGCGAC GTATTCACTT TCGGACGACG GACTGTCACT CGTGGGCAAA
ACCGACAGCC TCGACGAGTT GGATGCGACG TTCGTCGGCC GCCCGCAGTC CCACTTCGAT
TGCCGAGCCG AGATCGATCT CGGGTTCGAT CCCGACGACG GCGAGGAGGC CGGTCTCGCG
CTCGTGATGA ACGAGTCCCA CCACTACGAG ATCGGCGTGG GTCGTGAGGG TGGCGAGACG
GTCGCCCGCG TCCGACTCCG GATCGGCGAG GTTGCTGACG AGGTCGCAGC TGTCCCCGTC
GGGGGCGAGG ACCACCGGCT GGTCGTCGAC GCGACGACTG AGGAATACAC GTTCCGCTAC
GCCGACGGCG ACGGCGAACC GACCGAACTC GCCACGGCGG CCACACGGTA CCTGTCGACG
GAAGTCGCGG GCGGGTTCAC CGGCGTCTAT ATCGGTCCGT ACGCACTTGG AGCAGGAACA
GAAACGGCAA CGCCGGCCCA GGTCCAGCGC TTCGTATACG AGCCGGCGGA GTGA
 
Protein sequence
MSDGREPSGM QYSNPVLPGM HPDPTIARAG EDFYLAISSF EYFPGVPLFH STDLVSWERV 
GHCLTRDSQL DLRGREASDG IYAPTLRHHD GTFYMVTTDV GGDNGGHFIV TADDPAGEWS
DPLYVDAGGI DPDLFFDDDG TAYFQYTDGE SLPEYRVRQA EIDLDTGELG DVRQLWRGIE
GGFAEAPHIY ERDGTYYLIT AEGGTHTDHM VTVGRSDDPT GPFEPHPDNP VLSHRGRPMH
PLSAMGHADM VQAPDGSWWM VFLGIRQYGP NPGVHHLGRE TFLAPVTWED GWPIVNDGEP
IDPEMTVESL PGDSPGGLSS PARPFETTFD GELDDSWQFR RNPDPATYSL SDDGLSLVGK
TDSLDELDAT FVGRPQSHFD CRAEIDLGFD PDDGEEAGLA LVMNESHHYE IGVGREGGET
VARVRLRIGE VADEVAAVPV GGEDHRLVVD ATTEEYTFRY ADGDGEPTEL ATAATRYLST
EVAGGFTGVY IGPYALGAGT ETATPAQVQR FVYEPAE