Gene Huta_2394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2394 
Symbol 
ID8384693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2445728 
End bp2447683 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content66% 
IMG OID644973467 
ProductFibronectin type III domain protein 
Protein accessionYP_003131293 
Protein GI257053460 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.253992 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAAAG ACCGTTCGAC GGAACGAACC GAAACTGACG AATCGACGAC TGAACGAGAC 
GAATTCACGC AGGAGGGCCC CGAGACGTAT CGGGCAGGGA TATCCAGACG GTCGTTTCTG
CAGACGACCG CGGCTGCGGG ACTGGTCGGC CTGGGCGTCG GGAGTGGCGC TGTCGGCTCA
GCGGCTGCAG CCGGTATTCC AACGCCGTGG CTCGAAGTCG ATGGCAATCT CCTGCGGGAT
CCCGACGGCA ACAAGGTGAT CCTGCGGGGT GTGAACGTTA TCGACCCCGC GCGGGCGGCC
AAGGAGTGGC GCAAGAACAT CGAGCCGCTG ATCGAGTTGG CGACCGATCC GGGCGAGGGC
TGGCACGCCC ACGTCATCCG GCTCCCGATG CAGCCTCAGG ACATCGGCGA TCATGGTCCG
GGGACGGCGG CCCCGACGCC GGGATTCACG CAGGACGAAC TCCAGAATTA TCTCGCGGAG
TACGTCGATC CGGCGGTCGA CGCGGCCGAG GACGTCGGCG CGTACATCAT GCTGGATTAC
CATCGCCACT ATCCGGAGGG GCCCGACTGG GACTCGCCGG AACTCGACGA GGAGATCCGG
TTGTTCTGGA ACGAGGTCGC CCCGCGCTAC AGCGATCGTT CCCACGTCAT CTACGAACTG
TACAACGAAC CGAACACGCC GTATCCGGGG GCCGGCGATC CGACCGACGA CGTTGGCGTC
ACGGACGCTC GTGCCGAGGA GAACTACCTC TACTGGCGCG AGACGGCCCA GCCGTGGGTC
GATCTCATTC GGGAGCACGC GTCCCGGAAC CTGATCGTCA TCGGGTCGCC GCGCTGGAGC
CAGTTCACCT ACTGGGCGGG CGAACACGAG TTCGAGGGCG ACAATCTCGC GTATGCGGGC
CACGTCTACG CCCACGAGAA CCTCCGGCCG CTATCGACGT ACTTCGGCGA GCCCTCAGAG
GAGGTTCCGG TGTTCATGAG CGAGTTCGGG TACGGGACCG AGGGCTCGCC CTACCTCGTC
GGGACCAACG AAGTCGAGGG CCAGCAGTTC CTCGACCTCT TCGACGCCCA CGACATCCAC
TGGCAGGCCT GGTGTTTCGA CCACACGTGG TCGCCCGGCA TGTTGAATCG GGATTACGAG
GTCGACAGTC CCCACGGTCG GCTGTTCAAG GAGCGACTTC GCGAGAAGCG CAACGACGAC
CTGCCGGCGA GCGCCGGCGG TGGCGACGAG ACGCCGCCCT CGGCCCCGTC GAACCTCGCC
GTGACCGAGA CGGGCAGCGA GAGTGTCGGT CTGGCGTGGG ACGCCGCGAG TGATTCCGGC
GACTCCGGCC TCGCCACTTA TGCCGTCTAC CTCGACGGCG CGCTGGATCA TCGGGTCACT
GCCGGGACGA CCGCTACAGA AGTCAGCGGC CTGCTGCCGG AGACGACCTA CGAGTTCGCC
GTCAGTGCCG TCGACGGCGC GGGCAACGAG TCCGACAGGA GCGGGGTCGT CACCGCCACC
ACCGATCCGC CGGCCAGCGA GCGCCTGGTC CTCAACGACT TCGACGGCGA CCCGGCGTGG
GCTGACAGTC GCAACGAACT CGGGAACTGG TGCGGTGCCG GTTCCTTCGC AAACGACGAC
GGCGAGGTCG TGGACGGCGC ACTCGTCCTC GAATACGACG GCGGCTGGCT GCAGTCGTAC
GTTCGCCAGG ATGTCTCGTC GTTTTCGACG CTGAATCTGC AGGTTCGCGG TGCCGACGGT
GGCGAGGAGT CGGCCTTCGC GGTGGAACTC GGCGGCGGGG GCGGCGTGCT CGCCGAAATC
ACCGACGACA CGATCGGCAC GTCGTTCTCG ACAGTATCGA TCGACATGGC CGCCGCCGGG
ATGGACGGGG CGAGTCCCGG CGCGGTATAT CTCGACTTCT GGTCGGGTGA CGGAACGAGT
GGAACGATCG AGATCGACGA AATCTGGTTC GAATAG
 
Protein sequence
MTKDRSTERT ETDESTTERD EFTQEGPETY RAGISRRSFL QTTAAAGLVG LGVGSGAVGS 
AAAAGIPTPW LEVDGNLLRD PDGNKVILRG VNVIDPARAA KEWRKNIEPL IELATDPGEG
WHAHVIRLPM QPQDIGDHGP GTAAPTPGFT QDELQNYLAE YVDPAVDAAE DVGAYIMLDY
HRHYPEGPDW DSPELDEEIR LFWNEVAPRY SDRSHVIYEL YNEPNTPYPG AGDPTDDVGV
TDARAEENYL YWRETAQPWV DLIREHASRN LIVIGSPRWS QFTYWAGEHE FEGDNLAYAG
HVYAHENLRP LSTYFGEPSE EVPVFMSEFG YGTEGSPYLV GTNEVEGQQF LDLFDAHDIH
WQAWCFDHTW SPGMLNRDYE VDSPHGRLFK ERLREKRNDD LPASAGGGDE TPPSAPSNLA
VTETGSESVG LAWDAASDSG DSGLATYAVY LDGALDHRVT AGTTATEVSG LLPETTYEFA
VSAVDGAGNE SDRSGVVTAT TDPPASERLV LNDFDGDPAW ADSRNELGNW CGAGSFANDD
GEVVDGALVL EYDGGWLQSY VRQDVSSFST LNLQVRGADG GEESAFAVEL GGGGGVLAEI
TDDTIGTSFS TVSIDMAAAG MDGASPGAVY LDFWSGDGTS GTIEIDEIWF E