Gene Huta_2396 details


Gene Information

Locus tag: Huta_2396
Symbol:
ID: 8384695
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 2451361
End bp: 2454513
Gene Length: 3153 bp
Protein Length: 1050 aa
Translation table: 11
GC content: 68%
IMG OID: 644973469
Product: Fibronectin type III domain protein
Protein accession: YP_003131295
Protein GI: 257053462
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3934] Endo-beta-mannanase
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
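
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2451361, 2454513)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the listed gene length (3153 bp) and protein length (1050 aa).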


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACGAG ACAACCATAC TTACGCCGGC GGCGGTGCAG ACCGTCCCGA CGGACGGACG 
TATCGTCCGG ACGACCGACG GTCGGCACTC GCGGCGTCGC GACGGGACGT CCTCCGCACC
ATCGGTGCCG GGGCGCTGCT GGGCTCGATC GGGACGGCAC GCGTTCAGGC AGCACCCGGG
GACCGCGAAT TCGTCGCCAC CGACGGCCCG GAGTTCACCG TCGGCGGTGA GCCGATCTAC
TTCAGCGGGA CGAACAACTT CTGGGTGACC GATCCCTACA GCGATCGCTC GCGGATCGAC
GACGTCCTCG CGCTGTGTGC GGACCTGGAT CAGAATCTGC TGCGGACCTG GGCGTTCTGT
GCGGGCGAGG GCGGCCAGTG TCTCCAGCCC GAACCCGGCG TGTTCAACGA GGCGGCGCTG
CAGCACCTCG ATTATCTCGT CGCGAAGGCC GGCGAACACG GGGTACGACT CATCCTCTCG
CTGGTCAACA ACTGGGACGA CTACGGCGGG ATGGCCCAGT ACATCGAATG GGCGGACGGC
GCAAGCGAGC ACGGCGACTT CTACGTCAAC GAGGCGTGTC GTGAACTCTA CCGGACTCAT
GTCGAGACGC TCCTCACGCG GGAGAACTCG ATCACCGGCG TCGAGTACCG CAATGACCCC
GCGATCGCGA TGTGGGAACT CGCCAACGAA CCCCGACTGG AGGACGACGA CACGGAGACC
ATCGACGACC GGGAGGCTGC CCTCACCGAA TGGTTCGCCG ACATGTCCGG GTTCATCAAG
GATTTCGACG ACAACCACCT GGTGACGACC GGGCTGGAGG GCTTTTACAC CCGCGCGGAC
GGGCCAAACT GGATGTACGG TGACTGGACC GGCCAGAACT TCATCGCCCA CCACGAGATC
GACACGATCG ACGTGTGTTC GTTCCACCTC TATCCGTACC ACTGGCCCGG CATGGGACTG
GCGGGCCAAC TCGCCGAGGA CGACGTCGTC AGTGCCGTCG AGTGGATCCG CGAGCACGCC
GCCGACGCCC GCGAGACGCT CGAAAAGCCC GCGCTGCTTG GCGAGTTCAA CGTCAACGTC
CAGGAACACG ACCTGGCGAC GCGAAACGAT CGGTTGCGGG CGTGGTACGA CGCCCTCGAC
AGCCAGGACG CGGGCGCGGC GGCGATCTGG CAACTGGTGC TCGAGGACAC CGAGGACCAC
GACGGCTTCC AGGTCTACCG GAGCGAGTCC GGTGACATCC TCTCGGGGTA CGCATCGACG
ATCCGCGAGA AGTCCGGGCA CAGCGACGGG ACGCCGACGG CCGACGCGAC GGCACCTTCT
TCGCTCCGAA TCGGCGAGTC CGGCGATTTC AGCGGCACCT ACTCCTTCGA CCCGGACGGC
TCGATCGCCG CCTACGACTG GGCCTTCGAC GACGGCGCGA CGGCCACCGG CGAGCGGGTG
GCCCATCGCT TCGCCGAGAC CGGGTCCCAC GAGGCCGAAC TGACCGTTAC CGACGACAGC
GGCGCGACTG ACGCCGACAT CGAGTCGGTT TCCGTCGAAG GCATCCCGGA AGACTCGTTC
CTCGTCGAGG GCGCGGGAGA GACGTTCCAC CGCGACACCA AGCAGTGCCA CTTCGCGTCG
ATGCCCGCGT CGGGCGACGT GGCGGTCACG GCCCGCGTCG CGGATCTCGA ACCGGTCGAT
CCCGAAACCC AGGCCGGTGT GATGGTGGCC GACGATCCGG ACGCGCCCGG CGCGCTCGGT
GCCGCCACGA TCACGCCCGG CGAGGGGAGC GAACTGACGC GGGCTTACGA CTCGACGGTG
TGGCGCGAGC GTGCCGGCGA CGATCGCACG CCGCCGATCT GGTTGCGCGT CAAGCGGTCG
GGATCGACAG TGTCGGCCTC GGTCTCGCCG AACGGCTCGG ACTGGACGGA GATCGGCTCC
GGCGACGTCG ATCTCCCCGA TGATGTCCAC GTCGGGCTGT TCGTCAGCAG CAACGCCGCC
GGCGAACTCG CCGCCGCGCG CTTCGACGAG GTTGATTGGC TGGAGGACTG GACGGCGACC
GACGTCGGCC CCGTTTCGGT GGCCGGCGCG ACGACCGCCG GCGACGGCAC CACTGACGAC
GGTGATGGCG ACGAGGACAC GACGCCGCCG ACGGCGCCCG GCGATCTGAC AGTGACCGAG
ACGACGGACT CCTCGATTTC GCTCTCGTGG GACGCCGCCA CCGACGACGG TGGGTCGGGC
CTTGCCCACT ACGATGTCTC CGTCGACGGC GCGCTCGACC AGCAGGTCCC CGCTGGCACG
ACGACCGCGA CGGTTGAGGC CCTCGATCCC GGGACGGCCT ACGACATCGG GGTGTCAGCT
GTCGACGGCG CGGGCAACGA ATCCGGGACC GTGACGGTGA CGGCGACGAC CGGGGACGGC
GACGACGAGG CACCGACGGC GCCCGCCGAC CTGACGGCGA CCGAAACAAC GAGTTCCTCG
GTCTCGCTCT CTTGGGACGC CTCGACGGAT TCGGGCGGCT CCGGGGTCGA GCAGTACGTC
GTTGCCGTCG ACGGCGAAAC GGCCCACACC GTCGAGGCCG ACACAACGAG TACGACCGTC
GAGGAACTGG ACGCCGAGAC GACCTACGAG CTCGGCGTCT CGGCGGTCGA CGCGGCCGGA
AACGTGTCCG ACCCGGCCGT CATCGAGGTG GCGACCGCCG AGGGCGACGA TAGCGATGAG
GAACCGCCAG AAAATGCCCT GGTCGTCAAC GACTACGACG GTGATCCGGC GTGGTCCAGC
AATCGCAACG ACCTCGGGAA CTGGTGTGGG GCCGGCTCCT TCGCAAACGG CGGTGGCGAT
GTCGAAGATG GCGCACTCGT CCTCGAATAC GACAACGCCG GGTGGTTCGT CGAGCAACTC
AACCAGGATG TCTCCGCGCA CTCCGAACTG GTGTTCGTCG TCAGTGGTGC GAGTGGCGGC
GAGGGCGATC ACTTCGTCGT CAGCGCCGGC GGTGTCCGCT CGCGGTTCAG CGACGTGGCG
GACGGGTCGA TCGACACCGA TCCGAAGCCG ATCGCGATTG ACATGGAATC GGCAGGGATC
GACGCCACGT CGCCCGGGGA ATTGCGTTTG AACTTCTGGC AGGGTGGGTC CGGAAGTGGG
GCCCTCCGCA TCGAGGAGAT CAGACTGGAG TAA
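
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any analysis. As a rough sketch (Python; the helper names `clean_sequence` and `gc_fraction` are hypothetical), the GC content of a pasted sequence could be estimated as below and compared with the GC content listed in the gene information.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Example with the first line of the gene sequence shown above.
first_line = "ATGGCACGAG ACAACCATAC TTACGCCGGC GGCGGTGCAG ACCGTCCCGA CGGACGGACG"
print(f"{gc_fraction(first_line):.2%}")
```

Passing the full gene sequence instead of a single line gives the value to compare against the record; a single line only reflects local composition.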
 
Protein sequence
MARDNHTYAG GGADRPDGRT YRPDDRRSAL AASRRDVLRT IGAGALLGSI GTARVQAAPG 
DREFVATDGP EFTVGGEPIY FSGTNNFWVT DPYSDRSRID DVLALCADLD QNLLRTWAFC
AGEGGQCLQP EPGVFNEAAL QHLDYLVAKA GEHGVRLILS LVNNWDDYGG MAQYIEWADG
ASEHGDFYVN EACRELYRTH VETLLTRENS ITGVEYRNDP AIAMWELANE PRLEDDDTET
IDDREAALTE WFADMSGFIK DFDDNHLVTT GLEGFYTRAD GPNWMYGDWT GQNFIAHHEI
DTIDVCSFHL YPYHWPGMGL AGQLAEDDVV SAVEWIREHA ADARETLEKP ALLGEFNVNV
QEHDLATRND RLRAWYDALD SQDAGAAAIW QLVLEDTEDH DGFQVYRSES GDILSGYAST
IREKSGHSDG TPTADATAPS SLRIGESGDF SGTYSFDPDG SIAAYDWAFD DGATATGERV
AHRFAETGSH EAELTVTDDS GATDADIESV SVEGIPEDSF LVEGAGETFH RDTKQCHFAS
MPASGDVAVT ARVADLEPVD PETQAGVMVA DDPDAPGALG AATITPGEGS ELTRAYDSTV
WRERAGDDRT PPIWLRVKRS GSTVSASVSP NGSDWTEIGS GDVDLPDDVH VGLFVSSNAA
GELAAARFDE VDWLEDWTAT DVGPVSVAGA TTAGDGTTDD GDGDEDTTPP TAPGDLTVTE
TTDSSISLSW DAATDDGGSG LAHYDVSVDG ALDQQVPAGT TTATVEALDP GTAYDIGVSA
VDGAGNESGT VTVTATTGDG DDEAPTAPAD LTATETTSSS VSLSWDASTD SGGSGVEQYV
VAVDGETAHT VEADTTSTTV EELDAETTYE LGVSAVDAAG NVSDPAVIEV ATAEGDDSDE
EPPENALVVN DYDGDPAWSS NRNDLGNWCG AGSFANGGGD VEDGALVLEY DNAGWFVEQL
NQDVSAHSEL VFVVSGASGG EGDHFVVSAG GVRSRFSDVA DGSIDTDPKP IAIDMESAGI
DATSPGELRL NFWQGGSGSG ALRIEEIRLE
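
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below (Python) illustrates this under a stated assumption: translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons, so alternative start codons are not handled here. The names `CODON_TABLE`, `clean_sequence` and `translate` are illustrative.

```python
# Compact codon table: bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def translate(raw: str) -> str:
    """Translate DNA codon by codon; '*' marks a stop codon."""
    dna = clean_sequence(raw)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence shown above.
print(translate("ATGGCACGAG ACAACCATAC TTACGCCGGC"))  # MARDNHTYAG
```

The first ten residues produced this way match the start of the protein sequence listed above, and the final codon of the gene (TAA) translates to a stop.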