Gene Huta_2390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2390 
Symbol 
ID8384689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2437095 
End bp2438666 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content63% 
IMG OID644973463 
ProductCarbohydrate-binding family V/XII 
Protein accessionYP_003131289 
Protein GI257053456 
COG category[R] General function prediction only 
COG ID[COG3979] Uncharacterized protein contain chitin-binding domain type 3 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01634] phage tail protein, P2 protein I family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.254143 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACGAC GCACGAACGA TACCGGCGAA GTAGATGAGA AACCAAGTAG CGGTGCTGAG 
CAGCAAGGTT CGAACGACTC GACCGGCTCC AGGGACCCGT CTCGACGTGA CTTCCTGAAG
GCCGGTGCCG CGGTCGGTGC AGGGACGTTC GCGGTCGGAC TCGGGCAGCA GGCCACGGCG
ACCACGGCGA CGGACCCGTC GAATCTTGAT CTGTACCTGC TGTTCGGCCA GTCGAACATG
GAGGGACAGG GACCGATCGA AGCCCAGGAC AGGGAGACCC ATCCGCGGAT CCACGTCCTC
GCTGACAAGA CCTGTCCGAA CCTCGATCGC GAGTATGGCG AGTGGTATCT GGCCGAACCG
CCGCTCAACC GATGCTATGG GAAGCTCGGT CCCGGCGATT ACTTCGCGAA GTCCATGATC
GAGGAGATGC CGGACGACCG GTCGATCGGT CTCGTTCCCG CAGCCGTCAG CGGGGCCGAC
ATTGCTCTCT TCGAAAAGGG GGCACCGATC GGTCGGAACG ACCGCGACAT CCCCTCCCAG
TTCGACGGCG GCTACGAATG GATGGTCGAT CTTGCCGAAA CGGCCCAGCA AGTCGGGACG
TTCAGGGGCA TTCTGTTCCA CCAGGGCGAG ACGAACACGA ACGATCAGCA GTGGACCGAT
CAGGTCCAGG GTATCGTCGA GGATCTCCGC GCCGACCTCG GTATCGGCAA CGTCCCGTTC
CTGGCGGGTG AGATGCTCTA TGACTCGGCT GGGGGGTGTT GTGGCTCGCA CAACACTGAA
GTCAACGAAC TCCCGGACGT CATCGAGAAC GCTCACGTCG TCTCGGCTGA AGGACTTGCC
GGCCAGGATT ACGCGCACTT CACGTCCGAA GCGTATCGAG AACTCGGCCG TCGCTACGCT
GCGGAGATGC TGGAACACGT CGACGTCAGC GGCGGGACCG ACGACGGATC CGGCGGCAAC
TCCGGTGATG ATTCGGGTGG CAACGATGGC GATGGGTCGG GCAGTGACTC CGATGATGAC
TCGGACAGTG ACACTGGCGA CTCCGGCGAT GATTCGGGCA GTGATACCGG CGATAGTTCG
GGCGATGACG CCGGTAGCGA CTCAGGGGGT TCCAGCGAGT ATCCCACGTG GGATTCAACT
GCCGTTTATC GCACCGGCGA TCGGGTCGTC CACGACGGAC GCGTCTGGGA GGCCCAGTGG
TACACCCAGG ATCAGGAACC CCGCGAGGAG GACTACTACG TCTGGCAACC TGTCGAGGAC
GAAAGCGCCG GTAATTCCGG CGGTGACACC AGCGGGGAAT CGGGTGGTGA CACCGGTAAC
TTGAACGCCG AGATGGATCC GAGCACGACA GCGGCCAGTG TCGGTGAGCG GGTCACGTTC
CGCGTCACCG ACACGAGCGG TTCGAGCAAT TGGCTCACTT CTCTGGCGTT CGATTTCGGA
GACGGGATGA CAGCCACCGG GTGGTGGGCT GCCCATTCCT TCGATTCGCC GGGCACCTAC
ACCGTCACGC TCACCGCGAC CGACAACGGG GGTGCATCGA CCACTCACGA GGTGACGATC
ACGGTCTCGT AA
 
Protein sequence
MTRRTNDTGE VDEKPSSGAE QQGSNDSTGS RDPSRRDFLK AGAAVGAGTF AVGLGQQATA 
TTATDPSNLD LYLLFGQSNM EGQGPIEAQD RETHPRIHVL ADKTCPNLDR EYGEWYLAEP
PLNRCYGKLG PGDYFAKSMI EEMPDDRSIG LVPAAVSGAD IALFEKGAPI GRNDRDIPSQ
FDGGYEWMVD LAETAQQVGT FRGILFHQGE TNTNDQQWTD QVQGIVEDLR ADLGIGNVPF
LAGEMLYDSA GGCCGSHNTE VNELPDVIEN AHVVSAEGLA GQDYAHFTSE AYRELGRRYA
AEMLEHVDVS GGTDDGSGGN SGDDSGGNDG DGSGSDSDDD SDSDTGDSGD DSGSDTGDSS
GDDAGSDSGG SSEYPTWDST AVYRTGDRVV HDGRVWEAQW YTQDQEPREE DYYVWQPVED
ESAGNSGGDT SGESGGDTGN LNAEMDPSTT AASVGERVTF RVTDTSGSSN WLTSLAFDFG
DGMTATGWWA AHSFDSPGTY TVTLTATDNG GASTTHEVTI TVS