Gene Htur_4495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4495 
Symbol 
ID8745124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013745 
Strand
Start bp92258 
End bp94576 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content66% 
IMG OID646515032 
Productalpha-L-arabinofuranosidase domain protein 
Protein accessionYP_003405979 
Protein GI284172597 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3534] Alpha-L-arabinofuranosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.183163 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGTGG CCGGGGCCGC GACCGGAACG GCGGCGGCCG ATTCGCACAC TGGAGCCGAA 
TCCGGTGACG GCATCCGGAA CACCGTCTCG CTGGACCTCT CGGAACGGAC CGAAGACGAG
GTGTCCGACC AGTTGTTCGG CCGGCTCTGC GAGCACTACG AGTCGGGGAC GATTTACCCC
GGCGTCTACT CCGAACACGT CAAGAACAAC TCGTTCTATC CGAGAACGTG GTCGGAAGAC
GATTACTTCG GCCCGAAGAC GCGCTTCGAT CCCGAATCGA TAGCACGCCA CGAGAACGTC
CCGTTCCCGT GGGAACCCGT CGATAACTCC GGGGTCTCTT TCGAGCAGCG CGAGGGCGGC
GTCGCCGCCG TGGATACGAC GAACTACCAG CGGGTCTCCC TCGAGGACGC CCGTGGAGGG
ATCTCACAGA AGATCGTCCT GCCAGACTTC CGCACGCTGG GCTACGACCT CTCGTTCTCG
GTGCGCGGCG ACGGTCTCGA GTCGGTCACC GCCGCCATCA CGACGCTTGA CGGTGAGACC
CTCGCGACCG CCGACGTCGA CGTCACCGAC GACTGGACCC GCCACGAGCT CGCGCTCGAA
TTGAGCGAGG CCAGCGGCGA TCAGTACGTC GCCGGCTCGG TCGCGAACGT CGACACCCCC
TACGGGGAGT ACGTCCTCGA GTTCACCGCC GAGGGAAGCG GTCACGTCGA TCTCGACTGG
ATCACCCTCG GGGCCGACGA CGCGATCAAC GGCAAGTTCA ACCCGTCGAC CGTCGAGTTG
ATGCGAGAGC AGAACACGAC CTGGCTGAAG TGGCCGGGCG GGAACTTCAC GAGCCAGTAC
AACTGGCGCG ACGGCATCGG TCCGCTCGAC GAGCGGCCGA TGCGCTTCAA TCACGCGTGG
GGCGGCGTCA ATCCGAACTA CTTCGGCATC GACGAGTACC TCGAGCTGTG CGAGATCGCC
GACCTCACGC CGCGACTGAC TATCGGCTGG TGGGACAACC CCGGCGAGTG GGCCTCGGAG
CGCCAGATCC TCCCCGAAGA CGCCGCCGAC TGGGTCGAGT ACTGTAACGG GTCGACGGAG
ACGGAGATGG GCGCGCTGCG CGCCGAGAAC GGCCACCCGG AGCCGTACGA CGTCAAACAC
TGGGAAGTCG GCAACGAGGT CTGGGGGCCG TGGCAGCGCG GCCACACCGC CGATCCCTCC
GAGTACGCGA GCGGCTCCGA GGAGCGGATC GGGTTCAACG AGTACTACGA CGCGATGATG
GCAGTCGACG ATTCCATTAC CGTCCTCGCG GACGGGATGG ATCCGGGCTA CGACGAGGCG
GAGACGCCGG ATCCCGACGA GTGGAACGGG ACGCTGTTCG AGGAGTCCGG AGACCGCATC
GACGGGCTGG ACCTCCACCG CTACAACTGG GGTATCGAGG ACCAAGAGGC GCGAGACGCG
TGGTTCGACG AGAACGACGC GGACGCCATC GACTACAACG AGGTGCTGGT CATGTTCCCG
ACGCAGTTCG GGGTGTTGAT GGACGACCTC AGCGCCGAAG CCGCCGACGC GGGGATCGAG
GACTTCCGGA TCAACGTCGG CGAGTACGGG CTCTTTCCGT CGGTCAACGA GGGCGATCCG
TACCCCGGCC CGGAGACGAT GCCGGGCGGC TCCTATATCG CGGGCATGCT CAACTCGTTC
ATCCGGCAGA GCGAGACCGT CGTCGAGGCC TCTCAGACGT GGGTGCCCGT GCGCATGTTC
CCGCCGGAGT TCACCGAGGC GCCGCCGGAT CCGAACCCGC TGGCGCCGGC GGGATCGGTG
TTCGGTCTCT ACTCGGCGGT GTTCGAGACC AACGCCGAGT GGCACGCGGT CGATACCGAG
ATCGACGGCG ACGGCCGTGA CATCCCCGAC ACCGGACCGC GCATCGACCC CATGGAGGAC
GTTCCCTACG TCGACGCCGC CGCCATGCAG AACAAGCGGG GCAAAGAACT GTGCGTCTTC
CTCACGAATC GGAACCTCCG GGAGCACGGC GAGGTCACGA TCGATCTCCC CGAGAAGTAC
GCCGGCAAGT CTGTGGCGAT CACTCGCCAC CGCGCGACCG CGTCCGAGCG ACCGCTTCCG
CACGACTTCC AGGACTCGTG GGAGGAACCG GACGTCTACG CGGTGGACAC CGCCATCGAA
TCAGTCGACC GTGACGGCTC GCTCACGCTC GAGGTCGGCC CTGCCTCGGT CGTCCGGTTA
CTCGTCGATA ACGACCACGG ACGTCCCGCG ACGATCGGCG ACGACGGCGT GTGGTCGGGG
CTCAACGGTA ACGAGTGCGA CCGCCGTCCC CGAAAGTGA
 
Protein sequence
MVVAGAATGT AAADSHTGAE SGDGIRNTVS LDLSERTEDE VSDQLFGRLC EHYESGTIYP 
GVYSEHVKNN SFYPRTWSED DYFGPKTRFD PESIARHENV PFPWEPVDNS GVSFEQREGG
VAAVDTTNYQ RVSLEDARGG ISQKIVLPDF RTLGYDLSFS VRGDGLESVT AAITTLDGET
LATADVDVTD DWTRHELALE LSEASGDQYV AGSVANVDTP YGEYVLEFTA EGSGHVDLDW
ITLGADDAIN GKFNPSTVEL MREQNTTWLK WPGGNFTSQY NWRDGIGPLD ERPMRFNHAW
GGVNPNYFGI DEYLELCEIA DLTPRLTIGW WDNPGEWASE RQILPEDAAD WVEYCNGSTE
TEMGALRAEN GHPEPYDVKH WEVGNEVWGP WQRGHTADPS EYASGSEERI GFNEYYDAMM
AVDDSITVLA DGMDPGYDEA ETPDPDEWNG TLFEESGDRI DGLDLHRYNW GIEDQEARDA
WFDENDADAI DYNEVLVMFP TQFGVLMDDL SAEAADAGIE DFRINVGEYG LFPSVNEGDP
YPGPETMPGG SYIAGMLNSF IRQSETVVEA SQTWVPVRMF PPEFTEAPPD PNPLAPAGSV
FGLYSAVFET NAEWHAVDTE IDGDGRDIPD TGPRIDPMED VPYVDAAAMQ NKRGKELCVF
LTNRNLREHG EVTIDLPEKY AGKSVAITRH RATASERPLP HDFQDSWEEP DVYAVDTAIE
SVDRDGSLTL EVGPASVVRL LVDNDHGRPA TIGDDGVWSG LNGNECDRRP RK