Gene Hhal_1107 details

Gene Information

Locus tag: Hhal_1107
Symbol:
ID: 4710053
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1202590
End bp: 1204332
Gene length: 1743 bp
Protein length: 580 aa
Translation table: 11
GC content: 70%
IMG OID: 639855579
Product: glycoside hydrolase family protein
Protein accession: YP_001002685
Protein GI: 121997898
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1449] Alpha-amylase/alpha-mannosidase
TIGRFAM ID:
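
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming inclusive coordinates and a CDS whose final codon is a stop codon. The function and constant names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1202590
END_BP = 1204332
GENE_LENGTH_BP = 1743
PROTEIN_LENGTH_AA = 580


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming start and end are both inclusive."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Compare the computed values with the values reported in the record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the reported 1743 bp and 580 aa.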


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.185203
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTCTATTG ATACCGGCGC CGCGAACCCG ACAGAAAACC CACCGGTGCG CGTGGTGCTC 
TGCTGGCACA TGCACCAGCC GAGTTACCTC GACCCCGGCA CCGGCGACTA CCTGCTGCCG
TGGACCTACC TGCACGGCAT CAAGGACTAC GCCGACATGG CGGCCCACCT CGAGGCCAAC
CCAGCGGCCC GCGCCGTGGT CAATTTCTCG CCGGTACTCC TGGAGCAGCT GGAGGACTAC
AGCCGCCAGA TCGCCGGCTT CCTGGAGAGC GGCGAACGGC TGCGCGACCC CCTGCTGCGC
GCCCTGGCCA TGCCCACCCT GCCGGTCAAC AACCCGGAAC GCCGCCAGCT CGTGGAGCAG
TGTCAGCGCA TCAACCGCCA GCGGCTGGCG GACCCGTTCC CCGCCTTCCG ACACCTGCTC
GAGATCGCCG AGACGGTCAG CGCCCACCCG GCCAGCATGC GCTACCTCAG CGACGCCTTC
CTGGTCGACC TGATCACCTG GTACCACCTG GGCTGGATGG GCGAGACGGT GCGCCGCAGC
GAACCACGGG TCCAGCGGCT GATCGACAAG GGCGACGGCT TCACCCTCCA CGATCGGCGC
GAGCTGCTCG GTGTCATCGG CGAACTGCTC GCGGGACTGC CCCACCGCTA TCGGCACCTG
GCCGAGACCG GACGGGTGGA GCTGTCGATG ACACCCTACG CCCACCCGAT CCTGCCCCTA
CTCCAGGACC TGCAGTCCGC CCGCGAATCC TGGCCCGCGA GCCCGATGCC GGTGGTCGAT
CAATACCCCG GTGGCGAGGA ACGGGCGCGC TGGCACCTGG CGCGCGGCCT GGAGATCTTC
GAACACACCT TCGGCCACCG CCCGCACGGC TGCTGGCCGT CGGAGGGGGC CGTCAGCCAG
GCCACGCTCG CGCTGCTCCA ATCGTACGGT TTCCGCTGGA CGGCCAGTGG CGGTGCAGTG
CTCGACAACA GCCTGGGCGC AGAGGGCGGG GGGAACGGCC AGTGGCACCG CGCCTACACC
CTCGCCCCGC AGCACCCCGA AGCCGAGGGC CACGCGGCCA CACGCTGCTT CTTCCGCGAC
GACGGACTCT CCGATGCCAT CGGTTTCGAG TTCTCCGACT GGCACGGCGA CGACGCCGTG
GCCAACCTGG TCAGCCGCAT GGAGGCCATC GCCGCCGGCT GCGAGGCGCC GGAACAGACC
GTGATCTCGG TCATCATGGA CGGTGAGAAC GCCTGGGAGC ACTACCCGGC CAACGGCTTC
CATTTCCTGA ACGGGCTCTA CCAGCGCCTG GCCGAACACC CGGGGCTGAT CCTGACCACC
TTTGCCGAGG CCGCCGACGA GACCGAACCT CGCGATCTGC AGCGACTGGT GGCCGGCAGC
TGGGTCTACG GCACCCTATC CACCTGGATC GGCGAGGTGG ACAAGGACCG CGCCTGGGAG
CTCCTCGCCG AGGCCAAGCA GGCCTTCGAT CAGCGCGCCG GGGAACTCGA CGAGGCCACC
CGCATCCGGG CCGAGGCGCA GCTGGCCATC TGCGAGAGCT CCGATTGGTT CTGGTGGTTC
GGCGACTACA ACCCCGCCGA GGTGATCCGG GACTTCGACC ACCTCTACCG CGTCCAGCTC
GCCGCCCTCT ACCACACCCT GGGCCAGGAG CCGCCGGAGC ATCTCAGCCA CGCCTTCGCC
CGCCAGGGCC GCGGCCAGCC CGAGCGCGGC GGCGTCATGC GGCACGGACG CGCAGAGCCG
TGA
 
Protein sequence
MSIDTGAANP TENPPVRVVL CWHMHQPSYL DPGTGDYLLP WTYLHGIKDY ADMAAHLEAN 
PAARAVVNFS PVLLEQLEDY SRQIAGFLES GERLRDPLLR ALAMPTLPVN NPERRQLVEQ
CQRINRQRLA DPFPAFRHLL EIAETVSAHP ASMRYLSDAF LVDLITWYHL GWMGETVRRS
EPRVQRLIDK GDGFTLHDRR ELLGVIGELL AGLPHRYRHL AETGRVELSM TPYAHPILPL
LQDLQSARES WPASPMPVVD QYPGGEERAR WHLARGLEIF EHTFGHRPHG CWPSEGAVSQ
ATLALLQSYG FRWTASGGAV LDNSLGAEGG GNGQWHRAYT LAPQHPEAEG HAATRCFFRD
DGLSDAIGFE FSDWHGDDAV ANLVSRMEAI AAGCEAPEQT VISVIMDGEN AWEHYPANGF
HFLNGLYQRL AEHPGLILTT FAEAADETEP RDLQRLVAGS WVYGTLSTWI GEVDKDRAWE
LLAEAKQAFD QRAGELDEAT RIRAEAQLAI CESSDWFWWF GDYNPAEVIR DFDHLYRVQL
AALYHTLGQE PPEHLSHAFA RQGRGQPERG GVMRHGRAEP
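
Both sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any downstream use. The following is a minimal sketch of that clean-up together with a GC fraction calculation. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace so space-separated blocks become one string."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (case-insensitive)."""
    seq = join_blocks(dna).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record; the full
# sequence can be pasted in the same way.
first_line = "TTGTCTATTG ATACCGGCGC CGCGAACCCG ACAGAAAACC CACCGGTGCG CGTGGTGCTC"
dna = join_blocks(first_line)
print(len(dna), round(gc_fraction(dna), 3))
```

Running the same functions on the complete gene sequence would allow the result to be compared with the reported gene length and GC content.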