Gene Hhal_0925 details

Gene Information

Locus tag: Hhal_0925
Symbol:
ID: 4709879
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 999053
End bp: 1000315
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 70%
IMG OID: 639855394
Product: peptidase M42 family protein
Protein accession: YP_001002503
Protein GI: 121997716
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
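
The start, end, and length fields above agree with each other if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the check follows; the function and variable names are illustrative and not part of the database.

```python
# Values copied from the gene record above.
START_BP = 999053
END_BP = 1000315


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1263
print(protein_length(length))  # 420
```

Both results match the Gene Length and Protein Length fields of the record.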


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCAAT CCAGCAAGCC GTGGACCCAG TCCATGCCCG AGGAGCAGTT CGAGCGCATG 
CGCGAGGTCC TCGCCGCGCC CAGCCCGGTC GGCCTCGAAG GGGCCATGAC CTACGGGGTG
CTCAAGCCGT ACTTCGAATC CTTCGCGCCC GCCGAGTGGC GCGTCCACCA GTTCCAGGGG
CACGCCGGCA TCGTCCTCGA CACCCATCCG GGCCGGGACG ATCTATTCAA GGTGATGGTG
GTCGGCCACG CCGACAAGAT CCGCATGCAG GTGCGCAGCA TCGGCGACGA CGGCAAGGTC
TGGATCGACA GCGACTCCTT CCTGCCCGGC ACCCTGATCG GCCACGAGGT CACCCTGTTC
AGCGAGGCCC CGGAGAACCC CGGCGCCTAC CGGCGCATTG AGGGCGGCAC CGTCGAGGCT
CTGGGCGCCA TCCACTTCGC CGACGAGGAG ACGCGCACCG GGCGTAAGGG GGTCAGGAAG
GAGCAGCTCT ACCTGGAGCT TCACATCCAC GGCGAGAACA AGAAGAAGCA GGTCGAGGAC
CTCGGCGTCC GCCCCGGCGA CCCGATCCTC CTCAACCGGC CCATCCGCCG GGGCTTCAGC
CCGGACACCT TCTACGGGGC CTATCTGGAC AACGGCCTGG GGTGCTTCAC CACCGCCGAG
GCGGCGCGCC AGATCGCCGA GGCCGGCGGC GCCCGCAACG TGCGCATGCT CTTCGCCATC
GCCAGCTACG AGGAGATCGG CCGCTTCGGC AGTCGCGTGC TGGCCAGTGA GCTGCGCCCC
GATGCGCTGA TTGCCGTGGA CGTGGACCAG GACTACGTCG CCGCCCCGGG GGTCTCGGAC
AAGCGCTTCC AGCCCCTGAC CATGGGTGCC GGCGTCACCT ACACCGTTGG CGCGGTGGCC
AGCGATCAGC TCAACGCGGT GATCCAGCGG GTGGCGACCG AGCAGGACAT CCCGGTGCAG
CGCGACGTCA GCGGCCGCGA CACCGGCACC GACGGCATGG CCGGGGTGCT CGGCAACGTG
GATTGCACCG CCGCCTCGCT GGGGATCCCG GTGCGCAACA TGCACACCAT CTCCGAGAGC
GGCCACACCG GGGACGTCCT GGCGGCCATC CACCTGGTCA CCGGGACCCT GCAGGCCCTC
GATGCCCAGG ACGACGGCAG TGGCCGACTG CGCGAGACCT TCCGCCAGGG GCATCCACGC
CTGGATCAGG CGGCCGGGCT CAGCCACCCG GGCCCGAAGG CCAAGAACGG CGAGGCGAAG
TAA
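
The GC content field can in principle be recomputed from the sequence above. The sketch below assumes GC content means the percentage of G and C bases over the whole gene, and that the spaces and line breaks in the listing are display formatting only. The `gc_content` helper is hypothetical and not taken from the database.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First line of the gene sequence, pasted with its original spacing.
first_line = "ATGACCCAAT CCAGCAAGCC GTGGACCCAG TCCATGCCCG AGGAGCAGTT CGAGCGCATG"
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```

Passing the full 1263-base listing to the same function should give a value comparable to the reported figure, depending on how the database rounds it.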
 
Protein sequence
MTQSSKPWTQ SMPEEQFERM REVLAAPSPV GLEGAMTYGV LKPYFESFAP AEWRVHQFQG 
HAGIVLDTHP GRDDLFKVMV VGHADKIRMQ VRSIGDDGKV WIDSDSFLPG TLIGHEVTLF
SEAPENPGAY RRIEGGTVEA LGAIHFADEE TRTGRKGVRK EQLYLELHIH GENKKKQVED
LGVRPGDPIL LNRPIRRGFS PDTFYGAYLD NGLGCFTTAE AARQIAEAGG ARNVRMLFAI
ASYEEIGRFG SRVLASELRP DALIAVDVDQ DYVAAPGVSD KRFQPLTMGA GVTYTVGAVA
SDQLNAVIQR VATEQDIPVQ RDVSGRDTGT DGMAGVLGNV DCTAASLGIP VRNMHTISES
GHTGDVLAAI HLVTGTLQAL DAQDDGSGRL RETFRQGHPR LDQAAGLSHP GPKAKNGEAK