Gene Hhal_2327 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2327 
Symbol 
ID4709284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2553537 
End bp2554544 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content73% 
IMG OID639856802 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_001003892 
Protein GI121999105 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGTGTAC TAGGCATCGA GAGTTCCTGT GACGAGACCG CGGCCGCCGT CTACTGTGGT 
CGTGATGGCC TGTTGGCCCA CGCGGTGCAC AGTCAGGTGG CCGATCACGC CGCTTACGGC
GGCGTGGTGC CGGAGCTGGC CTCACGGGAT CACGTGCGCA AGCTGCCCGG TCTGGTCGGT
GGGGTGTTGC GGGATGCCGG CCTGACGCCG GCAGATCTCG ACGGCGTCGC CTGGACCCGC
GGCCCAGGGC TCCCCGGGGC CTTGATGGTC GGGGCCGGGT TCGCGCGCAC CTTCGCCTGG
GCGCGGGGGC TGCCGGCGGT CGGGGTGCAC CACATGGAGG GGCACCTGCT GGCGCCCCTG
CTCGAGCCCG ACCCGCCGGC CATGCCCCTG GTGGCCCTGC TGGTCTCCGG CGGGCATACG
ATGCTGGTGC AGGTGGCCGA CTTCGGCCGC TACCGGGTCC TGGGGGAGTC GGTGGATGAC
GCCGCCGGCG AGGCCTTCGA CAAGACGGCT CGGCTGCTGG GGCTGCCGTA CCCGGGGGGG
CCGGCCATCG CCCGCCTCGC CGTCGAGGGA ACACCGGGGG CGGTGCGCCT GCCGCGGCCG
ATGACCGACC GGCCGGGGCT GGACTTCAGC TTCAGCGGCC TGAAGACGGC GGTCCTCCAC
GCCGTCGAGG CGGCGGGGAA CGATCAGCAG GCCCGGGCGG ACATCGCCCA CGGATTCCAG
GAGGCGGTGG TGGATACCCT GGTGATCAAG TGCCGGCGTG CCATCGAGCA GACGGGCGCG
GGCCGGCTCG TGGTCTCCGG CGGGGTGGGC GCCAACGCCC GGTTGCGCGA ACGCCTGGAT
GAGGTGGGGC GGGCGAGCGG CTTCACCGCC CACTACCCGC GCCTGGAGCT GTGTACCGAC
AACGCTGCGA TGATCGCCTA CGCCGGCCTG CGTCGGCTGG AGGCGGGCTA CCGCGACGAT
CTCGACTTCA GCGTACGCCC CCGCTGGCCG TTGGCCGAGC TCAGCTAG
 
Protein sequence
MRVLGIESSC DETAAAVYCG RDGLLAHAVH SQVADHAAYG GVVPELASRD HVRKLPGLVG 
GVLRDAGLTP ADLDGVAWTR GPGLPGALMV GAGFARTFAW ARGLPAVGVH HMEGHLLAPL
LEPDPPAMPL VALLVSGGHT MLVQVADFGR YRVLGESVDD AAGEAFDKTA RLLGLPYPGG
PAIARLAVEG TPGAVRLPRP MTDRPGLDFS FSGLKTAVLH AVEAAGNDQQ ARADIAHGFQ
EAVVDTLVIK CRRAIEQTGA GRLVVSGGVG ANARLRERLD EVGRASGFTA HYPRLELCTD
NAAMIAYAGL RRLEAGYRDD LDFSVRPRWP LAELS