Gene Hore_06650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_06650 
Symbol 
ID7314571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp720621 
End bp721853 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content40% 
IMG OID643611096 
Productarginine deiminase 
Protein accessionYP_002508417 
Protein GI220931509 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2235] Arginine deiminase 
TIGRFAM ID[TIGR01078] arginine deiminase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000000122198 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAAAA AAAGTCCTCT TAATGTAACA TCTGAAATAG GCAAACTAAA AAAAGTACTA 
CTGCATCGAC CAGGCCACGA AATTGAAAAT TTAACTCCTG ATTTACTGGA AAGGTTACTA
TTTGATGACA TTCCCTATTT AAAGGTAGCT CAGGAGGAGC ATGATGCCTT TGCTCAGACC
CTGAGGGATA ATGGAGTAGA AGTACTTTAT CTTCATGAAC TGGCTGCAGA AGCCATCCAG
GAAGATGAAA TCAGGAAAAA ATTTATTGAG CAATTTTTGG ATGAAGCTGG TGTAATTGGA
AAAGGAGCCC GTCAGGTCCT GAAAGAGTAT TTTGCCGATA TGGATAATGA AACCTTAATT
AGAAAAATGA TGGCTGGAGT CAGGAAAAAG GAGATACCGG CCATAGAGAA GGTTGCTTCT
TTGAATGATA TGGTAGAAGA AGATTACCCC TTTGTTTTAG ATCCGATGCC CAACCTCTAT
TTTACTAGAG ATCCTTTTGC CACTATTGGT ACAGGTATTA CTTTAAACCA TATGAGGACT
GAAACCCGTA ATCGGGAAGT TATTTTTGCC GAATACATCT TTAGTTATCA CCCTGACTTT
AAAGATACTG AAATCCCCTT CTGGTTTGAC AGGAATGAAA CAACCTCTAT TGAAGGCGGA
GATGAGCTGA TTTTAAGTGA TAAGGTCCTG GCTATGGGTA TTTCTGAGAG AACTGATGCT
GCTTCTATAG AAAAAGTAGC CCGTAATATC TTTACTGATG GTCAGCCTTT TGAGACTATT
CTTGCTTTTA AGATTCCAGA AAAACGCGCC TTCATGCATC TGGATACTGT ATTTACAATG
GTTGATTATG ATAAGTTTAC TATTCATGCT GAAATTGAAG GTCCCCTCAA GGTTTATTCA
ATTACTAAAG GGGATAATGA TGAGCTTAAG ATTGATGAAG AAAAAGCTAC CCTTGAGGAT
ACTTTAAAGA AATACCTTGG GCTCGATGAA GTTACCCTTA TCAGATGTGC CGGTGGCGAT
TATATTGATG CCGGACGTGA GCAGTGGAAT GATGGTTCTA ATACCCTGGC TATTGCTCCT
GGTGAAGTAG TTGTTTATAA CCGTAACCAT ACTACAAACA GGCTCCTGGA AGAGCACGGT
ATTAAACTCC ATGTTATTCC CAGTTCTGAG TTATCCCGTG GCCGTGGTGG TCCAAGATGT
ATGAGTATGC CCCTGGTACG TGAAGATATT TAA
 
Protein sequence
MFKKSPLNVT SEIGKLKKVL LHRPGHEIEN LTPDLLERLL FDDIPYLKVA QEEHDAFAQT 
LRDNGVEVLY LHELAAEAIQ EDEIRKKFIE QFLDEAGVIG KGARQVLKEY FADMDNETLI
RKMMAGVRKK EIPAIEKVAS LNDMVEEDYP FVLDPMPNLY FTRDPFATIG TGITLNHMRT
ETRNREVIFA EYIFSYHPDF KDTEIPFWFD RNETTSIEGG DELILSDKVL AMGISERTDA
ASIEKVARNI FTDGQPFETI LAFKIPEKRA FMHLDTVFTM VDYDKFTIHA EIEGPLKVYS
ITKGDNDELK IDEEKATLED TLKKYLGLDE VTLIRCAGGD YIDAGREQWN DGSNTLAIAP
GEVVVYNRNH TTNRLLEEHG IKLHVIPSSE LSRGRGGPRC MSMPLVREDI