Gene Hore_04020 details


Gene Information

Locus tag: Hore_04020
Symbol:
ID: 7314077
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 412184
End bp: 413491
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 43%
IMG OID: 643610826
Product: Exodeoxyribonuclease I subunit D
Protein accession: YP_002508156
Protein GI: 220931248
COG category: [L] Replication, recombination and repair
COG ID: [COG0420] DNA repair exonuclease
TIGRFAM ID: [TIGR00619] exonuclease SbcD
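
The gene and protein lengths listed above follow from the start and end coordinates. The Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS includes its stop codon; the function names are illustrative and are not part of the database.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive positions on the replicon.
START_BP = 412184
END_BP = 413491


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based interval."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS that includes its stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1308 435
```

Both values agree with the Gene Length and Protein Length fields of the record.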


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.0356343
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACTGTA TACAAAATCA ATATAAGTTT ATTGTACATG AACTTATAAC TGTTGTTATA 
AAAAGGGGGA GAGGTCCTTT GAGAATTTTA CATACTGCTG ACTGGCATCT GGGGAAACAC
CTGGAGGGAT GGAGTAGATA TGAAGAACAA AAAGAATTTG TTGAAGAAAT AATTGAAATA
GCTGATGATA ATAAGGTAGA CATGGTTTTA ATATGTGGGG ATATATTTGA CACTACTAAC
CCTCCAGCGG AGGCGGAACA GCTTTTTTTT CAGGCGATGG ATTACCTGTC AAAAGGTGGG
GAGAGGGTAA TCTGTTTGAT CTCTGGTAAC CATGATAGTC CCAACCGTCT TATGGCCCCG
GGGCCTCTGG CTTCCAGACA GGGAATTTTT ATTATGGATG AGCCCCGGGG AGACAGGTAT
AAGCTGGATG ATGACCGGGT GTTAAACCGT GGTCAGGGGT ACATAGAACT TGAAATTAAC
GGAGAAGGTG TTGTCCTTAC GGCTCTGCCC TATCCTTCTG AGAGCCGGTT AAACCAGGTC
TTTTCATGGA CCGGTGATGA CCGGGCAGTG CAGGAAAGTT ATTCCCGTCG AGTGGGTCAG
ATTTTTTCCC ATTTAGAGCA ATATTATCGT GAAAATACAA TCAACATTGC CATGAGTCAC
CTTTTCGTTG CCGGGGGTCA GACTACCCGG TCTGAAAGAC CCATCCAGGT TGGTGGCAGC
CTGACAGTAT TACCGGAACA CCTTCCAGAA AAATCCCAGT ACACAGCCCT GGGCCACCTC
CATCGTTATC AAATTGCTTC TTCAGCCCGG AGGGCTTACT ATTCAGGTTC TCCGTTGCAG
TATAGCCTCA GTGAAAAAGA TCATAAAAAG TGTGTTAACC TTGTAGAGCT TCATCCGGGA
GAGGAGGCCC GGATTGAACA GGTTGAATTG ACAACAAAAA AACCAATCGA GGTCTGGGAA
GCAGAAGGGG TTGAAGAGGC TATAAAAATG GTTGAGGCCA ATAAGGACCG TTCTGTCTGG
GCATACCTTT ATATAAATGT GGATAAAACT CTACTTCAGT CTGAAATTAA AAAAATAAAG
GAAATTAAAA AGGATATTCT ATGTATTAAC CCCATAACCC CTGAAGAGAA ATATGAATGG
GAAACTATAA AGATGCCTGA TGAGGACCTG GATATAATGG AATTGTTTAA AGAATATTAT
CGTAAGACAA AAAAAGTTGA GCCTGATGAC GATATTATCA GGATGTTTAG CAGTATTGTT
AATGATACCA GGGAAAAGGG GGAGCATGAT GAGACCGCTG CTTCTTAA
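
The gene sequence above is laid out in blocks of 10 bases, so it needs the whitespace removed before any computation. The sketch below shows one way to do that and to compute a GC fraction like the one reported in the GC content field. The helper names are illustrative, and only the first line of the listing is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence listing, copied verbatim.
first_line = "ATGGACTGTA TACAAAATCA ATATAAGTTT ATTGTACATG AACTTATAAC TGTTGTTATA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full listing to `clean_sequence` and then to `gc_fraction` gives the value for the whole gene.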
 
Protein sequence
MDCIQNQYKF IVHELITVVI KRGRGPLRIL HTADWHLGKH LEGWSRYEEQ KEFVEEIIEI 
ADDNKVDMVL ICGDIFDTTN PPAEAEQLFF QAMDYLSKGG ERVICLISGN HDSPNRLMAP
GPLASRQGIF IMDEPRGDRY KLDDDRVLNR GQGYIELEIN GEGVVLTALP YPSESRLNQV
FSWTGDDRAV QESYSRRVGQ IFSHLEQYYR ENTINIAMSH LFVAGGQTTR SERPIQVGGS
LTVLPEHLPE KSQYTALGHL HRYQIASSAR RAYYSGSPLQ YSLSEKDHKK CVNLVELHPG
EEARIEQVEL TTKKPIEVWE AEGVEEAIKM VEANKDRSVW AYLYINVDKT LLQSEIKKIK
EIKKDILCIN PITPEEKYEW ETIKMPDEDL DIMELFKEYY RKTKKVEPDD DIIRMFSSIV
NDTREKGEHD ETAAS
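
The protein sequence can be checked against the gene sequence by translating it with translation table 11. Table 11 assigns codons to amino acids in the same way as the standard genetic code and differs only in the alternative start codons it permits. The sketch below is a minimal translator under that assumption; it does not handle alternative start codons or ambiguous bases, and its names are illustrative.

```python
from itertools import product

# Codon-to-residue assignments in NCBI order (bases T, C, A, G).
# These assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace is ignored. A codon containing a base other than
    A, C, G, or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 18 bases of the gene sequence listing.
print(translate("ATGGACTGTA TACAAAATCA ATATAAGTTT"))  # MDCIQNQYKF
print(translate("GAGACCGCTG CTTCTTAA"))               # ETAAS
```

The two fragments reproduce the first ten and last five residues of the protein sequence listed above.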