Gene Hore_15420 details


Gene Information

Locus tag: Hore_15420
Symbol:
ID: 7313139
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 1649744
End bp: 1651129
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 47%
IMG OID: 643611988
Product: hypothetical protein
Protein accession: YP_002509286
Protein GI: 220932378
COG category:
COG ID:
TIGRFAM ID:
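
The start, end, gene length, and protein length reported for this gene are mutually consistent, which a little arithmetic can confirm. The sketch below assumes the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1649744
end_bp = 1651129

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1386
print(gene_length_bp % 3)  # 0, the coding sequence is a whole number of codons
print(protein_length_aa)   # 461
```

Under these assumptions the computed values match the reported Gene Length (1386 bp) and Protein Length (461 aa).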


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000364962
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAAAA TCAAGGTGCT TTCTCCAACT GCTATACTGG GGTATGGTTT TCCAGTAGAG 
TCTTTTGAAA GGGGGCTGGA CCGGAAGCCA GATGTCATTG CGGTCGATGG TGGCTCCACC
GACCCCGGAC CTTATTATCT GGGGTCAGGC CTTTCCTTTA CAGACCGCAA CGCCGTAAAG
AGGGATTTGC ACTTAATGAT AGAGGCCGGT CAAAAGCTAA ATATACCCGT ACTGGTTGGT
ACGGCCGGGG GTTCCGGAGC CAGTGCCCAT CTGAACTGGT GTCTCGATAT TGTAAAGGAG
ATAATAAATG AAGAGGGCTT TAAGCTTAAA ATAGCCACCA TTGGGGCTGA GATTGACCGG
GAAGAGGTTA AAAACAGGTT AAGGGAAGGG AAACTATCTC CCCTCTATCC CGCTGAAGAG
GTTAATGAAG AAGAAATTGA CAGGGCCACG AGAATTGTGG GGCAGATGGG TATTGAACCT
ATTATTGAGG CTCTCAAGGG TGGAGCTGAC CTGATATTGG CCGGGCGGGC CTATGACCCC
ACTGTCTTTG CCGCCTACCC TATCCTGAAG GGTTTTGAAC GAGGTCTGGC TCTACATATG
GGTAAAATCC TGGAATGTGC CAGTATTGCT GCTGACCCGG GAAGTGGGAG TGATTGTATG
CTGGGGATAC TGGGGCAGGA TCACTTTATA CTTGAGCCCC TGAACCCGGA GAGAAGGTGT
ACGGTGACTT CGGTTTCAGC TCATACCCTG TATGAAAAGA GCAACCCCTA TAAACTTTAC
GGTCCCGGTG GGGTTATCGA TTTGACAGAG ACTGAATTTG AACAGATAGA TGAGAGGCGG
GTTAAGGTTA CCGGTAGTAA ATTTATCCCT GATGAAGATT ATACTATAAA GCTTGAAGGG
GCAAAGCTTG TTGGATACCG GACTATATCT ATTGCTGGAA CCAGGGATCC CATTATGATA
CGCCAGATAG ATGATATCTT AAAAGAAGTA AAAAGGATAG TTAACGAAAG TTTCAGTGAG
GACCGGGAAA AATATAATAT TTATTTCAGG GTATATGGTA AAAACGGGGT TATGGGGAAA
CTGGAACCGG TCCAGGAGAT AACCGCCCAT GAACTCGGTA TTGTTATTGA AGTTATTGCC
GATACCCAGA AACGGGCCAA CAGTATCTGT AGTTTTACCA GATCGACCTT GCTCCATTAT
GGTTATCCAG GACGGGTGGC TACAGCCGGT AACCTGGCTT TCCCTTATTC ACCTTCAGAT
ATTAAGGCTG GTGAAGTCTA TGAATTTAAC CTTCACCACC TGGTGCAGGT CGATGATCCC
CTTGAGTATT TCCCTGTCAG GTTTATGACA GAGGATACTA TTCCAGAAGA CGGGAGGTTA
ACATAA
 
Protein sequence
MDKIKVLSPT AILGYGFPVE SFERGLDRKP DVIAVDGGST DPGPYYLGSG LSFTDRNAVK 
RDLHLMIEAG QKLNIPVLVG TAGGSGASAH LNWCLDIVKE IINEEGFKLK IATIGAEIDR
EEVKNRLREG KLSPLYPAEE VNEEEIDRAT RIVGQMGIEP IIEALKGGAD LILAGRAYDP
TVFAAYPILK GFERGLALHM GKILECASIA ADPGSGSDCM LGILGQDHFI LEPLNPERRC
TVTSVSAHTL YEKSNPYKLY GPGGVIDLTE TEFEQIDERR VKVTGSKFIP DEDYTIKLEG
AKLVGYRTIS IAGTRDPIMI RQIDDILKEV KRIVNESFSE DREKYNIYFR VYGKNGVMGK
LEPVQEITAH ELGIVIEVIA DTQKRANSIC SFTRSTLLHY GYPGRVATAG NLAFPYSPSD
IKAGEVYEFN LHHLVQVDDP LEYFPVRFMT EDTIPEDGRL T
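
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is one minimal way to reproduce that, not the method used to build this record. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it ignores alternative start codons. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base ordering.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGGATAAAA TCAAGGTGCT TTCTCCAACT GCTATACTGG GGTATGGTTT TCCAGTAGAG"
print(translate(first_line))  # MDKIKVLSPTAILGYGFPVE
```

The 20 residues printed for the first line match the start of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.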