Gene Hore_20010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_20010 
Symbol 
ID7312816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2157082 
End bp2158410 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content40% 
IMG OID643612447 
ProductCysteine desulfurase 
Protein accessionYP_002509743 
Protein GI220932835 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000584034 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACTT ATAAAAAATG GAGAGGGCAG GTTGTTGGTG TTAACAAAAA GGTCCCCCTC 
AGTAATAACA GGCTTTCCAC CTATGTTAAT TTTGATAACG CGGCCACAAC CCCACCATTT
AAATCGGTAA TAGAGCGGAT TACTGAATTT GCAGGCTGGT ATTCCTCCAT TCACCGGGGA
AAAGGCTATA AATCAAGATT ATGTTCTACA ATTTATGAAG AGGCCCGCGG GGATATTCTT
AATTTCGTTA AGGCTGATCC CGATTATTAT ACCCCTATCT ATGTTAAAAA CACCACCGAA
GCAATTAATA AACTGGCCTA TACCCTGGGT CAGGATACTG ATAATAGAAA TATTATAATT
ACAACCTCAA TGGAACACCA TTCCAACGAT TTACCCTGGA GAAAATACTT TAAGGTAGAA
TACATAAAGC TGAACGAAAA GGGGCAACTA TCTCTTGATG ACCTTGAATC AAAACTAATA
AAACACCGGG GCAAGGTAAG ACTGGTTACC GTTGGTGGGG CCTCTAATGT AACCGGTTAT
CTCAACCCGA TTTATCAAAT TGCCGGCCTT GCCCATAAGT ACGGGACTGA AGTAATGATC
GATGGAGCCC AGTTGATTCC CCACCACCCT GTCGAAATGA GTCCAAAAAA AGCGGGAGAA
AGACTAGATT ACCTTGCCTT TTCAGGACAT AAAATGTATG CCCCTTTTGG AACGGGAGTC
CTTATAGCCC CTCAAAAAAC CTTTGCATCA AATACTCCAG ACCAGGTCGG TGGGGGAACA
GTTGATATAG TAACCCCTGA TTTTGTGAGG TGGCATACCC CCCCACATAA AGAAGAAGCT
GGCTCTCCCA ATTTAATGGG GATAGTAGCC CTGACTGAAG CCATTAAAAT TTTAAATGAA
TTCGGAATGG AGTCGATTTT GAATCACGAA AAACGGCTGA CTGATTATAC CCTGAAAAGA
TTAAATAAAA TACCTGATGT CATCCTTTAT GGAAATAAAT TTAATAGTAA GGATAGATTA
GGAATTATCC CTTTTAATAT TGACGGGTTA TCCCATGAAT CAATAGCCAC TATACTGGCC
GGTGAAGGTG GTATCGCCGT AAGAAATGGC TGCTTTTGTG CCCAGCCCTA TGTCCAGCAG
CTCCTCAATA TATCTGAACA GGAAATACGG GCCAGAATAA ACAATCCTGA CCTACCCCAT
CCCGGTCTGA TAAGAATTAG TTTTGGACTA TACAATACAT TTCAGGAGAT TGATAGGTTA
ATAGATATGG TTAAAGTAAT AGTCTCAAAT AAAGAGTATT ATTACAGAAA AACAAAAATT
AATTTTTAA
 
Protein sequence
MNTYKKWRGQ VVGVNKKVPL SNNRLSTYVN FDNAATTPPF KSVIERITEF AGWYSSIHRG 
KGYKSRLCST IYEEARGDIL NFVKADPDYY TPIYVKNTTE AINKLAYTLG QDTDNRNIII
TTSMEHHSND LPWRKYFKVE YIKLNEKGQL SLDDLESKLI KHRGKVRLVT VGGASNVTGY
LNPIYQIAGL AHKYGTEVMI DGAQLIPHHP VEMSPKKAGE RLDYLAFSGH KMYAPFGTGV
LIAPQKTFAS NTPDQVGGGT VDIVTPDFVR WHTPPHKEEA GSPNLMGIVA LTEAIKILNE
FGMESILNHE KRLTDYTLKR LNKIPDVILY GNKFNSKDRL GIIPFNIDGL SHESIATILA
GEGGIAVRNG CFCAQPYVQQ LLNISEQEIR ARINNPDLPH PGLIRISFGL YNTFQEIDRL
IDMVKVIVSN KEYYYRKTKI NF