Gene Hore_08880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_08880 
Symbol 
ID7314878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp956038 
End bp957216 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content41% 
IMG OID643611321 
Productaminotransferase class V 
Protein accessionYP_002508639 
Protein GI220931731 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value2.80371e-16 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATA TCTATTTAGA TAATGCTGCA ACCACACCGG TTGCACCTGA AGTTATTAAA 
GTTATGGAAC CCTATTTTAA TATATACTAC GGAAATCCCT CAAGTGTCCA CACCCCTGGG
CAGGATGCTG CCAGAGCTGT AAGTGAAGCC CGGGAGAAAG TTGCAGAATT AATTGGAGCA
AGAGATGAAA GGGAGATAAT TTTTACCAGT GGTGGTACTG AAGCCGATAA CCTGGCCATA
AAAGGAGTGG CTATGTCTTT ACAGGACAGG GGTAAACACA TTATAACCTC AAGTGTCGAA
CACCATGCCG TTCTTCATAC CTGTGAATAT CTGGAGAAAT ACCTTGGTTT TGATGTAACC
TATTTACCTG TTGATGAAAA AGGGTTTGTA GACCCTTCTA AGGTGGAAGA GGCTATAAGA
GAAGATACGA TTTTGATATC AATTATGTTG GCGAACAATG AGATTGGGAC CATTCAGCCA
GTTAAAGAGA TCGCTAAAAT AGCCAATGAA CATGATATTT ATTTCCATAC TGATGCTGTT
CAGGCTATCG GTCAGATACC GGTAGATGTA GAAAAACTGG GAGTTGATTT ATTATCTCTA
TCCGGTCATA AGTTTAATGG TCCTAAAGGG GTAGGAGCCC TGTATATAAG AAAGGGTGTT
AAATTAGCAC CCCAGATGTC CGGGGGTGCT CAGGAAAGGA GAAGGAGAGC TGGAACGGAG
AATGTTCCCG GCATTGTTGG TCTGGGTAAA GCTGCTGAAA TGGCTGCACA TAACCTGGAA
GAAAAACGTC TTAAGCTGAA AAAACTTCGT GATAAATTAA TAAACGGCAT TGAAAATGAA
ATTGATGAAG TATATTTAAA TGGTCCCCGG GGGGAAGATA GGCTTCCCAA TAATGTTAAT
TTCTGTTTTA GGTATATTGA AGGGGAATCG ATTCTATTAA ATCTCGATAT GATGGGGATT
GCGGGATCAA GTGGTTCTGC CTGTACTTCA GGTTCTCTGG ATCCTTCCCA TGTTTTACTG
GCTATAGGTA GGCCTCATGA GATTGCCCAT GGCTCTTTAA GATTAACCCT GGGATATAAC
AATACCGAAG AAGAAGTTGA TTATGTTCTT GAAGTATTAC CGGGGATTAT AAAAAAATTA
AGGGCTATGT CTCCATTGTT TGATTCAGCT TCTGAGTAA
 
Protein sequence
MKNIYLDNAA TTPVAPEVIK VMEPYFNIYY GNPSSVHTPG QDAARAVSEA REKVAELIGA 
RDEREIIFTS GGTEADNLAI KGVAMSLQDR GKHIITSSVE HHAVLHTCEY LEKYLGFDVT
YLPVDEKGFV DPSKVEEAIR EDTILISIML ANNEIGTIQP VKEIAKIANE HDIYFHTDAV
QAIGQIPVDV EKLGVDLLSL SGHKFNGPKG VGALYIRKGV KLAPQMSGGA QERRRRAGTE
NVPGIVGLGK AAEMAAHNLE EKRLKLKKLR DKLINGIENE IDEVYLNGPR GEDRLPNNVN
FCFRYIEGES ILLNLDMMGI AGSSGSACTS GSLDPSHVLL AIGRPHEIAH GSLRLTLGYN
NTEEEVDYVL EVLPGIIKKL RAMSPLFDSA SE