Gene Hore_04230 details

Gene Information

Locus tag: Hore_04230
Symbol:
ID: 7314098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 448226
End bp: 450055
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 11
GC content: 37%
IMG OID: 643610846
Product: Cna B domain protein
Protein accession: YP_002508176
Protein GI: 220931268
COG category:
COG ID:
TIGRFAM ID:
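
The coordinates and lengths listed above are mutually consistent, which can be checked with a minimal sketch. It assumes 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
print(gene_length(448226, 450055))  # 1830
print(protein_length(1830))         # 609
```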


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAGT TGTCAATTCT GGCGATTTTG TTGGTAGGGT TACTGGTCCT GGCTGGTTGT 
AGTAAAGATG CAAATGATGT CAATAGTGCT GTTAAACATA CTTTAACAGT AACTGTTACA
GATGAAGCCA CTCAAGCTGC AATTGAAGGA GCAACAGTAA CAGTTGATGA GGCAAGTAAA
ACTACAGATG CAAATGGTGT AGCTCAGTTT GAATTAACAG ATGGAACCTA TGATATAACT
GTAACAGCTG AGGGATATGA AAGTAAATCA GGTTCAGTTA CAATTGATGG GAAAGACTCC
ACTCTTGATG TAACATTAAC TGCTGGTACT GGTGGATCAA CAGATTTTGT CCTGATTACT
TCTAATAGTG GTGAGGAAAC TGATATATAT GTTGACTGGG ATTATGATAA TAATATAGGG
AATATTGCTG CAGATGCCTG GGGGTCAGGA ACAACTATAA CTCAAGATTC TAGTTATAAT
AATACACCTT GCTGGGAATT AACTACAGGA GATGGCTGGG GTACTGTATT AGCTTTTATG
GGAGATATCT ATAATGTAGA TCAAATTGCA GAGTTCCCGG TAGATTTAAC AGCTGACAGT
GTTATCAGCT TTTCTGTAGC GACTACAGGC GATTATGATG AATTGAGAGT AAAAGTAGTG
GGTGAAGAAA ACGAAAAAGA AATAGCCATA GATAGTTTTG ACAATACAAG TACTGACTGG
CAGACTGTAC AGGTTACTAC TGATCAGTTT TCAGAAGTAG TCCCGGCAAA TGTGACTCAA
ATAGCAATAA TTGCTTTTGG TGGAACAGCT GGAACTTCCA AAGTTTATGT AACAGATTAT
ACAATCAGTA ATGCAGAGGT TGTAATTCCA AAGCCACAAC CTGAAACTTA TACATTAACA
GTAACTGTAA CAGATGATAG TCAAGTTGCA ATTGAAGGTG CCACAGTAAC AGTTAACGGG
ACAAGTAAAA CAACAGATGC AAGTGGTGTG GCTACATTTG ATTTGCCAGA TGGAACTTAT
ACAGTAAATG TAAGTGCTGA TGGCTATGCT AATGGTTCAG GTTCAGTAAC TATTGATGGA
GCAGATAAGT CTGTAGATGT ATCATTAACA AGTCTTGAAG TCACTTCTAC TTTGACAGTA
AATGTTAAAA ATGCTGTCAG TGGAGCAGTA ATTGAAGGAG CAGCAGTAAC AGTTGATGGA
ACTGAAATTG TAACAGATGC AAATGGAGTA GCTCAGTTTG ATTTAGTAGA TGGGGATTAC
ACAATTAATG TAAGTGCGAA TGGTTATAAT ACTGTTTCTC AAGATGTAAC AATAGCTGGA
GCTGATAAAA CTGTAGATAT ATCTTTAACT ACAGATGCTC CGTCTTTATT TAATAGCAGC
TTTACAGTAT TTAATGAAGC TCCAATAACA GCTCTGCAAG CTAGTTATGA AGTATCAACT
ACAGAAATAA CAGAAGGTGA TAGTTCAATC AAGGTAACAT ATCCAGGAGA TAACTGGGGA
GGAATTTATG TAGAAATTTC TACTCCAGTA GATTTAAGTT CATATGATGG AGGTAATTTA
GTATTTGATA TTAAATTGCC CACTACTATA AAAGATATTG GTGTAAAGTT AGAAGGGCCA
AAAGGAACAG GAGTGCAGGT GCAACTCGCT AACTATACCG GAACTGATGC TGGAAATGGA
TGGATGACCT ATACAATTCC TTTATCAGAT TTAGGTATAG ACTTAACTCA AGTAGCAGTA
GGCTTTGGGT TATGGAATCC AACAGATGGA ACAGCACTTG ATGCCTGGGC AGGTGGAGAT
GTTTATATTG ATAATGTAAG GTTTGAATAA
 
Protein sequence
MKKLSILAIL LVGLLVLAGC SKDANDVNSA VKHTLTVTVT DEATQAAIEG ATVTVDEASK 
TTDANGVAQF ELTDGTYDIT VTAEGYESKS GSVTIDGKDS TLDVTLTAGT GGSTDFVLIT
SNSGEETDIY VDWDYDNNIG NIAADAWGSG TTITQDSSYN NTPCWELTTG DGWGTVLAFM
GDIYNVDQIA EFPVDLTADS VISFSVATTG DYDELRVKVV GEENEKEIAI DSFDNTSTDW
QTVQVTTDQF SEVVPANVTQ IAIIAFGGTA GTSKVYVTDY TISNAEVVIP KPQPETYTLT
VTVTDDSQVA IEGATVTVNG TSKTTDASGV ATFDLPDGTY TVNVSADGYA NGSGSVTIDG
ADKSVDVSLT SLEVTSTLTV NVKNAVSGAV IEGAAVTVDG TEIVTDANGV AQFDLVDGDY
TINVSANGYN TVSQDVTIAG ADKTVDISLT TDAPSLFNSS FTVFNEAPIT ALQASYEVST
TEITEGDSSI KVTYPGDNWG GIYVEISTPV DLSSYDGGNL VFDIKLPTTI KDIGVKLEGP
KGTGVQVQLA NYTGTDAGNG WMTYTIPLSD LGIDLTQVAV GFGLWNPTDG TALDAWAGGD
VYIDNVRFE
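
The gene and protein sequences above are printed in blocks of 10 characters. A minimal sketch for stripping that layout, translating the nucleotide sequence, and computing GC content follows. It assumes the standard codon assignments, which translation table 11 shares with table 1 for elongation codons; the helper names (`clean`, `translate`, `gc_fraction`) are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = clean("ATGAAGAAGT TGTCAATTCT GGCGATTTTG TTGGTAGGGT TACTGGTCCT GGCTGGTTGT")
print(translate(first_line))  # MKKLSILAILLVGLLVLAGC
```

Passing the full gene sequence through `clean` and `translate` should reproduce the protein sequence listed above, and `gc_fraction` can be compared against the GC content reported in the gene information.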