Gene Hore_20090 details


Gene Information

Locus tag: Hore_20090
Symbol:
ID: 7314333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2165814
End bp: 2166995
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 41%
IMG OID: 643612455
Product: permease
Protein accession: YP_002509751
Protein GI: 220932843
COG category: [R] General function prediction only
COG ID: [COG0701] Predicted permeases
TIGRFAM ID:
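
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of this record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions (not stated in the record itself):
    - coordinates are 1-based and inclusive on both ends
    - the last codon is a stop codon and is not counted in the protein
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(2165814, 2166995)
print(gene_len, protein_len)  # 1182 393
```

The result matches the Gene Length (1182 bp) and Protein Length (393 aa) fields.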


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATAAATT TGACCAGACT AAAAAAATTT ATATTATATG TCACCATATT TATGATGGCT 
TATTTTGTTC CATTTCATAA TTTAAAAATT CAGGAGGCAA TAGTCGAGTC CTTTTTAATG
CTACAGGAAT ATGCCAGGGA ACATGTTCTT TTTTGTTTGA TACCCGCATT TTTTATTGCC
GGTGCTATTG CCAATTTTAT TTCACAACAG GCCGTAATTA AATACTTCGG AAGCCAGGCT
AAAAAATGGG TTTCATATAC AGTAGCTTCT GTATCAGGTG CCATTCTGGC TGTCTGTTCC
TGTACTGTTT TACCCCTTTT TGCCGGTATT TATAAGAGGG GAGCCGGGAT TGGTCCAGCT
ACGGCCTTCC TGTATTCAGG TCCAGCCATT AATGTTCTGG CTATAATTCT GACAGCCAGG
ATACTGGGGT GGCAGATGGG GCTGGCCAGG GCTATTGGTG CAGTTATCTT TGCTCTGGTG
ATTGGCTTAC TGATGGCAGT AATCTTCCGT AAAGAAGATA AAGAGAGGCT TGAAGGAGTA
ATGGGTAAAA ATACTGAGGG TGTTGCGGGT AGAACTGGTC TTCAGAACTT AATATATTTT
ATGACCCTGG TCTTAATTTT AATTTTTGCC GCCTGGGGTA AACCCCAACA GGCAACAGGT
TTCTGGGTGA AAATATTTAA TATTAAATGG ATAATTACTA TTACTTTACT AATAATAATG
GTAATCATCT TAAAGAGCTG GTTTACTAAG GGAGAACTTA AAGACTGGAT AGATTCTACC
TGGGATTTTG CTGCCCAGAT ATTACCCCTG TTGTTTGCCG GGGTTCTAAT AGCCGGTTTT
TTAATGGGAC GTCCCGGTAC CGATGCTGGT ATTATTCCGC CAGACTGGGT TACCAGGTTT
GTGGGAGGTA ATTCTATTTT AGCAAATTTC ACGGCATCGA TTCTGGGAGC ATTTATGTAT
TTTGCTACCC TGACTGAAGT ACCTATTTTA CAGGGACTTC TTGGTCTGGG AATGGGTAAA
GGACCAGCCC TGGCCCTTTT ACTTGCCGGA CCGGCTTTAA GTCTACCCAA TATGCTTGTT
ATTCGCAGTG TTATGGGAAC TAAAAAGACA CTTGTTTTTA TAGGTTTAGT TGTAGCTATG
GCTACAATCA GTGGACTAAT TTATGGTGCC ATAGTAGTTT AA
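
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases; the helper name `gc_percent` is hypothetical.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string.

    Whitespace (the 10-base block spacing and line breaks used on
    this page) is removed before counting.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence: 2 G, 0 C out of 10 bases.
print(gc_percent("GTGATAAATT"))  # 20.0
```

Passing the full gene sequence, with or without the block spacing, gives the value for the whole CDS.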
 
Protein sequence
MINLTRLKKF ILYVTIFMMA YFVPFHNLKI QEAIVESFLM LQEYAREHVL FCLIPAFFIA 
GAIANFISQQ AVIKYFGSQA KKWVSYTVAS VSGAILAVCS CTVLPLFAGI YKRGAGIGPA
TAFLYSGPAI NVLAIILTAR ILGWQMGLAR AIGAVIFALV IGLLMAVIFR KEDKERLEGV
MGKNTEGVAG RTGLQNLIYF MTLVLILIFA AWGKPQQATG FWVKIFNIKW IITITLLIIM
VIILKSWFTK GELKDWIDST WDFAAQILPL LFAGVLIAGF LMGRPGTDAG IIPPDWVTRF
VGGNSILANF TASILGAFMY FATLTEVPIL QGLLGLGMGK GPALALLLAG PALSLPNMLV
IRSVMGTKKT LVFIGLVVAM ATISGLIYGA IVV
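
The protein sequence begins with M although the gene begins with GTG, which normally encodes valine. This is consistent with translation table 11, where GTG is one of several alternative start codons read as methionine at the first position. The sketch below illustrates this under stated assumptions: the codon-to-amino-acid mapping is the standard one (table 11 shares it), the start codon set is the one listed for NCBI table 11, and the name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Standard amino-acid string in TCAG codon order; table 11 uses the
# same codon assignments and differs only in its start codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a table 11 start codon as M."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE.get(codon, "X")  # X for ambiguous codons
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown on this page (60 bases).
first_line = "GTGATAAATT TGACCAGACT AAAAAAATTT ATATTATATG TCACCATATT TATGATGGCT"
print(translate_cds(first_line))  # MINLTRLKKFILYVTIFMMA
```

The output matches the first 20 residues of the protein sequence above (MINLTRLKKF ILYVTIFMMA).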