Gene Hore_19900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_19900 
Symbol 
ID7312805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2146115 
End bp2147362 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content42% 
IMG OID643612436 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002509732 
Protein GI220932824 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones65 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGAGGT TCCTTGTGGT AACTGTAGTA GTCCTGTCGG TTATGTTAAT AAGTGGTGCT 
GCCCTGGCGG AAGAGTTAAA TGTCCTTTAT ATGGCCCAGG CAGGATATCA GCCTGAAGAA
GTCAGGCAGA TGGCAGATGT TTTTGAAGAA ATTGTCGGGG TTGAGGTAAA TATTACTTTT
GTAAAGTATG ATGAAATGCA CGATAAAATT GTAACTTCAG CTGCTGTACC TGTCGGGACT
TATGATGTTG TACTGGAAGA TCTGATCTGG ACAGCTGAAT TTGCAGAATA TGGATTTGTA
GAGCCCATTG ATGACCGTGT TAATGATCGA ATTTTAAATG ATATACCTAA AGCTATCCTC
GATGCATTTC GCTATAATGG TAAGCTCTGG GCCATGCCCT ACCTGGCTAA CTTCCAGTTA
TTTTTCTACA ATGAAGACAT GATTAAAAAA GCAGGATTTG ATGGACCTCC CAAAACCCTG
GAAGAAATGG TTGAACAGAT GAGGGTTATG AAGGAAAAGG GTATTGTGGA GTATCCCTTG
GTTGATTCCT GGAACCAGAA AGAAGGTCTG GTCTGTGAGT ATGTCTGGTT AACCGGGGCT
TTTGGTGGAG ACACTTTTGA TGAAAATGGT AACCCCGTTT TTAACCGGGG ACCGGGACTT
GAGGCTCTTA AATTTATGAA GATGCTTCTG GATGAAGGAC TTGCTAATCC CCAGTCTTTA
ACACTTAATG AAAATATGGC TAAAGATGTC TTTATTGCCG GAGATGCTGC TTTTACTACC
AACTGGACCT TCCAGTATGG TGCCATGAAA GATCCTGAAC AGTCACAGGT AGTAGACTCA
GGTAAAATGG GACTGATTCC GGTGGCTGAA GATGTCCTCG GTAAGTATAA GTATAATACA
GCATCAGTAT CCGGATTCCA GGGAGCAGCT ATAATGGCTA ACTCTGAACA TAAGGATCTG
GCCTGGAAAT ATATCCGTTT TATTACCAGT CCTGTTGTTC AGCGTGGTTA CCTGGTAGAA
ATGCCTGTCT GGAAATCTGT CCAAAATAGT GCCTATGCCC AGTCTAACTT CCCGACCATC
AAGATAAAAG CTAAAGAAAT TGCCAGTGTT CATCACAGGC CTCGTGTTCC CAACTATCAG
GAGGTATCTT CCATATTACA GAGATATATT CACCAGTGCC TGGAAGGTAA ATATGAACCT
GAAGAAGCCC TTGATGCTGC TGTAAAGGAA ATTAAAAACC TGAAATAG
 
Protein sequence
MKRFLVVTVV VLSVMLISGA ALAEELNVLY MAQAGYQPEE VRQMADVFEE IVGVEVNITF 
VKYDEMHDKI VTSAAVPVGT YDVVLEDLIW TAEFAEYGFV EPIDDRVNDR ILNDIPKAIL
DAFRYNGKLW AMPYLANFQL FFYNEDMIKK AGFDGPPKTL EEMVEQMRVM KEKGIVEYPL
VDSWNQKEGL VCEYVWLTGA FGGDTFDENG NPVFNRGPGL EALKFMKMLL DEGLANPQSL
TLNENMAKDV FIAGDAAFTT NWTFQYGAMK DPEQSQVVDS GKMGLIPVAE DVLGKYKYNT
ASVSGFQGAA IMANSEHKDL AWKYIRFITS PVVQRGYLVE MPVWKSVQNS AYAQSNFPTI
KIKAKEIASV HHRPRVPNYQ EVSSILQRYI HQCLEGKYEP EEALDAAVKE IKNLK