Gene Hore_18440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_18440 
Symbol 
ID7313842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1969187 
End bp1970320 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content41% 
IMG OID643612291 
Productglycine betaine/L-proline ABC transporter, ATPase subunit 
Protein accessionYP_002509588 
Protein GI220932680 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1125] ABC-type proline/glycine betaine transport systems, ATPase components 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.00065366 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAGAC TGGAAGGTAT AAGTAAAGTT TATCCCAATA TGGACAGACC TGCTGTTAAG 
GAATTGAATT TACATATTGA GGAAGGGGAA ATATGTATTC TGGTTGGTCC CTCAGGATGT
GGTAAAACTA CTACTCTAAA AATTATTAAT CGTCTTATTG AACCTTCTTC AGGAAAAATA
TATATAAACG GGAAAGACGC CATGAAAGAA GACCCCAATG AATTAAGGCA AAATATTGGT
TATGTCATCC AGCAGATTGG CCTCTTTCCT CACATGACAG TTTATGAAAA TATTGCTACT
GTACCAAGAC TCCGGGACTG GGATGAAGGC CGGATCCGGA AAAGAGTTGA TGAGCTTTTA
GAGATGGTGG AACTCGATCC GGAAGAAAAC CGTTATAAAT ATCCCATGGA GTTATCCGGT
GGACAACGGC AGAGGGTCGG GGTAGCCCGG GCTATGGCGA TAGACCCACC TATTATGTTA
ATGGACGAGC CCTTTGGAGC AGTTGATCCC ATAACCCGGA CCCAGCTTCA GAACGAGTTT
TTAAAACTGC AGCGCAAGAT AAAAAAGACC ATAGTTTTTG TTACTCATGA TATAGATGAG
GCCATTAAAA TGGGAGATAA AATAGCTATT ATGAATCAGG GTGAACTTGT TCAGTTTGAC
ACCCCGGCCA ACATTTTATT CAACCCCGGG AATGAATTTG TCGAAGACTT TGTTGGTTCT
GACAGGGGCC TTAAGGTCCT TAATTTAATA CATGTTGACA AAATAATGAA TACCGGTGTT
CCTACCGTAG AGAGTGTTTC CCGGGCTGAG GATGTTTTAA AGGAGATTAA TAACCTGGAT
CAGGATTATA TCATGGTCAC CGGTGAAGAT GAACACCTGG CCGGTTATAT AAGCAGCAAT
AGATTGAAAA AACATCAGGA TTCCGACTGG TATAAATTTT TGAAACCGAC CCCGGTTGTC
GAAATAGAGG CAACTTTAAA GGATGCCCTG GCTAAAATGA TCGAAAATGA TGTGGCTGTA
GTCCCGGTTG TAAATGATGA GCGGGAACTG GTCGGAACTG TTACTTTAAA AGATATTAGG
TCTTATGTCA GCAATTCCTA TCAGGAAAAT GATTTAGTGT CAGTTAATAT ATAA
 
Protein sequence
MIRLEGISKV YPNMDRPAVK ELNLHIEEGE ICILVGPSGC GKTTTLKIIN RLIEPSSGKI 
YINGKDAMKE DPNELRQNIG YVIQQIGLFP HMTVYENIAT VPRLRDWDEG RIRKRVDELL
EMVELDPEEN RYKYPMELSG GQRQRVGVAR AMAIDPPIML MDEPFGAVDP ITRTQLQNEF
LKLQRKIKKT IVFVTHDIDE AIKMGDKIAI MNQGELVQFD TPANILFNPG NEFVEDFVGS
DRGLKVLNLI HVDKIMNTGV PTVESVSRAE DVLKEINNLD QDYIMVTGED EHLAGYISSN
RLKKHQDSDW YKFLKPTPVV EIEATLKDAL AKMIENDVAV VPVVNDEREL VGTVTLKDIR
SYVSNSYQEN DLVSVNI