Gene Hore_04940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_04940 
Symbol 
ID7314473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp536516 
End bp537496 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content41% 
IMG OID643610917 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_002508247 
Protein GI220931339 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.00804417 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCAGAAG CGGATAATGT ATTACTGAAA GTTAGAAACC TTAAAAAATA CTTCCCTGTA 
AAGGCCGGTG TTTTCAAAAA AACAGTAGGC TATGTTAAAG CAGTTGATGA TATCAGCTTT
GATATAAAAG AGGGCGAAAC ATTAGGTCTG GTTGGTGAAT CCGGCTGTGG TAAATCCACA
ACCGGACGGA CCATTTTACG CTTACTGGAG GCTACATCAG GGAGAGTAGA GTTTGAAGGT
AAGGATGTTC TTTCCCTGGA CAGGAAAGAA TTAAGGGAAA TGAGAAAAGA AATGCAAATT
ATTTTCCAGG ACCCTTATGC CTCTCTTAAC CCGAGGATGA AGGTTGCTGA CATTGTCGGT
GAGCCTTTAG ATATTCATAA TCTTGCCACA GATAAAAACG AGAAGTATGA AAAGGTTAAA
GAACTTTTAT CCAATGTAGG ATTAAATGAA GAACAGATGA ATCGTTACCC TCACGAATTC
AGTGGAGGAC AGAGACAGCG TATCGGGGTG GCCCGGGCAC TGGCTGTTAA CCCCAGGTTA
ATAATCGCTG ACGAGCCTGT ATCTGCCCTT GATGTTTCTA TTCAGGCCCA GGTTATTAAC
CTCCTGCAGG ATTTACAGGA ACAGTATGGG TTAACATACC TTTTTATAGC TCATGATTTA
AGTGTTGTTA AACACATAAG TGACAGGGTA GCTGTTATGT ATTTAGGTAA AATAGTGGAG
CTAGCCAATA AAAAAGACAT TTATGATAAT CCGTTACACC CCTATACTCA GTCATTACTC
TCGGCAATAC CAATTCCTGA TCCAAATTAT GATAAGAAAA GGATAGTTCT AAAAGGTGAT
GTGCCCAGTC CGGTAGATCC ACCTTCCGGT TGCCGCTTCC ATCCACGTTG TCCAAAAGCC
ATGGACATCT GTAGTCAGGT TGAACCTGAA TTTAAAGATT ACGGTAACGG GCATTTTGCT
GCCTGTCACC TTTTAGATTA A
 
Protein sequence
MAEADNVLLK VRNLKKYFPV KAGVFKKTVG YVKAVDDISF DIKEGETLGL VGESGCGKST 
TGRTILRLLE ATSGRVEFEG KDVLSLDRKE LREMRKEMQI IFQDPYASLN PRMKVADIVG
EPLDIHNLAT DKNEKYEKVK ELLSNVGLNE EQMNRYPHEF SGGQRQRIGV ARALAVNPRL
IIADEPVSAL DVSIQAQVIN LLQDLQEQYG LTYLFIAHDL SVVKHISDRV AVMYLGKIVE
LANKKDIYDN PLHPYTQSLL SAIPIPDPNY DKKRIVLKGD VPSPVDPPSG CRFHPRCPKA
MDICSQVEPE FKDYGNGHFA ACHLLD