Gene Hore_04770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_04770 
Symbol 
ID7314456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp509880 
End bp511154 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content44% 
IMG OID643610900 
Productalpha/beta hydrolase fold protein 
Protein accessionYP_002508230 
Protein GI220931322 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00000317728 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACTTT TAATTTTTTT ACTCTGCATA TCCCTGGTAT TCACCCACCA GGCTACAGCC 
CAGGAGACCA TCTCAGGAAC CTGGAAGGGA GCTATTAATG TCAAGGGGCA GACACTGGAC
ATTACCATTC ATATTAAACC TGATAATAAT GGTGGTTACC TGGCTACCAT TGATATTCCC
GCCCAGGGGG TTAAAAACTA TGCCCTGAAG AATGTAAAAT ATAATCACCC TGACCTCTAT
ATGGAGCTAC CAGCTAATAT AACCGGTTTC TTCAATGGTA AAGTCAGTGG AGCCAGAATT
AAAGGAAAAT ATACCCAGGG TTCAGCCCGG GGAACCTTTT ATCTAAATAA AAAGACAACC
GATGAATCCG AAACCAGATC AAAAGATAAT GATACCGGGA CTGAACCCAT TAGCCTGAAA
ACAGAGACCG GTACCATATA TGGTACCCTC CAGTTACCCC ATTCTAACAA AAAATCCCCG
GTAATACTTA TAATTGCCGG TTCTGGAATA ACAGACCGGA ATGGTAATTC ACCTGGTGCT
ACCAATAACT GCCTTAAGAT GCTAAGCCAG GACCTGGCCA GGGCCGGTTT TGCTTCTGTC
AGGTATGATA AAAGGGGAAC CGGTCAGAGC AAAGGAGCCA TTAATAGCCC TTCTGACATC
AGGTTTGAAC ACTTTATAAA TGACGCGACT GGCTGGGTTA AAAAATTGAA GAAAGATAAA
AGATTTACCG GAGTAACTGT CTTAGGACTC AGCCAGGGGT CCCTGGTGGG AATGATCGCC
GCCCGCCGCG CCGAGGCCGA TGCCTTTATA TCTCTGGCCG GAGCCGGTCG TTCCATTGAT
AAGGTCTTAA AATATCAACT AATAAGCCTT AATGATGATC TATACCAGGA AGCCCTGGAT
ATTCTGGATA AACTGAAACA GGGGCAGACG GTAAGCCAGG TCAACCAGAA ACTCTATTCT
ATCTTTCATC CCTTAAACCA GCCTTTTCTT ATCTCCTATA TCAAATATGA CCCGGCTGAA
GAGATAGCTA AACTTGATAT CCCGGTCCTT TTGATTCACG GAACAAATGA TATCCAGGTC
AAAAAGGAAG AAGCTAATAT TCTTAAAAAA GCCTATCCAG AAGCAAAATT GGTCCTCATC
GAGGGAATGA ACCATGTCCT GAAAAAAGCA CCGGAAGACC CCAGGCAAAA CTACATGACC
TACAACAACC CTGATCTACC TCTGGCTGAT AACCTGGTCG AGAGTATTGT TAAGTTTCTT
GAAAAGGTAT ATTAA
 
Protein sequence
MSLLIFLLCI SLVFTHQATA QETISGTWKG AINVKGQTLD ITIHIKPDNN GGYLATIDIP 
AQGVKNYALK NVKYNHPDLY MELPANITGF FNGKVSGARI KGKYTQGSAR GTFYLNKKTT
DESETRSKDN DTGTEPISLK TETGTIYGTL QLPHSNKKSP VILIIAGSGI TDRNGNSPGA
TNNCLKMLSQ DLARAGFASV RYDKRGTGQS KGAINSPSDI RFEHFINDAT GWVKKLKKDK
RFTGVTVLGL SQGSLVGMIA ARRAEADAFI SLAGAGRSID KVLKYQLISL NDDLYQEALD
ILDKLKQGQT VSQVNQKLYS IFHPLNQPFL ISYIKYDPAE EIAKLDIPVL LIHGTNDIQV
KKEEANILKK AYPEAKLVLI EGMNHVLKKA PEDPRQNYMT YNNPDLPLAD NLVESIVKFL
EKVY