Gene Hore_18140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_18140 
Symbol 
ID7313812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1935537 
End bp1936931 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content45% 
IMG OID643612261 
Productputative aminopeptidase 1 
Protein accessionYP_002509558 
Protein GI220932650 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1362] Aspartyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000477052 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAGA AATTTGAAAT TAAGAATGCC TGGCTGGAAA TGGCCGGGGC CGAGAAAGAA 
GAAATAAATC AATTTAATAA AGATTATGCA GGATTCATTA CAGTTAATAA GACAGAGCGG
GAGTTTGTTA AGGCTGCTGT AGAACTACTG GAAAAGGAAG GATTTAAAAA TTTAGAAGAG
GTTCAAAAAC TAAGAGAAGG TGACCGGGTA TATAAGGTTA ACCGGGACCG GGCCATTGCT
GCTGCTGTAA TCGGTCAGAG ACCCTTGACC GATGGGTTGC GGATTGTTGG GGCCCACCTC
GATTCACCCC GGATTGATTT AAAGCCCAAT CCAGTGTATG AGGATAGTGA GCTAGTTTTC
TTCAACACTC ATTATTACGG GGGGATAAAA AAATATCAGT GGGTTTCAAT TCCCCTGGCC
CTTCATGGGG TTGTAGTCAA AGAGGATGGT ACCAGAGTTG ATGTTGTTAT CGGTGAAAAT
GAATCTGACC CAATTTTTTA CATAAGTGAT TTATTACCTC ATTTAAGTAA AGACCAGATG
AAGAAGAAGA TGACAGAAGG TATAACCGGG GAACAGCTGG ATGTCCTCAT CGGGAGTGTA
CCTGCCGGGG AAGAAGAAGA TAGTGACAAG GTCAAGGTAA AATCAGCTGT GCTTGATTTA
TTAAATGAGA AATATGGAAT TACCGAGGAG GATCTGGTTA GTGCTGACCT GAAGTTGGTA
CCCGCTTATA GAGCGAGAGA CCTCGGCTTT GACCGGGGGC TTCTGGCCGG ATACGGACAT
GATGATCGGG TCTGTAGCTA TACAGCTTTG CGCGGATTAA TAGATCTGGA AACACCAGAA
TATACCGGGG TTGCTCTTCT GGTTGATAAA GAAGAAATTG GAAGTATGGG GGCTACCGGA
ATGCAGTCCA GGTTTTTCGA GAACTGCCTG GCAGAGATGA TTGATCTAAC CGGTGAAGAT
TATAGTGATC TGGCTTTGAG AAAGGCCCTG GAAAATTCAT GGGCCCTGTC GGCTGATGTC
AATGCGGCCT TTGACCCCAA CTTCTCTGAT GTTTTTGATA AGGACAACTC TTCCTATCTG
GGTAAGGGTG TCGTCTTAAC CAAATATACC GGGGCCCGGG GTAAGTATAG TTCTTCTGAA
GCCACTGCTG AATTTGTCGG CAGGGTTAGA GCTCTATTCA ATAATGGTGG TGTACCCTGG
CAGATTGGTG AACTCGGCAA GGTTGACCAG GGGGGCGGGG GTACTATCGC CCAGTTCCTG
GCCAATTATA ATATGGATGT TGTTGACTGT GGCCCTGCGG TTTTATCTAT GCACTCTCCT
TTTGAAGTTG TAAGCAAGGT TGATGTTTAC AGTTCTTATT TAGCATACAG TGTTTTCTTT
GGTAGTCAGG GTTAA
 
Protein sequence
MGKKFEIKNA WLEMAGAEKE EINQFNKDYA GFITVNKTER EFVKAAVELL EKEGFKNLEE 
VQKLREGDRV YKVNRDRAIA AAVIGQRPLT DGLRIVGAHL DSPRIDLKPN PVYEDSELVF
FNTHYYGGIK KYQWVSIPLA LHGVVVKEDG TRVDVVIGEN ESDPIFYISD LLPHLSKDQM
KKKMTEGITG EQLDVLIGSV PAGEEEDSDK VKVKSAVLDL LNEKYGITEE DLVSADLKLV
PAYRARDLGF DRGLLAGYGH DDRVCSYTAL RGLIDLETPE YTGVALLVDK EEIGSMGATG
MQSRFFENCL AEMIDLTGED YSDLALRKAL ENSWALSADV NAAFDPNFSD VFDKDNSSYL
GKGVVLTKYT GARGKYSSSE ATAEFVGRVR ALFNNGGVPW QIGELGKVDQ GGGGTIAQFL
ANYNMDVVDC GPAVLSMHSP FEVVSKVDVY SSYLAYSVFF GSQG