Gene Hore_20650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_20650 
Symbol 
ID7314389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2233951 
End bp2235036 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content40% 
IMG OID643612509 
Productpeptidase M24 
Protein accessionYP_002509805 
Protein GI220932897 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones72 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGGAAA CCCTGGTAAA GAAGGAAAAA ATAGATAAGT TGATGAAGGA GAAAGGGCTT 
GATGGGGTTG TTTTGACCAG TCATAGTAAT ATTACCTGGT TAACAGGTAT CGATAATAGA
ATTGTTTTTG CCAGTGATGA AGGTGCTGTG AAGCTTATTA TTTTTAAAGA CAGAATTGAA
GTTGTTACCA ATAATATTGA GGCCGGGAGA ATCCGGGAAG AAGAAGGCCT GGACCAGGAT
TATTATAAAT ATATTGTAGA TGACTGGTAT AGGGCTGACA ATTACCTTAA AGTCCTTATA
GATAAATATA ATTTGGGGAG TGATATTCTT ATTCCAGGTG TCCTTGATGT TGGTATGGAA
ATTAAAAGAC TGAGGTTTTC TTTATTACCT CAAGAAATGG AGAGGTATCG CCAACTGGGT
AAAGAGGTGG GCAAAATTAT GTCAGATACC TGTCACCATA TTGAAACGGG TAAGACCGAG
AATGAAATCA GGGCCCAGCT TGCTTCTAAA CTATGGGCTC ATAATATAAA TCCCCTTTTA
ATCCTGGTCG GTTCTGATGA ACGTATTTAT AATTACCGCC ACCCCATTCC AAAAGATAAA
AAAATAGATA AGTATGTTAT GGTAGTGACC TGTGCTGAGA GAGATGGTTT GATTGTAAAT
TTAACCCGGT TTGTCCACTT TGGAAACCTC CCTGATGAAT TAAAAAGAAA GTTGGAGGCT
GTGGTCAGGG TAGATGCCAG CTTTATACTC AACACCCGGG TTGGCAGTAA AATTTCTGAT
ATTTTCAGTA AAGCTATTGC TGTTTATGAA AATGAAGGTT ACCCTGGGGA ATGGCAATAC
CACCATCAGG GAGGGGCTAC AGGCTATGAA ACGAGGGATT ATATAGCAAC ACCGGATTTA
GAGGAGGTTG TCTGTCCCAA TCAGGCCTTT GCCTGGAACC CCTCGATAAA GGGTGTTAAG
AGTGAAGATA CTATTTTAGT TACAGAGGAA GGTTTTGAAA TCCTTACTGA GGACCCTGAC
TGGCCAGGGA TAGAAGTCCA ATACCAGGGA CAAAAAATAA AAAGGCCGGG GATTTTGGTA
AAATGA
 
Protein sequence
MEETLVKKEK IDKLMKEKGL DGVVLTSHSN ITWLTGIDNR IVFASDEGAV KLIIFKDRIE 
VVTNNIEAGR IREEEGLDQD YYKYIVDDWY RADNYLKVLI DKYNLGSDIL IPGVLDVGME
IKRLRFSLLP QEMERYRQLG KEVGKIMSDT CHHIETGKTE NEIRAQLASK LWAHNINPLL
ILVGSDERIY NYRHPIPKDK KIDKYVMVVT CAERDGLIVN LTRFVHFGNL PDELKRKLEA
VVRVDASFIL NTRVGSKISD IFSKAIAVYE NEGYPGEWQY HHQGGATGYE TRDYIATPDL
EEVVCPNQAF AWNPSIKGVK SEDTILVTEE GFEILTEDPD WPGIEVQYQG QKIKRPGILV
K