Gene Hore_04150 details

Gene Information

Locus tag: Hore_04150
Symbol:
ID: 7314090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 431108
End bp: 432628
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 39%
IMG OID: 643610838
Product: Carboxypeptidase Taq
Protein accession: YP_002508168
Protein GI: 220931260
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2317] Zn-dependent carboxypeptidase
TIGRFAM ID:
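
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_length):
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(431108, 432628)
print(length, protein_length_aa(length))  # 1521 506
```

Under these assumptions the result matches the listed gene length of 1521 bp and protein length of 506 aa.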


Plasmid Coverage information

Num covering plasmid clones: 72
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTCAATA TCAAACAGTC AAATACTGAA AAATCGTTTC TGGAGTTTGT TCAAAAGATT 
AAGGCCTACG ATAGTGCGCA GTCTCTCCTC TACTGGGACA TGGTTACCGG AATGCCGGAA
AAAGGTGTTT CTAAGAGGGC TTCTATAATC AGTCTCCTCT CAACAGAAGT CTTTAAAATG
TCTACCTCTG ATAAAATGAA GGAATACCTT GATAACTTTT CCCGGCAAGA GGTAAATAAA
GACCTTGATC CTGTAATGAA GGGAATAGTA AGGGAATGCA AAAAAAATTA TGACAGGTTC
AAAAAAATAC CGGAAGATAA ATACCGTGAC TTTGTCAGGT TAAAATCAGA AGCTGAATCT
ATCTGGAAAA AAGCTAAACA AAATGACGAT TTCAATCTTT TCCGGCCATA CCTTGAAAAA
ATAGTTGACT ACCTCAATGA ATTTATCGAT ATCTGGGGGT ATGAGGGGAA TAAATATAAT
ACACTACTCG ATCATTATGA ACCAGGGGTT ACTGTGGAGA AGCTGGATGA TATTTTTACC
GATCTCAAGG CCAGTATTGT CCCCTTACTT AAAAGGGTTA AGGATGCTCA AGATAAACCG
GATGATTCCT TCCTGAAAGA ATATTATGAC CCTGCAACCC AGGAAAAACT ATGTGAGCTA
CTTTTAGAAG AAATCGGTTA TGATTTTAAA GCCGGCAGAC TGGATGAAAG TGAGCACCCC
TTTACCATTG GTATTAATAG TGGAGATGTC AGGGTAACAA CCCATTATTA CCCCCACAAT
TTAACCAGCG CCCTGTTCAG TTCACTCCAC GAAGGGGGCC ATGCTATATA TGACCAGAAT
ATTGATCCTG AACTCGATGA AACCCCATTA CATGATGGGG CCTCTATGGG TATCCATGAG
TCCCAGTCCA GGTTCTGGGA AAACATTATT GGCAGGAGCT ATAACTTCTG GAAGTCTTAT
TATGGAAAGG TTCGGAAGCT CTTCCCTGAA CAGCTTAATG ATATATCCCT AGATGAATTT
TACCGGGCTA TAAATAAAGT TGAGCCATCA ATGATCAGAG TAGAAGCCGA TGAACTTACC
TATAACCTTC ATATCATGGT CAGGTATGAA ATAGAAAAGG CTTTAATAAA CCGGGAGCTT
GAGGTTGCTG AACTCCCTGA AGTCTGGAAT CAAAAAATGA AAGAATACCT GGGCATCGAA
CCAGAAAATG ATAAGGAAGG GGTTCTCCAG GATGTCCACT GGTCAAATGC CCTGTTTGGT
TATTTCCCCT CCTATGCCCT GGGGAATATC TATGCAGCCC AGTTTTATAA CACTATTAAG
AAAGAAATTA ATAATTATGA TGAACTGATA AGTAAGGGGC ATTTCCAGCC CATCAAGGAA
TGGCTCGGTG ACAAGATACA TAAATACGGT AAACTCCTTA CTCCAACAGA AATAATTAAA
AAGGTCACCG GAGAAGAAAT TAATTCCAGG TACCTGATTA AATACCTTGA AAATAAATAT
AGTAAGATAT ATAAACTATA G
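
The GC content field under Gene Information is the share of G and C bases in a sequence like the one above. A minimal sketch of that calculation follows, assuming the sequence is pasted with its 10-base spacing and line breaks intact; `gc_percent` is a hypothetical helper name, and how the database rounds the value is not stated in this record.

```python
def gc_percent(sequence):
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First 10-base block of the gene sequence: 2 G + 1 C out of 10 bases.
print(gc_percent("GTGTTCAATA"))  # 30.0
```

Passing the full gene sequence to the same function gives the whole-gene figure.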
 
Protein sequence
MFNIKQSNTE KSFLEFVQKI KAYDSAQSLL YWDMVTGMPE KGVSKRASII SLLSTEVFKM 
STSDKMKEYL DNFSRQEVNK DLDPVMKGIV RECKKNYDRF KKIPEDKYRD FVRLKSEAES
IWKKAKQNDD FNLFRPYLEK IVDYLNEFID IWGYEGNKYN TLLDHYEPGV TVEKLDDIFT
DLKASIVPLL KRVKDAQDKP DDSFLKEYYD PATQEKLCEL LLEEIGYDFK AGRLDESEHP
FTIGINSGDV RVTTHYYPHN LTSALFSSLH EGGHAIYDQN IDPELDETPL HDGASMGIHE
SQSRFWENII GRSYNFWKSY YGKVRKLFPE QLNDISLDEF YRAINKVEPS MIRVEADELT
YNLHIMVRYE IEKALINREL EVAELPEVWN QKMKEYLGIE PENDKEGVLQ DVHWSNALFG
YFPSYALGNI YAAQFYNTIK KEINNYDELI SKGHFQPIKE WLGDKIHKYG KLLTPTEIIK
KVTGEEINSR YLIKYLENKY SKIYKL
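
The protein sequence follows from the gene sequence under translation table 11, which uses the standard codon assignments but allows alternative start codons. That is why the gene begins with GTG while the protein begins with M. The sketch below illustrates the idea in plain Python. `translate_cds` and `COMMON_STARTS` are hypothetical names, and the start-codon set is a simplification that covers only the most common initiators rather than the full list in table 11.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplifying assumption: only the most common bacterial start codons.
COMMON_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate an unspliced CDS; a recognized start codon is read as M."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in COMMON_STARTS:
        residues[0] = "M"  # initiator codon is read as methionine
    protein = "".join(residues)
    # Drop a single terminal stop codon if present.
    return protein[:-1] if protein.endswith("*") else protein


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGTTCAATA TCAAACAGTC AAATACTGAA"))  # MFNIKQSNTE
```

The output matches the first ten residues of the protein sequence listed above. A library such as Biopython provides a fuller implementation of the same translation tables.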