Gene Hhal_1185 details

Gene Information

Locus tag: Hhal_1185
Symbol:
ID: 4709244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1288520
End bp: 1289878
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 71%
IMG OID: 639855658
Product: peptidase M24
Protein accession: YP_001002762
Protein GI: 121997975
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
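
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes a single terminal stop codon. Neither convention is stated in the record, and the variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1288520
end_bp = 1289878

# Assumption: coordinates are 1-based and inclusive, so both end
# positions count toward the length.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted
# in the protein length.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, "bp")
print(protein_length_aa, "aa")
```

Under these assumptions the result agrees with the listed values of 1359 bp and 452 aa.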


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGAGA CCATCGATTT CACCCAGCTT GCCTGCGAGC AGCGGGAGAC CCTGGCTCGG 
CGCATCGGTG AGAGCGCGGT GGTGGTGGTC CCAGCGGCGC GGGAACAGCC CCGCAACCGC
GATGTGGACC ACCCCTTCCG CCAGGACAGC GACTTCCGCT ACCTCACCGC CTTCCCCGAA
CCCGACGCGG TGGCGGTGCT CGCCCCCGGC CGGCCCGAGG GCGAGTATGT GCTGTTCGTC
CGTGAGCGCG ACCCGGAGGC GGAACGGTGG GCCGGGGCGC GCACCGGCCC CGAGGCCGCC
TGCCAGGCCT ACGGCGCCGA TCAGGCCTGG CCGCTGGGAG AGCTCGATCA GCGACTGCCC
GACCTGCTCG TCGGCCGGGA ACGGATGATC GCGCCGCTGG GCCGCGACGA GCACTGGGAC
CGCCAGCTCC TGCAGTGGCT GCAGGCCGGG CGGGCGCGAG CCCGGGGCCA GGCCGTCGCC
CCGGACCGCA TCGAGCTGCT CGACCGCAAC ATCCACGAGC AGCGGCTGAT CAAGCGCCCC
GCCGAGCTCG AAGCGATGCG CCGGGCAGCC GGCATCTCGG TGGCGGCGCA TCGGCGCGCC
ATGCAGGCCG TCCAGTCGGG GATGCCCGAG TACGCGCTGG CTGCCGAGCT GCTCGGCATC
TTCCACCGAC ACGGCGGCGA GGCCGCCTAT CCGAGCATCG TCGCTGGGGG CGCCAACGCC
TGCGTACTTC ATTACGTCAC CCTGCGCAAC ACACTGCACG AGGGCGACCT GGTCCTCATT
GACGCCGGCG CCGAGGTGGA CGGCTACGCC GCCGATATCA CCCGCACGTT CCCGGTCAGC
GGGGTCTTCA GCGCCGAGCA GCGAGCCGTC TACGACGTGG TCCTCGAGGC CCAGGAGGCA
GCCATCGGGC AAGTGTGCAG CGGCAACGAC TTCGACGCCT TCCACCGCAC CGCCACGCGC
ATCCTCACCC AGGGCATGGT GGATCTCGGC TGGCTCCGGG GCGAGGTGGA CGGACTGATC
GAGCAGGGCG CCCACCGGCG CTTCTTCCCC CACCGCACCG GTCACTGGCT GGGACTGGAC
GTACACGACG TCGGCAGCTA TGCAGTAGAG GGAGCGTGGC GCGTCCTCCA GCCTGGCATG
GTGGTGACCG TCGAGCCGGG GCTCTACTGC CCGCCGGGCA GCGAGGAGGT GGATCCACGC
TGGCACGGGA TCGGCGTTCG CATCGAGGAC GACGTGGTTG TCGAGCGGGA GACCCCGCGC
ATCCTCACCA GCGGGGTGCC GAAGACCCCC GAGGCCATTG AGGATCTGAT GGGCGCCGTG
CGCGGCGCAG GCTACGAGGA AAGTGGAGAC TTCGACTGA
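
A GC fraction can be computed from a sequence written in the 10-base block layout used above. This is a minimal sketch: the function name `gc_fraction` is hypothetical, and the record does not say whether its GC content figure was computed over exactly this gene span, so the result is not guaranteed to reproduce it.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C among the A/C/G/T characters.

    Spaces and line breaks from the block layout are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# Example on the first 10-base block of the gene sequence.
first_block = "ATGAACGAGA"
print(f"{gc_fraction(first_block):.0%}")  # prints 40%
```

Passing the full gene sequence, pasted as a multi-line string, gives the gene-level value.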
 
Protein sequence
MNETIDFTQL ACEQRETLAR RIGESAVVVV PAAREQPRNR DVDHPFRQDS DFRYLTAFPE 
PDAVAVLAPG RPEGEYVLFV RERDPEAERW AGARTGPEAA CQAYGADQAW PLGELDQRLP
DLLVGRERMI APLGRDEHWD RQLLQWLQAG RARARGQAVA PDRIELLDRN IHEQRLIKRP
AELEAMRRAA GISVAAHRRA MQAVQSGMPE YALAAELLGI FHRHGGEAAY PSIVAGGANA
CVLHYVTLRN TLHEGDLVLI DAGAEVDGYA ADITRTFPVS GVFSAEQRAV YDVVLEAQEA
AIGQVCSGND FDAFHRTATR ILTQGMVDLG WLRGEVDGLI EQGAHRRFFP HRTGHWLGLD
VHDVGSYAVE GAWRVLQPGM VVTVEPGLYC PPGSEEVDPR WHGIGVRIED DVVVERETPR
ILTSGVPKTP EAIEDLMGAV RGAGYEESGD FD
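
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical, and alternative start codons are not handled, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, with the first base
# varying slowest, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace from the block layout is removed first. A codon with
    characters other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.upper().split())
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last three codons of the gene sequence.
print(translate("ATGAACGAGA CCATCGATTT C"))  # prints MNETIDF
print(translate("TTCGACTGA"))                # prints FD
```

The two outputs match the start (MNETIDF) and end (FD) of the protein sequence listed above.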