Gene Hhal_1500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1500 
Symbol 
ID4709311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1621043 
End bp1623079 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content69% 
IMG OID639855967 
Productpeptidase S15 
Protein accessionYP_001003069 
Protein GI121998282 
COG category[R] General function prediction only 
COG ID[COG2936] Predicted acyl esterases 
TIGRFAM ID[TIGR00976] putative hydrolase, CocE/NonD family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATCG TCGAGGACTT CCCAGAGGCG ATCCGGGTGA TCGAGAACCT ATGGATCCCG 
ATGCGCGACG GGATCCGCCT GGCGGCCCGC GTATGGCTGC CCGAAGGGGC CGAACAGACG
CCGGTACCGG CGATCCTCGA GTACATGCCG TACCGCAAGC GCGATTTCAC CCGACTGCGC
GATGAGCCGC TGCACCACTA CTTCGCTGGG CACGGCTACG CCTCGATACG GCTGGATCTG
CGCGGCACCG GCGACTCCGA AGGCGTGGTG CTGGATGAGT ACCTGGCGCA GGAGCAGGAC
GATGCGGTGG ACGCCATCGC CTGGATCCGC GAGCAGCCCT GGTGCGACGG CGGCGTCGGC
ATGATGGGCC TCTCCTGGGG CGGTTTCAAC GCCTTGCAGG TCGCCGCACG ACGCCCGGCC
GGGCTCCGGG CCATCCTGAC CCTGTGCTCC ACCGACGACC GGTACGCCGA CGACGCCCAC
TACAAGGGCG GCTGCCTGCT CAATGAGAAC CTGACCTGGG GATCGAGCTT CTTCAGCCTG
GCAGCCTGGC CCCCGGACCC GGCCATTGCC GGCGCCGACT GGCGGGCCAT ATGGCAGCAA
AGGCTCGATC ACCTGCGCCT GTTCCCGGCG GTGTGGATGC GACACCCGCA CCGCGATGAC
TACTGGCGCC ACGGCTCGGT CTGTGAGGAC TACGCTGCCA TCGAGTGCGC CGTCTATGCG
GTGGGCGGCT GGTCCGACGG CTACGTGAAC GCCATCCCGC GCCTGATGGC CGGTTTGACC
TGCCCGCGCA AGGGGCTGAT CGGCCCCTGG CCCCACGCTT TCCCCCACGC CGCGGTGCCC
GGGCCGCGCA TCGGCATGTT CCAGGAGGCC GTGCGCTGGT GGGACCACTG GCTCAAGGAC
GCAGAGACCG GGATCATGGA TGAACCCATG CTCCGAGTCT GGTTGGAGGA GTGGGTTCGC
CCTTCGCCGC ACCTCGCCTA CCGGCCCGGG CGCTGGGTCG CCGAGGCGCA GTGGCCCTCC
CGGCGCATCC GACCGCGGCG TTGGCACCTG AACGTCGGCG CCCTGGGAGA GGCCCCCGAT
ACCCCCGAGG ACGCGGTGCG CGTGCGCTCG CCGCAGACCA CCGGACTCAA GGGCGGCGAG
TTCTACGGCT ACGGCGCCGA GGGCGAAGGG CCGCTGGATC AGCGCGAGGA CGACGGCAAA
TCCCTGGTCT TCGACTCCGA TCCGCTATCC CAGCGGCTAG AGATCCTCGG CAGCCCCGAG
GTCAACCTGG CGTTCACCGT GGACCGCCCG GTGGCGATGG CCGTCGTGCG TCTCAACGAC
GTGGCGCCGG ACGGCACGTC CGCCCGGGTG ACCTACGGTG CGCTCAATCT CTGCCACCGG
GAAGGCCACG CCAACCCCCG CCCCCTCCAG CCGGGCACAC GGTACCGGGT ACGCATCCCG
CTGAACGACA TCGCCTACGC CTTCCCCGCT GGCCACACGG TCCGGATCAG CATCTCAACC
GCTTACTGGC CGCTGATCTG GCCGTCGCCC GAGTCCGTCG CCCTGACCGT CCATACCGGC
GCCAGCACCC TGACACTCCC AGAACGGCCG CCCGACCCGG CAGACGAGGC ACTGCCCCCG
TTCGCACCGC CCGAACGCGG GCCGGGGGCA GAACCCACGG CCCTGGAGTG GGTCGATCCC
CACCGTACGG TGGCCCAAGA CCTGACCACC AACGAGACGA TCTACACCAC CTTTGGCGAC
GCCGCCGACC TGGAGGGGGC CGCCCTCGCC CGGCTCGAGG AGATCGACCT GGCCGTGGGC
CACACCATCC GCAGGTCCTT CCGCATCAAC GAGTCCGACC CCCTTTCGGC TCAGGCCCTG
ATCGAGCTGG ACGCCCGCCT GCACCGCACG GGCTGGGACG TGCGCATCGA ATGCCAGACC
CGTATGAGCG CAACTGTCGA GCACTTTCGG GTCAGCGCTG AGCTGGAGGT GTTCGAGAAT
GGGGAACGCA TCTTCCACCG CCACTACGAT GAATGGATCC CGCGGCAGCT GCTTTAG
 
Protein sequence
MRIVEDFPEA IRVIENLWIP MRDGIRLAAR VWLPEGAEQT PVPAILEYMP YRKRDFTRLR 
DEPLHHYFAG HGYASIRLDL RGTGDSEGVV LDEYLAQEQD DAVDAIAWIR EQPWCDGGVG
MMGLSWGGFN ALQVAARRPA GLRAILTLCS TDDRYADDAH YKGGCLLNEN LTWGSSFFSL
AAWPPDPAIA GADWRAIWQQ RLDHLRLFPA VWMRHPHRDD YWRHGSVCED YAAIECAVYA
VGGWSDGYVN AIPRLMAGLT CPRKGLIGPW PHAFPHAAVP GPRIGMFQEA VRWWDHWLKD
AETGIMDEPM LRVWLEEWVR PSPHLAYRPG RWVAEAQWPS RRIRPRRWHL NVGALGEAPD
TPEDAVRVRS PQTTGLKGGE FYGYGAEGEG PLDQREDDGK SLVFDSDPLS QRLEILGSPE
VNLAFTVDRP VAMAVVRLND VAPDGTSARV TYGALNLCHR EGHANPRPLQ PGTRYRVRIP
LNDIAYAFPA GHTVRISIST AYWPLIWPSP ESVALTVHTG ASTLTLPERP PDPADEALPP
FAPPERGPGA EPTALEWVDP HRTVAQDLTT NETIYTTFGD AADLEGAALA RLEEIDLAVG
HTIRRSFRIN ESDPLSAQAL IELDARLHRT GWDVRIECQT RMSATVEHFR VSAELEVFEN
GERIFHRHYD EWIPRQLL