Gene Hhal_0681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0681 
Symbol 
ID4710368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp764071 
End bp765570 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content75% 
IMG OID639855144 
Productleucyl aminopeptidase 
Protein accessionYP_001002265 
Protein GI121997478 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0260] Leucyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.516579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCTCA AGACCAAAAG CGGGGATCCG GCGCGGCAGC GCACCGCCTG TGTCGTGGTG 
GGCGTCTACG AGCGCCGACG GATGAGCGAG GCGGCGCGGG CCGTCGACGC CGCCAGTGAC
GGTTACCTGA GCCATCTGCT GCGCCGGGGC GACCTTGAAG GCGAGGCCGG GCAGACCCTG
CTACTGCCCG ACTGCCCCGG GGTGCGCACC GATCGCGTAC TGCTCGTCGG CTGCGGGCGC
GAGCGCGACT TCAACGAGCG CACCTACCGC AAGGCCGTCA CCGCGGCGGC CCGCGCGCTG
GAGCAGGCGG GCACCGGCGA GGCGATCCTG TTCCTGCCCG AGCTACCCGT CCGCGGCCGC
GACGTGGCCT GGCGCGTGGC CGCCACCGCC GAGATCCTCG AGACCACGCT CTACCGCTTC
GACACCTACA AGAGCGACCC GCGCCCGCCG CGCCGCCCGC TGCGCCAGGC CACCCTGGCC
GTCCCGCGGC GTGCCGACCT GCGCCGCGCC CAGCCGGCGC TCACCCTGGG CCAGGCCGCC
GGCCGCGGCG CCAACTTCTC CCGCGACCTG GGCAACACCC CGGCCAACAT CTGCACCCCC
GGCTACCTGG GGGAACAGGC CGAGGCCCTG GCCCAGCGCT TCGACGGCGT GCGCGCCGAG
ATCCTCGGTC CGGCGGAACT CGAAGAGCAG GGCCTGGCGG CCCTGCTGGC CGTGGCCCGC
GGCGCCGAGG CGCCGCCCCG GCTGGTGGTG CTGCACTACC GCGGCGCCGA CGACGACCAG
GCCCCGGTGG CCCTGGTGGG CAAGGGCATC ACCTTCGACA GCGGCGGCAT CTCCATCAAG
CCGTCGGCGA GCATGGACGA GATGAAGTAC GACATGTCCG GCGCCGCCGC GGTCTTCGGC
GCCGTCCACG CCGCCGCCGA GGCGCAGCTG CCGCTGAACC TGGTGGCCGT CATCCCGGCC
ACCGAGAACA TGCCCGATGG CCGCGCCACC CGCCCCGGGG ACATCATCGA CAGCCTCGAC
GGGCAGCGCA TCGAGGTCCT CAACACCGAC GCCGAAGGCC GCCTGGTGCT GGCCGACGGC
CTCGCCTACG CCCGCCGCCT GGAGCCGAGC GAGGTGGTCG ACGTAGCCAC CCTGACCGGC
GCGGCCATCA TCGGCCTCGG CCACCACCGC CACGCGGTGA TGGGCAACGC CCCGGGGCTG
GTGCGCGACC TGCTCCAGGC CGGCGAGCGC GCCGCCGACC GCGGCTGGGA GCTGCCCCTG
GACGAAGAGT ACGATGAGCA GCTGCGCTCG CCCTTCGCCG ACGTGGCCAA TATCGGCGGA
CAGCCGGCGG GCACCATCAC CGCTGGCTGC TTCCTGCAGC GCTTCGCCCG CGGGCTGCGC
TGGGCGCACC TGGACATCGC CGGCACCGCC TGGAAGAGCG GCGAGCACAA GGGCGCCACC
GGGCGGCCGG TTCCCCTGCT CACCCACTTC CTCGCCGGCC GCGCCGGCTG GACGCTGTGA
 
Protein sequence
MELKTKSGDP ARQRTACVVV GVYERRRMSE AARAVDAASD GYLSHLLRRG DLEGEAGQTL 
LLPDCPGVRT DRVLLVGCGR ERDFNERTYR KAVTAAARAL EQAGTGEAIL FLPELPVRGR
DVAWRVAATA EILETTLYRF DTYKSDPRPP RRPLRQATLA VPRRADLRRA QPALTLGQAA
GRGANFSRDL GNTPANICTP GYLGEQAEAL AQRFDGVRAE ILGPAELEEQ GLAALLAVAR
GAEAPPRLVV LHYRGADDDQ APVALVGKGI TFDSGGISIK PSASMDEMKY DMSGAAAVFG
AVHAAAEAQL PLNLVAVIPA TENMPDGRAT RPGDIIDSLD GQRIEVLNTD AEGRLVLADG
LAYARRLEPS EVVDVATLTG AAIIGLGHHR HAVMGNAPGL VRDLLQAGER AADRGWELPL
DEEYDEQLRS PFADVANIGG QPAGTITAGC FLQRFARGLR WAHLDIAGTA WKSGEHKGAT
GRPVPLLTHF LAGRAGWTL