Gene Hhal_1441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1441 
Symbol 
ID4711161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1552480 
End bp1554465 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content70% 
IMG OID639855908 
Productpeptidase M1, membrane alanine aminopeptidase 
Protein accessionYP_001003010 
Protein GI121998223 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.51284 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAGGC CTACCCGGAT CCTCGCCGCT CTCCTTCTCG CCGCCCTGCC GGCGGCGGCC 
GTGGCCGCGT GGCCCGACAC CCCTCAAGCC GAGATGGACG TTCGGCTGCA CCCCGAAGAC
GGCCGCCTGG AGGGGCGCAT GGCCCTGCGA TTGCCGGACG AAGAGCCGCT GTCCCTGGCG
GTTGGCCCAG GGTTTCGGAT CGATCAGACC GAACTGACCG CCGGAAGGGT CGAGACCCTA
GGTCAGCGGG GCGTCATCCT GCATCCGGAC GAGGCCACCG AAGCCCGGCT GCGCTGGTCC
GGCGAACCCG ACGGACAGGG CCGGGGCAGC CACCTGCACA CCGAGGGGGC CTGGCTCGAG
GCGGCAGCCG GCTGGCACCC CCGTCCGGCA TCGCGGCGGA TGGGCTACCG GCTGATCATC
GCTGTGCCCG AGCCGCTCCA AGTGGTCGCC GAGGGGACGC GGGCCGAGGA GCAGAGTGAG
GACGGACTGC GCCGCGTCGA ATTCCACCAC CCGGCACCGG CACTCGGCAT CGCCCTGTTC
GCCGGCGAGT GGCAGCACCG CACCCGCGAG GCTCGGCACG GGACCGTCCA CACCTTCTTC
CCGGAGACCC TGGCGGAGCA CCACGAGACC TATCTGGAAC GCACCGCGGC CTACCTGGAC
GAGTACAGCG AGTGGATCGG CCCACCCCCA CACGAGACCT TCTCGGTGCT GGCCACTCCC
TACCCCGTGG GGCTGGCCTT CGCCGGATTC ACCGCCCTGG GCGAGCAGGT GATCCCCCTG
CCCTTCATCC CGGACACCTC GCTGCCCCAC GAGGTCGTCC ACAACTGGTG GGGACGCGGC
GTCTACACCG ACTATGACGA CGGCAACTGG AATGAAGCGC TGACCTACTA CATGGGCGAC
TACCACCAGG CCCTGCAGCG GGATATCGAC GAGGCCCGCC GCCTGCGTGG TGACTGGTTG
CGCAGCCAGG CCGCCCTCCC CGAGGTAGCC GACTACCCGC TGGGCGAGTT CCGGCACAAT
CGCGGCTCGG CGGATGAGAT CGTCGGCTAC CAGCGCGGCG CTTCCCTCTT CCACACCCTG
CACCGCACCC TGGGGGAGGC GGGCTTCGAC GAGGCGATCC GCCGGTTCTA TGAGCAGCAA
GTCCACCGTG AGGCCGGCTG GCCGGATCTC GAGGCCACCT TCAGCGACGC CGCCGAGGCC
GACGACGCTG AGACCGAAAC GATCCAGGCG CTCTTCCGGT GGTTCCTTTC AGCCACCGAG
CTCCCAGACC TTGAGATGGA CGAACGCACG CTGACCGTTG CCCGCGACGG CGACGACCGG
TACCGGGTCG AGGTCGAGAT CGACTGGGAC GAGGAAGGCT ATCCGGTCTC CATCCCGGTC
GCCCTCAAGG GCGACGAGGG CCGACTGAAC GAGCAGGAGA TCCACCTGCA GCCCGGAGAG
CGCACCCGCA TCGAACTCGC CAGCGAAACG CGCCCCCGCT ACCTGCAGGC CGACCCCGAT
CAGCATGTGT ACCGACAATT GGCCCTGGGC GAGGGGGTGG CCATCCTACG TGACACCCTA
CTGGCGGAAT CTGTGACCCT GGTCAGCGCC TGGGAAGACC TGGAGGCAAC CGCCAACCAG
GCTTTGCGCG GCGATGTCGA ACCCGGCGAA CCCGACCGGG ATCGCCCGCT GTTGATCGTG
GCGCCCCGAG AGGCCGTCGG CGAACACCTG GAGGCGGCGC AGTCGTGCAT CACCGAGCGC
ATCCGCCCCG TGGACCACGA CACCGTCGCC TGGGCCAGCA CCACCGGTGG CGGACAACCG
CTGATCGTCC TCGCCGCTAA GGATCGAGAG CAGGCGCGCC AGGCGCTGCA GCGTCTGGCG
CGCTACGGCC GCCACAGCTA CGTCGGTTTC GGCAGTGCCC GCGGCGACGC CGAGACCGGA
CTCTACGAAC CCGCCGATCG CCACGGTCTG CGTCTACCGC TGGCCGATCA GTTCGACGGC
GACTGA
 
Protein sequence
MRRPTRILAA LLLAALPAAA VAAWPDTPQA EMDVRLHPED GRLEGRMALR LPDEEPLSLA 
VGPGFRIDQT ELTAGRVETL GQRGVILHPD EATEARLRWS GEPDGQGRGS HLHTEGAWLE
AAAGWHPRPA SRRMGYRLII AVPEPLQVVA EGTRAEEQSE DGLRRVEFHH PAPALGIALF
AGEWQHRTRE ARHGTVHTFF PETLAEHHET YLERTAAYLD EYSEWIGPPP HETFSVLATP
YPVGLAFAGF TALGEQVIPL PFIPDTSLPH EVVHNWWGRG VYTDYDDGNW NEALTYYMGD
YHQALQRDID EARRLRGDWL RSQAALPEVA DYPLGEFRHN RGSADEIVGY QRGASLFHTL
HRTLGEAGFD EAIRRFYEQQ VHREAGWPDL EATFSDAAEA DDAETETIQA LFRWFLSATE
LPDLEMDERT LTVARDGDDR YRVEVEIDWD EEGYPVSIPV ALKGDEGRLN EQEIHLQPGE
RTRIELASET RPRYLQADPD QHVYRQLALG EGVAILRDTL LAESVTLVSA WEDLEATANQ
ALRGDVEPGE PDRDRPLLIV APREAVGEHL EAAQSCITER IRPVDHDTVA WASTTGGGQP
LIVLAAKDRE QARQALQRLA RYGRHSYVGF GSARGDAETG LYEPADRHGL RLPLADQFDG
D