Gene Hhal_0120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0120 
SymbolpepN 
ID4710619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp134820 
End bp137531 
Gene Length2712 bp 
Protein Length903 aa 
Translation table11 
GC content67% 
IMG OID639854578 
Productaminopeptidase N 
Protein accessionYP_001001716 
Protein GI121996929 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID[TIGR02414] aminopeptidase N, Escherichia coli type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.269734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGCTCAG GGTGCCCGGA TCAGGTAAGC TGCTTATTCC ACACCGATCA CCACACTATA 
TTCAACTCGA GCGCCCCCAT GAGTGCATCC CGACCGCAGA CCATCCGCCT GCGCGACTAC
CAGCCGCCAG CCTTCCTCGT CGAACGCATC CACCTCCGCG TCGACCTCAG CGGCGGCACC
GCCCAGGTGG ATGCCACTCT GGATCTGCAC CGCAACCCGG CTGCCGACAA CGACGCCCCC
CTGCGCCTGG ACGCCGAATT CCTGGATCTG GAGTCGCTGA CCCTGGACGG CGCCGAGCTG
GAGCCGAGCA CGCTGCGCGA CGAACAGGGC ACGATCGTCC TCCATGATGT CCCGCAGCGC
TGTCGGGTGG AGTCGGTCAG CCGCTTCGAC CCGGCAGCCA ACACGGCCCT CTCCGGTCTG
TATCGCTCCG GCGGTATGTT CTGCACCCAG TGCGAAGCGG AAGGGTTCCG TCGCATCACG
CCCTACCCCG ACCGCCCGGA CGTGCTGGCC CCGTTCACCA CCACGGTGGT CGCCGACCGC
GCCACCTGTC CGGTACTGCT ATCCAATGGG GATTGCATCG ACCGGGGTGC GCTCGACGCG
GAGCGCCACT ACGCCGTCTG GCACGACCCC TTCCCCAAGC CGTCGTACTT GTTCGCCCTG
GTCGCCGGCG ACCTGGCCTG TCAGGAGGCC ACCTTCGTGA CCGCCTCGGG ACGCGAGGTC
GCCCTGCACT TCTATGTCGA GCCCGAAAAC GCCGGACGCA CCGAGCACGC CTTGGCTGCC
CTGCAGCGGG CCATGCGCTG GGACGAGACC CACTACGGCC TGGAGTACGA CCTGGACACC
TACATGGTCG TCGCCGTCGG CGACTTCAAC ATGGGCGCCA TGGAAAACAA GGGGCTGAAT
GTCTTCAACA CGCAGTTCGT CCTGGCCAGC CCCGACACCG CCACCGACGC CGATTACGAG
AACATCGAGG CGGTCATCGG CCACGAGTAC TTCCACAATT GGACCGGCAA CCGGGTCACT
TGCCGCGACT GGTTCCAGCT CTCCCTCAAG GAGGGTCTGA CCGTCTTCCG CGAGCACCAG
TTCGCCGAGG CCATGGGCTC GGAGGCCGTC CAACGCATCG GCCAGGTCCG TCTGCTGCGC
ACGGCCCAGT TCCCCGAGGA CGCTGGCCCC ATGAGCCATC CGGTCCGCCC CGACTCCTAC
GTGGAGATCA ACAACTTCTA CACCGCAACC GTCTACGCCA AAGGCGCTGA AGTCATCCGC
ATGTACCACA CGTTGCTCGG CGACGATGCC TTTCGACGCG GGGTCCAGCG CTACCTGCAA
CGCCACGACG GAGAGGCCGC CACCATCGAG GATTTCCTCG CCGCCATGGA GGAGGCCGGC
GGGCTGGATC TGCAGCAATT CGCACTGTGG TACACCCAGG CGGGCACTCC GCGGATCGAG
GTCGAAGACC ACTACGACGC GGAGCACAAC ACCTACACCC TGATCTGCCG GCAGTCCCTG
CCGGCCAGCC CCGGCCAGCC TCACAAGGAT CCGATGCACA TCCCGCTGGC CGTCGGGCTG
CTCGGCCGAG ACGGCAGACC GCTCGCCGCC CGTCGACCGG ACGAGTCCAC TGCCCACGCC
CACACCCGCG TCCTCGAACT ACGCGAGCCC GAGCAGCACT TTGTCTTTGA GCAGTGCCCG
GAACGCCCCG TGCCCTCGCT GCTGCGGGGG TTCTCGGCAC CGGTCAAGCT GCACTATCCG
TACACCGATG ACGACCTCGG CTTCCTGCTC GCCCACGACT CCGACCCATT CGCACGCTGG
GAAGCGGGAC AGCAACTGGC GCTGCGCGTC CTCCTCGATG AGGCGCAGGG CACCCCACGC
CCGGACGGCA TTGAACGGCT TGAGGAGGCG TTCCGCGCCT CTCTGGAAGC GCCCGACACC
GACCCCGCCC TGGTCGCCGA AGCGCTGACC CTGCCCGGCG AGACCTACCT CGCCGAGCAG
ATGGACGTCG TGGACCCCCA GGCGATCCAC GACGCCCGCC AGCGGGTCCG GGCCATTCTG
GGCGAGGCAC TCGAGGGCCA CTGGCTGATC CTGCACAGTC AGCACAGCGG CCCGTGGCAC
TACGAGCCCG ATGAGGTCGC CCGGCGCCGC CTGCGCAACC TGGCCCTCGG CTACCTGAAC
GCCGGCTCGG AGCGCCACCT GAGCCGGGCA CTCGAGCAAC TCGAACGCAG CGACAATCTC
ACCGATCGAC TCGCCGCGCT GGCGGTGCTC GCGGACAGTC CTCGGGCCGA GGCCGAGCAA
GCGGTAGACG CCTTCTATCA GCGCTGGTCC GATGAACCGC TCGTAGTCGA TAAGTGGTTC
CGGACCCAGG CGCTGGCCGA CCGGCCGGAC ACCGTTTACC GGGTCCATCA GCTCACCGAG
CACCCGGACT TCACGCTGGA CAACCCAAAC CGGGCCCGCG CCCTACTGGG CGCGTTTGCC
CAGGGCAACC CGGCCCACTT CCACAGCCCG GACGGCAGCG GTTATCGCCT CCTCGGCGAG
CACGTCCTTC GCCTCGACCC GAACAACCCC CAGCTCGCCG CCCGGCTGCT GGCACCCCTG
GCCCAGTGGC GCCGCTACAA CGCGAGCCGT CGCCACGCCA TGCGCGAGCA ACTCGAACGG
ATCCTTGAGC GCGATGCGCT GTCGAAGGAT GTGTACGAGG TCGCTTCCAA AAGCCTGGGG
GCGATGAGCT AA
 
Protein sequence
MCSGCPDQVS CLFHTDHHTI FNSSAPMSAS RPQTIRLRDY QPPAFLVERI HLRVDLSGGT 
AQVDATLDLH RNPAADNDAP LRLDAEFLDL ESLTLDGAEL EPSTLRDEQG TIVLHDVPQR
CRVESVSRFD PAANTALSGL YRSGGMFCTQ CEAEGFRRIT PYPDRPDVLA PFTTTVVADR
ATCPVLLSNG DCIDRGALDA ERHYAVWHDP FPKPSYLFAL VAGDLACQEA TFVTASGREV
ALHFYVEPEN AGRTEHALAA LQRAMRWDET HYGLEYDLDT YMVVAVGDFN MGAMENKGLN
VFNTQFVLAS PDTATDADYE NIEAVIGHEY FHNWTGNRVT CRDWFQLSLK EGLTVFREHQ
FAEAMGSEAV QRIGQVRLLR TAQFPEDAGP MSHPVRPDSY VEINNFYTAT VYAKGAEVIR
MYHTLLGDDA FRRGVQRYLQ RHDGEAATIE DFLAAMEEAG GLDLQQFALW YTQAGTPRIE
VEDHYDAEHN TYTLICRQSL PASPGQPHKD PMHIPLAVGL LGRDGRPLAA RRPDESTAHA
HTRVLELREP EQHFVFEQCP ERPVPSLLRG FSAPVKLHYP YTDDDLGFLL AHDSDPFARW
EAGQQLALRV LLDEAQGTPR PDGIERLEEA FRASLEAPDT DPALVAEALT LPGETYLAEQ
MDVVDPQAIH DARQRVRAIL GEALEGHWLI LHSQHSGPWH YEPDEVARRR LRNLALGYLN
AGSERHLSRA LEQLERSDNL TDRLAALAVL ADSPRAEAEQ AVDAFYQRWS DEPLVVDKWF
RTQALADRPD TVYRVHQLTE HPDFTLDNPN RARALLGAFA QGNPAHFHSP DGSGYRLLGE
HVLRLDPNNP QLAARLLAPL AQWRRYNASR RHAMREQLER ILERDALSKD VYEVASKSLG
AMS