Gene Daro_3676 details

Gene Information

Locus tag: Daro_3676
Symbol: pepN
ID: 3566788
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3950318
End bp: 3952924
Gene Length: 2607 bp
Protein Length: 868 aa
Translation table: 11
GC content: 63%
IMG OID: 637682149
Product: aminopeptidase N
Protein accession: YP_286875
Protein GI: 71909288
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0308] Aminopeptidase N
TIGRFAM ID: [TIGR02414] aminopeptidase N, Escherichia coli type
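
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length counts the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is part of the gene but does not encode an amino acid.
    return codons - 1 if includes_stop else codons


length_bp = gene_length(3950318, 3952924)
print(length_bp)                  # 2607
print(protein_length(length_bp))  # 868
```

Both results agree with the Gene Length and Protein Length fields listed in this record.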


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACCG ATACCCCCCA GACCATCTAT CTCAAGGACT ACACCGTCCC CGCCTATCTG 
GTCGACACCG TCGACCTCGA TTTCAATATC GAAACCGGTG GCACCACCGT GAGCTCGATG
CTGGCCATGC GCCGCAACCC GGCCGCGGCC GGTCAGCCGC TGGTCCTCGA CGGCGACGAA
CTGGAAACGC TCAGTGTCAC GGTCGACGGC CACCAGGTGC CCTTTGCGGC CACGGCCAGC
ACCCTGACCA TCACCGACCT GCCGGAAACC TTCACGCTGC AGACTGTCGT CCGCATCGAC
CCGGACAAGA ACACCCGCCT GTCCGGTCTC TATCGCTCGA CCGATGGCTA CTTCACCCAG
TGCGAGGCAC AGGGTTTCCG CCGGATCACC TGGTTCCTCG ACCGCCCAGA CGTGATGTCG
ACCTACACTG TGACGCTGCA CGCCGACCAG GCCACCTATC CGGTCCTGCT CGCCAACGGC
AATCCGGTTG ATGCCGGAGA AGAAGCCAAT GGCCGTCACT GGGCAAAATG GGCCGACCCG
TTCAAGAAGC CGGCCTACCT GTTCGCCGTC GTCGCCGGCA AGCTTGACGT GCTGAAAGAC
ACCTTCAGGA CCGCTTCCGG CCGCAGCGTG CAGCTCGCCA TCTACGTCGA GCCGGGCAAG
CTCGACCAGT GCCCGCACGC CATGGCCGCG CTGCAGAAAT CCATGAAATG GGACGAGGAA
CGCTTCGGCC TCGAATGCGA TCTCGACCAT TACATGATCG TCGCCGTCGG CGACTTCAAC
ATGGGCGCGA TGGAGAACAA GGGCCTGAAC ATCTTCAACA CGAAGTATGT ACTCGCTCGC
AGCGATGTGG CCACCGATGT CGATTTCGAG AACATCGACC GCGTCGTCGC CCACGAATAT
TTCCACAACT GGACCGGTAA CCGCGTCACC TGCCGTGACT GGTTCCAGCT CTCGCTGAAG
GAAGGCCTGA CCGTCTTCCG CGACCAGGAG TTCGGGGCCG ACCTGCACAA CCGCCAGACC
GCCCGTATCC GCGAAGTACG CGGCCTGCGC GCTGCCCAGT TCCCGGAAGA TGCTGGCCCG
ATGGCCCACC CGATCCGCCC GGCCAGCTTC GTTGAGATCA ACAATTTCTA CACCTCGACC
GTCTATGAAA AAGGCGCCGA AGTCATCCGG ATGATCCAGA CCCTGATCGG CCGCGATGCT
TTCCGCGCCG GGATGGATGA ATACTTCCGT CGCCATGACG GCCAGGCAGT GACCTGCGAG
GATTTCGTCG CCGCGATGAG CGCTGCCTCC GGCTTCGACT TCACGCAATT CATGCGCTGG
TACAACCAGC CGGGCACGCC GCATGTCGCT GTGGACGGCC ACTTCGATCC AGACAGTCAG
ACCTACACGC TGACCTGCAC GCAGTCGAAC CCACGCGCCA GCGACGAGCA GCCCTACCTG
ATCCCGATCC GCGTCGCACT GTTTGGCGAG GACGGCAGGC TGCTGCCCAA CAGCGAACGT
CTGCTGCATA TGACGGCCGC CACCCAATCC TTCGTCTTCA ACGACCTGAG CAGCGAACCC
GTTCCGTCGC TGCTGCGCGA TTTCTCTGCC CCGGTCATCC TCAACTTCGA CTACACGCCG
GAGCAACTGA CCCTGCTGCT GGCCCATGAA AGCGACCCGT TTAACGCCTG GGAAGCTGGC
CAGCGCCTGG CTTCGACACT GATTCTCGAG GCCACCGCAG CCATCGCCGC CGGCCGGCAG
CTGGTCTGGC CCGCCAGCTT CGTCGAGGCC GTTCGCCGCC TGCTGCAAAC CCAAGCCCGG
CGTGGCGCCG CCTTCGTGGC CGAAGCCCTA ACCCTGCCCG GCGAATCGAC GCTGGCCGAA
GCACTGGACG TCGTTGATCC CGATGCGCTG CATGCTGCCC GTAACGCGCT GCGCCGACAT
CTGGCCGAGC AACTCGAAGG TGAATTCTCC GGTCTTTACG CCGGGCTCGC CCCCAACGCT
GCCTACGCAC CGACCAGCGA ACAGGCCGGC CGGCGCGCCC TGCGCAATGC CTGCCTGGGC
TATCTGCTCG AACTCGATAC GCCAGCCGTC CGCCAACTGG CGTTGCAGCA GTTCGCTAGC
GCCGACAACA TGACCGACCA GTTCGCCGCA CTGTCGGTGC TGGCCAACGT CAATACCGAC
ACCTGCCCGG AACGCGACAA GGCACTGGCT GATTTCTACG CCCGCTGGCA ACACGAAGCG
CTGGTTGTCG ACAAATGGCT GGCCGTTCAA TCAACCAGCC GCCGCCCGGA TACGCTGGAT
ACGGTCAGGG CGCTCACCGC CCACCCGGCC TTCGACATCG GTAATCCGAA CAAAGTCTAT
TCGCTGATCC GCGCCTTTGG CGCCAATCTG GCACGCTTCA ATGCCGCGGA CGGCAGCGGT
TATGCCTTCA TCGCCGAACG GGTGATTGAA CTGCACGACC GTAACCCGCA AGTCGCTTCG
CGTCTGGCCC GTTGCTTCGA CCGTTGGAAG AAATTCGACA CCGGCCGACA GCGCCACGCA
CGTGCGGCGC TGGAAAGCAT CCGCGATCAT GCCAACCTGT CGCGCGACGT GCTGGAAGTC
GTGACGCGTT CGCTGAGTGC TGACTGA
 
Protein sequence
MKTDTPQTIY LKDYTVPAYL VDTVDLDFNI ETGGTTVSSM LAMRRNPAAA GQPLVLDGDE 
LETLSVTVDG HQVPFAATAS TLTITDLPET FTLQTVVRID PDKNTRLSGL YRSTDGYFTQ
CEAQGFRRIT WFLDRPDVMS TYTVTLHADQ ATYPVLLANG NPVDAGEEAN GRHWAKWADP
FKKPAYLFAV VAGKLDVLKD TFRTASGRSV QLAIYVEPGK LDQCPHAMAA LQKSMKWDEE
RFGLECDLDH YMIVAVGDFN MGAMENKGLN IFNTKYVLAR SDVATDVDFE NIDRVVAHEY
FHNWTGNRVT CRDWFQLSLK EGLTVFRDQE FGADLHNRQT ARIREVRGLR AAQFPEDAGP
MAHPIRPASF VEINNFYTST VYEKGAEVIR MIQTLIGRDA FRAGMDEYFR RHDGQAVTCE
DFVAAMSAAS GFDFTQFMRW YNQPGTPHVA VDGHFDPDSQ TYTLTCTQSN PRASDEQPYL
IPIRVALFGE DGRLLPNSER LLHMTAATQS FVFNDLSSEP VPSLLRDFSA PVILNFDYTP
EQLTLLLAHE SDPFNAWEAG QRLASTLILE ATAAIAAGRQ LVWPASFVEA VRRLLQTQAR
RGAAFVAEAL TLPGESTLAE ALDVVDPDAL HAARNALRRH LAEQLEGEFS GLYAGLAPNA
AYAPTSEQAG RRALRNACLG YLLELDTPAV RQLALQQFAS ADNMTDQFAA LSVLANVNTD
TCPERDKALA DFYARWQHEA LVVDKWLAVQ STSRRPDTLD TVRALTAHPA FDIGNPNKVY
SLIRAFGANL ARFNAADGSG YAFIAERVIE LHDRNPQVAS RLARCFDRWK KFDTGRQRHA
RAALESIRDH ANLSRDVLEV VTRSLSAD
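
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequences are pasted with the 10-character spacing shown above, and it uses the amino acid assignments of translation table 11, which match the standard genetic code. Table 11 also allows alternative start codons that are read as methionine at the initiation position; this sketch does not handle that case, which does not matter here because the gene starts with ATG. The helper names are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def normalize(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = normalize(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last display lines of the gene sequence in this record.
first_line = "ATGAAAACCG ATACCCCCCA GACCATCTAT CTCAAGGACT ACACCGTCCC CGCCTATCTG"
last_line = "GTGACGCGTT CGCTGAGTGC TGACTGA"

print(translate(first_line))  # MKTDTPQTIYLKDYTVPAYL
print(translate(last_line))   # VTRSLSAD
```

The two outputs correspond to the first 20 and the last 8 residues of the protein sequence listed above. The last line is in frame because the 43 preceding lines each hold 60 bases.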