Gene Daro_3662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3662 
Symbol 
ID3567604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3936334 
End bp3937632 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content63% 
IMG OID637682135 
Productaminopeptidase P 
Protein accessionYP_286861 
Protein GI71909274 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones59 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCACG CCCACTTTCT CGCCCGCCGC AAGCGCCTGC TGAAGACCAT CGGCGACGGC 
GTCGCCATCG TACCCACCGC ACCGGAAGTC ATTCGCAACC GCGATGCGCA TCATCTTTAC
CGATTCGACA GCTATTTCTG GTATCTGACT GGCTTCCCGG AACCAGAAGC GGTCGTTGTG
CTAATCGGCG GCAAGAAGCC GAAATCCATC CTCTTCTGCC GCGAAAAGCA TGAAGAACGC
GAAATCTGGG ACGGCTATCG CTACGGCCCG AAAGCGGCAA AAACCGCCTT CGGCTTCGAC
GCTGCCTATC CGATCGAGCA ACTCGACAAG AAACTGGCCG AGTTCCTGGT CGACCGCGAC
ACACTGTGGC ACGCCATCGG TCACGACGCC GAATGGGACG CCCGGATCGC CAAGGCCCTG
AACGAAGTCC GCGCCCAGAC CCGGGCCGGC AAGCGGGCGC CGCGCGCCAT TCACGACCTG
CGCGCCGAAC TCGACGGTAT GCGCCTGGTC AAGGACAGTG CCGAGGCCGG CATCCAGCAA
CGCTCGGCCG ATATTGCCAG CGCCGGCCAC GCCCGCGCCA TGCGCGCCTG CCGCCCCGGC
ATGGCCGAGT ACGAACTGGA AGCCGAACTG ACTTACGAAT TCCGCAAGCG CGGTGCCGAT
GCCCATGCCT ACACGCCCAT CGTTGCCGGT GGCACCAACG CCTGCGTGCT TCATTACGTG
TCGAACGACA AGGTACTCAA CGACCACACC CTGGTCCTGA TCGACGCTGG TTGCGAAGTA
GACGGTTACG CCGCCGACAT CACCCGTACT TTCCCGGTCA ATGGCCGCTT CAACCCCGCG
CAGAAGGATG TGTACGAAAT CGTCCTCGCC GCGCAGACGG CGGCCGTCGC CGCCACCGCG
CCAGGTCGCC ATTTCATGGA AGGCCACGAT GCCGCCGTCC GCGTGCTGAC TCAAGGCCTG
ATCGACCTCA AGCTGCTCAC CGGCAACCTC GACAATCTGA TCGAAAAAGG TGATTACAAG
CGCTTCTACA TGCACCGCAC CGGCCACTGG CTCGGGCTGG ATGTGCACGA CGCCGGCGAA
TACAAGGTCG GCGACGCATG GACGACCTTG CAGCCAGGCA TGACCCTGAC CGTCGAACCC
GGCCTCTACA TCCGCCCCGG CACCGATATC CCGCCAGCAC TGGCCGGCAT CGGCATCCGC
ATCGAGGACG ACGTGCGCGT CACGGAGAAT GGTTGTGACA TCTTCACCAC GGCGCCGAAA
ACGGTGGCCG AGATCGAGGA AGTCATGCGC CATGACTGA
 
Protein sequence
MTHAHFLARR KRLLKTIGDG VAIVPTAPEV IRNRDAHHLY RFDSYFWYLT GFPEPEAVVV 
LIGGKKPKSI LFCREKHEER EIWDGYRYGP KAAKTAFGFD AAYPIEQLDK KLAEFLVDRD
TLWHAIGHDA EWDARIAKAL NEVRAQTRAG KRAPRAIHDL RAELDGMRLV KDSAEAGIQQ
RSADIASAGH ARAMRACRPG MAEYELEAEL TYEFRKRGAD AHAYTPIVAG GTNACVLHYV
SNDKVLNDHT LVLIDAGCEV DGYAADITRT FPVNGRFNPA QKDVYEIVLA AQTAAVAATA
PGRHFMEGHD AAVRVLTQGL IDLKLLTGNL DNLIEKGDYK RFYMHRTGHW LGLDVHDAGE
YKVGDAWTTL QPGMTLTVEP GLYIRPGTDI PPALAGIGIR IEDDVRVTEN GCDIFTTAPK
TVAEIEEVMR HD