Gene Daro_1590 details

Gene Information

Locus tag: Daro_1590
Symbol:
ID: 3568734
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 1706194
End bp: 1707498
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 63%
IMG OID: 637680058
Product: putative aminopeptidase 2
Protein accession: YP_284809
Protein GI: 71907222
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1362] Aspartyl aminopeptidase
TIGRFAM ID:
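
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; both are assumptions, not statements made by the record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values copied from the record for Daro_1590.
length = gene_length_bp(1706194, 1707498)
print(length)                     # 1305
print(protein_length_aa(length))  # 434
```

Under these assumptions the computed values agree with the listed gene length (1305 bp) and protein length (434 aa).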


Plasmid Coverage information

Num covering plasmid clones: 54
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATTC CCGATTCCGC CCGCATCCAG GCACAGGACC TGCTCGACTT CATTGATGCC 
AGCCCCAGCC CCTGGCATGC CGTGCAGACC TGTGAAACCC GGCTGCAGGC CGCTGGTTTC
AGCCGGCTTG AAGAGCTTGA TCGCTGGACG CTGAGCGCGG GTGGTCGCCA TTACGTGGTA
CGCGGTGGCT CGTCGATCAT CGCCTTCATC ATCGGCCGGC AATCGGCCGC CGAGACCGGC
CTGCGGATGA TCGGCGCCCA TACCGATTCG CCAGGCCTGC GCCTCAAGCC GAAGCCGGCC
GAGGATGTGG CAGGCATGGT CAGGCTTGGT GTTGAAGTTT ACGGTGGCCC CATCCTCGCC
ACCTTTGCCG ACCGTGACCT GTCGCTGGCC GGGCGCGTCA ATGTCCGCAC GCCGGGCGGC
TTCACGACCA GGCTGGTGCA TTTCGCCGAA CCACTGCTCC GCCTGCCCAA CCTTGCCGTC
CACATGAATC GGGAAGTTAA CGAGAACGGC CTGAAGTTCA ACAAACAGAC TGAACTACCC
CTGCTACTGG GCGTTTCCGA AGACGGCACG AAAGCCGAGG CACGCTTCCG CCAGCCAATT
GCCGATCGGC TCGGTGTCGA ACCGGGTGAT CTGTTGACCT GGGAACTGAA CGCCTACGAC
ACGCAAAAAG GCAGTTTCTG GGGCGTGGAT CGCGAATTCG TGGCCAACAG CCAGCTCGAT
AATCTTGCCT CCTGCCATGC CGGACTGAGC GCCCTGCTTG CCACGAAAGA ACCCAATGCC
ACCTGCCTGT GCGCCTTTTT CGACCATGAA GAAGTCGGCA GCGAAAGCGC GGCCGGCGCT
GGCGGCAGTT TCGTCTCCGA CGTGATCAGC CGACTGGCCG CCAACGCCGG CCTCGATGGC
GAGGACCAAC GCCGGATGCT GGCGCGGAGC TTCTTCATCA GCGCCGACAT GGCCCACGGC
TGGCACCCCA ATTTTCCGGC CGCCTACGAG CCGTGCCACC ACGCGACGGT GAACGCCGGG
CCGGTCATCA AGAGCAATGC CAACCAGCGT TACAGCACCA ACGCCGATAC CGCCGCCCGC
TTCATGGCAA TCTGTGCCAA AGCAGGGGTG CCCTGCCAGC AATACGCCCA CCGTACCGAT
TTAGGCTGCG GCAGCACCAT CGGCCCCATC GTCGCATCAC GTTTGGGCAT ACCGAGCGTC
GACGTCGGAT CGCCGATGTG GGCCATGCAC AGCATCCGCG AAAGCGCCGG CGTCCTTGAT
CACGCCTATA TGATTTCTGC CTTGACCACG AGTTTCACGG ACTGA
 
Protein sequence
MNIPDSARIQ AQDLLDFIDA SPSPWHAVQT CETRLQAAGF SRLEELDRWT LSAGGRHYVV 
RGGSSIIAFI IGRQSAAETG LRMIGAHTDS PGLRLKPKPA EDVAGMVRLG VEVYGGPILA
TFADRDLSLA GRVNVRTPGG FTTRLVHFAE PLLRLPNLAV HMNREVNENG LKFNKQTELP
LLLGVSEDGT KAEARFRQPI ADRLGVEPGD LLTWELNAYD TQKGSFWGVD REFVANSQLD
NLASCHAGLS ALLATKEPNA TCLCAFFDHE EVGSESAAGA GGSFVSDVIS RLAANAGLDG
EDQRRMLARS FFISADMAHG WHPNFPAAYE PCHHATVNAG PVIKSNANQR YSTNADTAAR
FMAICAKAGV PCQQYAHRTD LGCGSTIGPI VASRLGIPSV DVGSPMWAMH SIRESAGVLD
HAYMISALTT SFTD
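
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the blocks of ten letters are purely a display convention (whitespace is stripped before use) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table, differing only in which start codons are permitted. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, not part of any library.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAATATTC CCGATTCCGC CCGCATCCAG GCACAGGACC TGCTCGACTT CATTGATGCC"
last_line = "CACGCCTATA TGATTTCTGC CTTGACCACG AGTTTCACGG ACTGA"
print(translate(first_line))  # MNIPDSARIQAQDLLDFIDA
print(translate(last_line))   # HAYMISALTTSFTD
```

Each full line of the gene sequence holds 60 bases, a multiple of three, so every line starts on a codon boundary and can be translated on its own. Passing the complete gene sequence to `gc_percent` gives a value that can be compared with the GC content field listed in the record.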