Gene Daro_3681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3681 
Symbol 
ID3566793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3956048 
End bp3957436 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content65% 
IMG OID637682154 
Productargininosuccinate lyase 
Protein accessionYP_286880 
Protein GI71909293 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCCA ACGCCAATCA ATACACTTGG GCCGGCCGCT TCTCGGAGCC GGTTTCCGAT 
CTCGTCAAAC GTTACACCGC CTCGGTCGAT TTCGACCAGC GCATGTGGCG CCAGGACATC
CGCGGCTCGC TGGCGCACGC CCGCATGCTG GCCAAGCAAG GCATCATCGC CGCCGCCGAC
CTGGCCGACA TCGAGCGCGG CATGGCTATC GTCACCGAGG AAATCGAGTC GGGCAAATTC
GAATGGTCGC TCGACCTCGA AGACGTGCAC CTGAATATCG AGAAGCGCCT GACCGCGCTG
GTTGGCGATG CCGGCAAGCG CCTGCACACC GGCCGCTCGC GCAACGATCA GGTCGCCACC
GACATTCGCC TCTACCTGCG TGACTCGATC GACGACATTC TGGTGCTGAT CAAGGCTTTC
CGTTCAGCCC TGGTTGATCT GGCCGAAAAG GAAGCGGCAA CGCCGATGCC CGGCTTCACG
CACCTGCAGG TCGCCCAGCC GGTCACTTTT GGCCACCACA TGCTGGCCTA CTTCGAAATG
TTCGGCCGCG ACGCCGAGCG CTACGCCGAC TGCCGCAAGC GCGTCGCCCG CCTGCCGCTC
GGCGCCGCCG CGCTGGCCGG CACGACCTAC CCGATCGACC GCGCCTACGT TGCCGAACAG
CTCGGCTTCG AAGGCGTCTG CGAAAACTCG CTGGATGCCG TGTCGGATCG CGACTTCGCC
ATCGAATTCA CCGCCGCCTG CGCCCTGCTG ATGATGCACA TCAGTCGCCT GTCGGAAGAA
CTGGTCATGT GGATGAGCCC GCGCATCGGC TTCATCCAGA TCGCCGACCG CTTCTGCACC
GGCTCGTCGA TCATGCCGCA GAAGAAGAAC CCGGACGTGC CGGAACTGGC GCGTGGCAAG
ACCGGTCGGG TCTATGGCCA GCTGATGAGC CTGCTGACCC TGATGAAGTC GCAGCCGCTG
GCCTACAACA AGGACAATCA GGAAGACAAG GAACCGCTGT TCGACGCCGT AGACACCGTC
ACTGACACGC TGCGCATCTT TGCCGACATG GCCGGCGGCA TCACCGTCCG CGCCGACAAC
ATGAAGGCCG CACTGACCCA GGGCTTCGCC ACCGCCACCG ACCTCGCCGA CTATCTGGTC
AAGAAGGGCC TGCCTTTCCG CGATGCCCAC GAAGCCGTCG GCCACGCCGT CAAGGCCGCC
GAGCAAAAGG GCGTCGACTT GCCGCAACTG ACGCTGGACG AACTCAAGGC CTTCTGCCCA
CAGGTGGAGA GCGACGTATT CGCCGTGCTC ACCGTCGAAG GCTCGCTGGC TTCCCGCAAC
CACATTGGCG GCACCGCGCC GGAGCAGGTC AGGGCCGCTG TTGCGCGCGC TCGCCAGCGC
CTGAGCTGA
 
Protein sequence
MTSNANQYTW AGRFSEPVSD LVKRYTASVD FDQRMWRQDI RGSLAHARML AKQGIIAAAD 
LADIERGMAI VTEEIESGKF EWSLDLEDVH LNIEKRLTAL VGDAGKRLHT GRSRNDQVAT
DIRLYLRDSI DDILVLIKAF RSALVDLAEK EAATPMPGFT HLQVAQPVTF GHHMLAYFEM
FGRDAERYAD CRKRVARLPL GAAALAGTTY PIDRAYVAEQ LGFEGVCENS LDAVSDRDFA
IEFTAACALL MMHISRLSEE LVMWMSPRIG FIQIADRFCT GSSIMPQKKN PDVPELARGK
TGRVYGQLMS LLTLMKSQPL AYNKDNQEDK EPLFDAVDTV TDTLRIFADM AGGITVRADN
MKAALTQGFA TATDLADYLV KKGLPFRDAH EAVGHAVKAA EQKGVDLPQL TLDELKAFCP
QVESDVFAVL TVEGSLASRN HIGGTAPEQV RAAVARARQR LS