Gene Daud_0345 details

Gene Information

Locus tag: Daud_0345
Symbol:
ID: 6025429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 368791
End bp: 370170
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 67%
IMG OID: 641593197
Product: argininosuccinate lyase
Protein accession: YP_001716535
Protein GI: 169830553
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
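
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(368791, 370170)
print(length)                  # 1380
print(protein_length(length))  # 459
```

Both results match the Gene Length and Protein Length fields reported for Daud_0345.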


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTAGAT TGTGGGGCGG ACGGTTCCGC AAGGAAACCG ACCCGCTGGT GGAGGAGTTC 
CACTCGTCTC TATCCTTTGA CCGGCGCCTG TACGCGTACG ACATCCGGGG CAGCATTGCC
CACGCGCGCA TGCTCGGGCG GGTGGGGATC ATCAGCGCCG AAGAGGCCCG GACCCTGGAG
GCCGGCCTGT ACGCCGTCCT CGATGATTTC AACGCCGGCC GGGTGGCTTT TTCCCCGGAG
GACGAGGACA TCCACAGCTT GGTGGAACGG CTGCTGATCG CCCGGGTCGG GGAGGTGGGC
AAAAAACTGC ACACCGCCCG CAGCCGCAAC GACCAGGTGG CGCTGGACGT CCGGATGTAT
CTGAAGGACG AGATCGACGC CGTCCGGGAA CTGCTGGCCG AACTCCAGCA CACCCTCCTG
GATCTGGCCG AGCGGCACAT CGAGACGCTC CTGCCGGGTT ACACCCATCT CCAACGGGCG
CAGCCGGTGA CCCTGGCCCA CCACCTGCTG GCCTACGTGG AAATGTTCCA CCGGGACGCG
GAGCGGCTTG CCGACTGCCG CCGGCGGACC GACGTGCTGC CGCTGGGCGC CGGAGCCCTC
GCCGGGACCG TTTTCCCGAT CGACCGGGAA TACACGGCCG CCGAACTCGG GTTCGCCGCC
CTGGCGGAAA ACAGCCTGGA CGCCGTGTCG GACCGCGACT TCGCGGTCGA GTTCTGCGCC
GCGGCCGCCC TGATCATGGT GCACCTCTCG CGGTTCTGCG AGGAATTGGT GCTGTGGTCC
ACCGCCGAAT TCGGCTTCGC CGAGATGGAC GACGCCTTCG CCACCGGGAG CAGTATGATG
CCCCAGAAGA AAAACCCGGA CATGGCCGAA CTGATCCGGG GCAAGTCGGG CCGGGTGTTC
GGCGACCTGC AGGCGCTCCT GGCCATGCTG AAGGGGTTGC CGCTCGCCTA CAACAAGGAC
ATGCAGGAGG ACAAGGAGGC ACTGTTCGAC GCCGTGGACA CGGTGAAGAA GTGTCTTATG
GTGTTCACCG CAATGATCGG GACGGTCAGC TTCCGGGAGC AGGCCATGGA CCGGGCGGTC
CGGGGCGGTT TCACCAACGC CACCGACCTG GCAGACTACC TGGCCGGACG GGGGGTACCG
TTCCGCGAGG CCCACGAGAT CGTCGGAGAA ATAGTCCTTT ACGCCCTGGA GACCGGCAAG
ACCCTCGAAG AACTCACGCT GGAGGAGTAC CGCCGTTTTT CCGCGGCGGT CGGGGAAGAC
GTGTACGCGG CCATCCGGGT CGAGCATTGC CTCGCCGCCC GTAAGGTGCA CGGCGGCCCC
GCCCCCGAAA CGGTGCGCGC CGCCATCGCC CGCGCCCGCA AGCGCCTGGA GAGAGTCTAG
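
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind that could be compared against the GC content field under Gene Information. The helper names are hypothetical, and how the database itself computes or rounds GC content is not stated in this record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTCTAGAT TGTGGGGCGG ACGGTTCCGC AAGGAAACCG ACCCGCTGGT GGAGGAGTTC"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full multi-line gene sequence to `join_blocks` works the same way, since `str.split()` with no arguments also splits on newlines.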
 
Protein sequence
MSRLWGGRFR KETDPLVEEF HSSLSFDRRL YAYDIRGSIA HARMLGRVGI ISAEEARTLE 
AGLYAVLDDF NAGRVAFSPE DEDIHSLVER LLIARVGEVG KKLHTARSRN DQVALDVRMY
LKDEIDAVRE LLAELQHTLL DLAERHIETL LPGYTHLQRA QPVTLAHHLL AYVEMFHRDA
ERLADCRRRT DVLPLGAGAL AGTVFPIDRE YTAAELGFAA LAENSLDAVS DRDFAVEFCA
AAALIMVHLS RFCEELVLWS TAEFGFAEMD DAFATGSSMM PQKKNPDMAE LIRGKSGRVF
GDLQALLAML KGLPLAYNKD MQEDKEALFD AVDTVKKCLM VFTAMIGTVS FREQAMDRAV
RGGFTNATDL ADYLAGRGVP FREAHEIVGE IVLYALETGK TLEELTLEEY RRFSAAVGED
VYAAIRVEHC LAARKVHGGP APETVRAAIA RARKRLERV