Gene Daro_2021 details


Gene Information

Locus tag: Daro_2021
Symbol:
ID: 3566968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 2173910
End bp: 2175487
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 54%
IMG OID: 637680492
Product: L-aspartate oxidase
Protein accession: YP_285236
Protein GI: 71907649
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0029] Aspartate oxidase
TIGRFAM ID: [TIGR00551] L-aspartate oxidase
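
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS is complete and ends in a single stop codon.

```python
# Values copied from the Gene Information fields above.
START_BP = 2173910
END_BP = 2175487
GENE_LENGTH_BP = 1578
PROTEIN_LENGTH_AA = 525


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1578
print(expected_protein_length(GENE_LENGTH_BP))  # 525
```

Under these assumptions both results agree with the reported gene length (1578 bp) and protein length (525 aa).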


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value: 0.00800394
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGAAAT TTGATGTCCT GATCATCGGT AGCGGGCTTG CTGGCCAGTC TGCCGCATTG 
AGACTCGCCC CTCATTGCCG AGTGGTACTC GTCAGCAAGC GTAGCCTTGA AGACTCCGCA
TCCGGCTGGG CTCAAGGGGG GATTGCTGCG GTACTGGACA GCCAGGATTC CATCGAAGCG
CATATTCGGG ATACGCTGAT TGCCGGCGCT TGGTTGAACG ACGAAAAAGC TACCCGCCAT
GTCGTTGAAA ACGGGCGCCG AGTTATCGAG TGGCTGATTG AACAAGGCGT CCCTTTCACC
AAAGACGACT CTGGCTACCA CCTTACCCGC GAAGGAGGCC ACAGTGCTCG CCGTGTTATC
CACGTTGCTG ACGCGACTGG TCTTGCCGTA CAGGACACGT TAACCAAGAA AGTTCGCGCC
AATCCAAACA TTACCGTTCT GGAAGATCAC ATCGCGATCG ACCTGATCAC GGGAGACAAG
CTCGGTCAAA ACGATAAGCG CTGCTATGGC GCCTACATTC TGAATAATCG TAATGGTGAA
GTGATCACCA TTGGCGCCCA GAACACCCTT ATCGCCACAG GCGGTGCCGG CAAGGTTTAT
CTCTATACGA CCAACCCCGA TACCTCCACA GGTGATGGCA TAGCCATGGC TTATCGTGCG
GGTTGCCGGG TATCGAACAT GGAATTTATC CAGTTTCACC CGACCTGTCT CTATCACCCC
CAGGCTAAAT CATTCCTTAT ATCTGAAGCA GTACGTGGCG AAGGCGGGCT ACTCCGCCTG
CCGGACGGAA CTCGCTTCAT GCCTGAGCAC GATGACCGCG CAGAACTGGC ACCGCGTGAC
ATCGTCGCTC GTGCCATCGA CTTCGAAATG AAGAAACGCG GCCTCGATTG TGTCTTCCTC
GATATTTCAC ACAAGGATGA AGCGTTCATT CGCGGTCATT TCCCGAATAT TTACGCTCGC
TGCCTGGAAC TGGGTATTGA TATCACCCAG GAAGCGATTC CTGTCGTGCC TGCTGCCCAC
TACACCTGCG GCGGTATTGT CAGCGATCTT CACGGCCGCA CCGATGTGTC AGGACTGTAC
GTCGCCGGTG AAGCCTCCTG CACCGGACTG CACGGTGCCA ACCGCCTTGC TTCCAACTCA
TTGCTTGAAT GCCTAGTGTT TGCCGAAGCA GCGGTTAACG ACATCTTGAG CAAGAAATCC
GAAAAAATCC CCACATTGCC GCGCTGGGAT GAAAGCCGGG TGACTGATGC CGATGAAGAA
GTGGTCATAT CCCATAACTG GGATGAGCTC CGTCGCTTCA TGTGGGATTA CGTCGGTATT
GTCCGAACGA CCAAACGACT GAAACGGGCA AAACATCGCA TTGGTTTGTT GATGCGGGAA
ATTGACGAGT TTTATGCCAA TTTCCGGGTT AGCCATGACC TTATTGAGTT ACGCAATTTG
GTAGTAACCG CGGACTTGAT CGTGCGCTGT GCCATGTTGC GCAAAGAAAG CCGGGGACTA
CATTTTTCCC GAGATTTTCC TGATATGGCC AGCAAGCCGA AGAACACGGT CCTCAAGCGT
CGCCGACAAG CGGGCTGA
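
The GC content value reported under Gene Information could be recomputed from a sequence printed in this layout (blocks of 10 bases separated by spaces). The following is a minimal sketch, not the database's own method; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, copied verbatim.
first_line = "GTGCAGAAAT TTGATGTCCT GATCATCGGT AGCGGGCTTG CTGGCCAGTC TGCCGCATTG"

print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```

To compare against the reported figure, the full gene sequence would be passed to `gc_percent` as one multi-line string.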
 
Protein sequence
MQKFDVLIIG SGLAGQSAAL RLAPHCRVVL VSKRSLEDSA SGWAQGGIAA VLDSQDSIEA 
HIRDTLIAGA WLNDEKATRH VVENGRRVIE WLIEQGVPFT KDDSGYHLTR EGGHSARRVI
HVADATGLAV QDTLTKKVRA NPNITVLEDH IAIDLITGDK LGQNDKRCYG AYILNNRNGE
VITIGAQNTL IATGGAGKVY LYTTNPDTST GDGIAMAYRA GCRVSNMEFI QFHPTCLYHP
QAKSFLISEA VRGEGGLLRL PDGTRFMPEH DDRAELAPRD IVARAIDFEM KKRGLDCVFL
DISHKDEAFI RGHFPNIYAR CLELGIDITQ EAIPVVPAAH YTCGGIVSDL HGRTDVSGLY
VAGEASCTGL HGANRLASNS LLECLVFAEA AVNDILSKKS EKIPTLPRWD ESRVTDADEE
VVISHNWDEL RRFMWDYVGI VRTTKRLKRA KHRIGLLMRE IDEFYANFRV SHDLIELRNL
VVTADLIVRC AMLRKESRGL HFSRDFPDMA SKPKNTVLKR RRQAG