Gene Daro_3098 details


Gene Information

Locus tag: Daro_3098
Symbol:
ID: 3568495
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3338260
End bp: 3339558
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 61%
IMG OID: 637681569
Product: isocitrate lyase
Protein accession: YP_286298
Protein GI: 71908711
COG category: [C] Energy production and conversion
COG ID: [COG2224] Isocitrate lyase
TIGRFAM ID: [TIGR01346] isocitrate lyase
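
The coordinates and lengths in the fields above can be cross-checked with a little arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based inclusive positions, and that the stop codon is counted in the gene length but not in the protein length. The record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 3338260
end_bp = 3339558
gene_length_bp = 1299
protein_length_aa = 432

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Under these assumptions the listed gene length and protein length agree with the coordinates.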


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value: 0.183218
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACTC GCGAACAGCA AATCGCCGCC CTCGAAAAAG ACTGGGCTGA AAACCCCCGC 
TGGAAAGGCA TCAAGCGCGG TTATTCCGCT GCTGACGTCG TCCGTCTGCG TGGTTCTTTC
CAGGTCGAGC ACACCTTGGC CCGCCGTGGC GCCGAAAAGC TGTGGGATCT GGTCAACAAC
ACCCCCTACG TCAACTGCCT GGGCGCCCTG ACCGGCGGTC AAGCCGTTCA GCAAGCCAAG
GCTGGCATCA AGGCCATCTA CCTGTCCGGC TGGCAAGTTG CTGCTGACAA CAACGAATAC
GCTGCCATGT ACCCGGATCA GTCCCTGTAC CCGGTTGACT CCGTGCCGAA GGTTGTCGAA
CGCATCAACA ACTCCTTCAA CCGCGCCGAC GAAATCCAGT GGTCCAAGAA CATCAACGCT
GGCGATGCCG GTCACGTCGA ATACCACCTG CCGATCGTTG CTGACGCTGA AGCCGGTTTC
GGCGGCGTGC TGAACGCCTA CGAACTGATG AAGGCCATGA TCCGCGCTGG CGCTGCTGGC
GTGCATTGGG AAGACCAGCT GGCTTCCGTC AAGAAGTGCG GCCACATGGG CGGCAAGGTT
CTGGTTCCGA CCACCGAAGC TGTTCAGAAG CTGATCGCTG CCCGTATGGC TGCCGACGTC
TACGGCGTGC CGACCCTGGT CATCGCCCGT ACCGATGCCG AAGCTGCTGA CCTGCTGACC
TCCGACTACG ACGAGAACGA CAAGCCGTTC TTGACCGGCG AGCGCACCGC CGAAGGCTTC
TACAAGACCC GCAAGGGCCT GGATCAAGCC ATCTCCCGCG CCATCGCCTA CGCTGACTAC
GCCGATCTGG TGTGGTGCGA AACCGGCACG CCGGATCTGG AATACGCCCG CAAGTTCGCC
GAAGCCGTGC ATAAGGTTCA TCCGGGCAAG ATGCTGGCCT ACAACTGCTC GCCTTCCTTC
AACTGGAAGA AGAACCTGGA CGACGCCACC ATTGCCAAGT TCCAGAAGGA ACTGGGCGCC
ATGGGCTACA AGTACCAGTT CATCACCCTG GCTGGCATCC ACTCCATGTG GTACAACATG
TTCGATCTGG CCCAGGACTA CGCCGCCCGC GGTATGTCGG CCTACGTCGA GAAGGTTCAG
GAGCCGGAAT TCGCTGCCCG CGACCGTGGC TACACCTTCG TGTCGCACCA GCAGGAAGTC
GGTACCGGTT ACTTCGACGA CGTCACCACC GTGATCCAGG GTGGCAAGTC CAGCGTCACC
GCGCTGACCG GCTCGACCGA AGAAGAACAG TTCCACTAA
 
Protein sequence
MSTREQQIAA LEKDWAENPR WKGIKRGYSA ADVVRLRGSF QVEHTLARRG AEKLWDLVNN 
TPYVNCLGAL TGGQAVQQAK AGIKAIYLSG WQVAADNNEY AAMYPDQSLY PVDSVPKVVE
RINNSFNRAD EIQWSKNINA GDAGHVEYHL PIVADAEAGF GGVLNAYELM KAMIRAGAAG
VHWEDQLASV KKCGHMGGKV LVPTTEAVQK LIAARMAADV YGVPTLVIAR TDAEAADLLT
SDYDENDKPF LTGERTAEGF YKTRKGLDQA ISRAIAYADY ADLVWCETGT PDLEYARKFA
EAVHKVHPGK MLAYNCSPSF NWKKNLDDAT IAKFQKELGA MGYKYQFITL AGIHSMWYNM
FDLAQDYAAR GMSAYVEKVQ EPEFAARDRG YTFVSHQQEV GTGYFDDVTT VIQGGKSSVT
ALTGSTEEEQ FH
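
As a consistency check between the two sequences, the gene sequence can be translated and compared with the protein sequence. The Python sketch below does this for the first and last rows of the gene sequence only. It assumes the 10-character groups are purely a display convention, and it uses the standard codon assignments, which translation table 11 shares for elongation codons; alternative start codons are not handled. The names `CODON_TABLE`, `clean`, and `translate` are illustrative and not part of any tool behind this record.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
# Every full row holds 60 bases, so the last row also starts in frame.
first_row = "ATGAGCACTC GCGAACAGCA AATCGCCGCC CTCGAAAAAG ACTGGGCTGA AAACCCCCGC"
last_row = "GCGCTGACCG GCTCGACCGA AGAAGAACAG TTCCACTAA"

print(translate(first_row))  # MSTREQQIAALEKDWAENPR
print(translate(last_row))   # ALTGSTEEEQFH
```

Both results match the corresponding ends of the protein sequence listed above; the full sequence can be checked the same way by passing all rows to `translate`.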