Gene Daro_3021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3021 
Symbol 
ID3568689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3266400 
End bp3267863 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content59% 
IMG OID637681492 
Producthypothetical protein 
Protein accessionYP_286221 
Protein GI71908634 
COG category[S] Function unknown 
COG ID[COG2308] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value0.0262026 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.733965 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCTG CTGAAACCTG TACGAGAGCA TCCTCAGCGA CGAGTGGAAA AATGAACTTC 
TATAACGAGA TGTATGACGC CAATGGCGGT GTCCGTGCGC ACTACAAGGG GTACGAAGAC
TGGCTCAAGG CAACTCCACC CGAGCGCATT GAGCGTAAGC GGGCCGAAGC CGATCTGGCC
TTTCATCGGG TCGGTATTAC CTTTGCGGTC TATGGCGAGG AGGCTGGCAA GGAGCGCCTG
ATTCCCTTCG ACATCATTCC CCGCGTCATT CCATCGACTG AGTGGAAGGC GCTGCAGTCC
GGGCTGCGCC AGCGCGTCAA GGCGTTGAAC ATGTTCCTGT GGGACGTCTA CCACGATCAG
GAGATTCTAA AGGCCGGCAT CATTCCTGCC GAGCAGGTGC TGAACAACGC GCAGTATCGT
CCGGTCATGA AAGGTGTTGA TGTGCCGGGC GGGATTTACG CGCACATCAC CGGCGTTGAT
ATTGTGCGGG CCGGTGAGGG CGAGTTCTAC GTGCTGGAAG ACAATCTGCG CGTACCGTCC
GGCGTGTCAT ACATGCTTGA AGATCGCAAG ATGATGATGC GTCTTTTCCC CGAACTGTTT
GCCAAGCACA AGGTGGCACC GGTTCAGCAT TACCCGGACA TGCTGCTGGA GAAACTGCGC
GCCGTGGCGC CACAGGGTGT ATCGAACCCG ACAGTCGTCG TGCTGACGCC GGGTGCCTAC
AACAGCGCCT ATTTCGAACA CACCTTCCTC GCCCAGCAGA TGGGCGTCGA GTTGGTCGAA
GGTCGTGACC TGTTCGTCAA GGACGAAGTG GTCTATATGC GGACGACGCA GGGGCCGCAG
CGGGTTGATG TGATCTACCG CCGCCTCGAC GATGACTTCA TGGACCCGAC AGTCTTCCGC
GAAGATTCAT CGCTCGGCGT GCCGGGCATC ATCCGAGCCT ATCAGGCCGG CAATGTGACG
CTGGCGAACG CGGTTGGCAC TGGTGTCGCC GATGACAAGT CGATCTATCC CTACGTGCCG
GAAATGATTC GCTTCTACCT CGGTGAGGAA CCGAAGCTGA ATAATGTACC GACCTACATG
TGCCGCAAGC CGGATGATCT GGCCTACGTG CTTGATCACC TGCCGGAACT GGTGGTCAAG
GAAGTGCATG GCGCCGGTGG TTACGGCATG TTGGTCGGCC CGGCTTCGAC CAAGGAGCAG
ATCGAACATT TCCGCAAGTT GCTGATCGAC AAGCCGGATG GCTACATTGC CCAGCCGACG
CTGGCGCTGT CCAACTGTCC GACTTTCGTC GAAGAGGGCA TCGCGCCGCG CCACCTTGAC
CTGCGCCCCT TCGTCCTGTC GTCTGGAGAG TGCGTGAACA TGGTGCCTGG CGGCCTGACT
CGCGTCGCGC TGACCAAGGG CTCGCTGGTC GTGAATTCGT CGCAGGGCGG CGGTACCAAA
GACACCTGGG TTCTGGAGGA TTAA
 
Protein sequence
MEPAETCTRA SSATSGKMNF YNEMYDANGG VRAHYKGYED WLKATPPERI ERKRAEADLA 
FHRVGITFAV YGEEAGKERL IPFDIIPRVI PSTEWKALQS GLRQRVKALN MFLWDVYHDQ
EILKAGIIPA EQVLNNAQYR PVMKGVDVPG GIYAHITGVD IVRAGEGEFY VLEDNLRVPS
GVSYMLEDRK MMMRLFPELF AKHKVAPVQH YPDMLLEKLR AVAPQGVSNP TVVVLTPGAY
NSAYFEHTFL AQQMGVELVE GRDLFVKDEV VYMRTTQGPQ RVDVIYRRLD DDFMDPTVFR
EDSSLGVPGI IRAYQAGNVT LANAVGTGVA DDKSIYPYVP EMIRFYLGEE PKLNNVPTYM
CRKPDDLAYV LDHLPELVVK EVHGAGGYGM LVGPASTKEQ IEHFRKLLID KPDGYIAQPT
LALSNCPTFV EEGIAPRHLD LRPFVLSSGE CVNMVPGGLT RVALTKGSLV VNSSQGGGTK
DTWVLED