Gene Daro_3552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3552 
SymbolpurT 
ID3566364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3804991 
End bp3806190 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content65% 
IMG OID637682025 
Productphosphoribosylglycinamide formyltransferase 2 
Protein accessionYP_286751 
Protein GI71909164 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) 
TIGRFAM ID[TIGR01142] phosphoribosylglycinamide formyltransferase 2 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value0.229559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCG GCACCCCGCT TTCCCCCTCT GCCCTGCGCG TCATGCTGCT CGGTGCCGGC 
GAACTCGGCA AGGAAGTGAT CATTGCCTTG CAACGCCTGG GCGTCGAAGT GATCGCTGTC
GACCGCTACG AGAACGCCCC TGGCCATCAG GTGGCTCACC GCGCCCACGT CATCTCGATG
ACCGATGGTG CCGCGCTGCG CCAGTTGGTC GAACAGGAAA AGCCGCACCT GATCGTGCCG
GAAATCGAAG CCATCGCCAC CGACATGCTG GTCGAGATCG AAGCGGCCGG CTTGGCCGAA
GTAATCCCGA CCGCCCGCGC CGCCAAGCTG ACCATGAACC GCGAAGGCAT CCGCCGGCTG
GCCGCCGAGG AACTTGGCCT GCCGACCTCG CCCTACAAAT TTGCGGATTC CCTGGCTGAA
TTGCAGGCTG CCATCGATGG CGGCATTGGC TATCCCTGCA TCGTCAAGCC GACCATGTCG
TCGTCCGGCA AAGGTCAGTC GCTGCTGCGC GGCCCGGACG ATGTCCAGAA GGCCTGGGAT
TACGCGGCCA GCGGTGGCCG CGTCAATCAG GGCCGGGTCA TCGTCGAAGG CTTCATCGAC
TTCGATTATG AAATCACCCT GCTCACCGTC CGCGCCCGCG ATACAGCCGG CGAAGTGGTG
ACTCACTTCT GCGAGCCGAT CGGCCACGTG CAGGTCGGCG GCGACTATGT CGAATCTTGG
CAGCCGCAGG CGATGAGCCC GGCTGCGCTG CAACGGGCGC AGGAAATTGC CGCCGCCGTG
ACCGGCAACC TCGGCGGTCG TGGCCTGTTC GGCGTCGAAC TGTTCGTCAA GGGCGACATG
GTCTGGTTCT CCGAAGTCAG CCCCCGGCCG CACGACACCG GGCTGGTCAC GCTGTGCTCG
CAGCGCTTCT CGGAATTCGA ACTGCACGCC CGCGCCATCC TCGGTTTGCC GGTCGATACC
GCGCTGCGCG AAGCGGGTGC TTCGGCCGTC ATTTACGGCG GCATGGAAGA AAAAGGCATT
GCCTTCGCGG GGCTGGAAGA GGCCCTGGCC GTACCGCGCA GCGACCTGCG CCTGTTCGGC
AAGCCGGAGT CCTTCAAGAA ACGCCGCATG GGCGTGGCCG TCGCCAATGG CGAGAGCACC
GACCAGGCCC GCGAACGGGC CAAGCTGGCG GCGAGCAAGG TTCGTCCGAC CAGAACCTGA
 
Protein sequence
MKIGTPLSPS ALRVMLLGAG ELGKEVIIAL QRLGVEVIAV DRYENAPGHQ VAHRAHVISM 
TDGAALRQLV EQEKPHLIVP EIEAIATDML VEIEAAGLAE VIPTARAAKL TMNREGIRRL
AAEELGLPTS PYKFADSLAE LQAAIDGGIG YPCIVKPTMS SSGKGQSLLR GPDDVQKAWD
YAASGGRVNQ GRVIVEGFID FDYEITLLTV RARDTAGEVV THFCEPIGHV QVGGDYVESW
QPQAMSPAAL QRAQEIAAAV TGNLGGRGLF GVELFVKGDM VWFSEVSPRP HDTGLVTLCS
QRFSEFELHA RAILGLPVDT ALREAGASAV IYGGMEEKGI AFAGLEEALA VPRSDLRLFG
KPESFKKRRM GVAVANGEST DQARERAKLA ASKVRPTRT