Gene Daro_0019 details


Gene Information

Locus tag: Daro_0019
Symbol:
ID: 3570043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 24827
End bp: 25903
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 61%
IMG OID: 637678448
Product: SMF protein
Protein accession: YP_283248
Protein GI: 71905661
COG category: [L] Replication, recombination and repair; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake
TIGRFAM ID: [TIGR00732] DNA protecting protein DprA
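
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the gene length counts one terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(24827, 25903)
print(length)                     # 1077
print(protein_length_aa(length))  # 358
```

Both results match the Gene Length and Protein Length values listed in the record.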


Plasmid Coverage information

Num covering plasmid clones: 57
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.108355
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCCATA ACGACAGTCT GGCCGCCTGG TTGCGGCTGA CCCTGATCCC GGGCATCGGC 
GGCGAGACGC AAAGAAAGCT TCTCGCCGCT TTCGGTTTAC CGGAAGCCAT TTTCTCGGCT
GGCCGCCTGG AAGCACGCGG CGTCATCGGT AACCGCGCCG ATCTGCTGTT CGATTTTGAT
CCGACGGAAG CGGTAGCACA CAGTCTCGAA TGGGCCAGGC AACCGGGGCA ACACATCATC
TCGCTGGCCG ACGAAGCCTA CCCGAAAGCA CTGCTCGAAA TAGCCGACCC GCCCAGCCTG
CTCTACGTAC GCGGCAACCT AGCCCTGCTC CAGAAGCGCG GACTGGCCAT GGTTGGTAGC
CGCAATGCAA CACCGCAAGG CGTGCAAACC GCCGAAAACT TCGCCAAAAC GCTGGCCGCC
AAGGGTCTGA CAATCATTAG CGGACTGGCA CTGGGGATTG ATGCCGCCGC CCACCGTGGC
GCCCTGGCTG CCAAGGGGGA AACCATCGCG GTGATCGGCA CCGGGCCCGA CCGCATCTAC
CCGGCACGCA ACAAGGAGCT GGCTTTGGCG ATTGTCGAAT CCGGTGCGAT CGTTTCCGAA
TTCCCGCTCG GCACACCGGC CATCGCTTCA AATTTCCCAA GGCGCAATCG GATCATTTCC
GGACTATCGT GCGGCGTACT GGTGGTCGAA GCGGCGCCGG AAAGTGGCTC GCTGATCACG
GCGCGGCTTG CCGCAGAGCA GGGGCGTGAA GTTTTCGCCA TTCCCGGCTC GATCCACTCA
CCAGTTGCTC GTGGTTGCCA CAAATTGATC AAGCAGGGTG CCAAGCTGGT TGAAACCGCT
ACCGACATCC TGGAGGAGCT GGGCAGTTTC AACGCAGCTC CCGCAGCAGA CATCCCATCG
GATAAGGCCG ATGAAGGGCC GATTCTCACT GCACTTGGCC ACGATCCATG CAGCCTTGAC
GACCTCGTCG AACGAACCAC CATGAGCGCC GATCAGTTAC TGCCGGAACT CCTGACACTG
GAGCTTTGCG GCCTGATCGC CACCCTGCCC GGTAACCGCT ACCAGCGCCT GAACTAG
 
Protein sequence
MSHNDSLAAW LRLTLIPGIG GETQRKLLAA FGLPEAIFSA GRLEARGVIG NRADLLFDFD 
PTEAVAHSLE WARQPGQHII SLADEAYPKA LLEIADPPSL LYVRGNLALL QKRGLAMVGS
RNATPQGVQT AENFAKTLAA KGLTIISGLA LGIDAAAHRG ALAAKGETIA VIGTGPDRIY
PARNKELALA IVESGAIVSE FPLGTPAIAS NFPRRNRIIS GLSCGVLVVE AAPESGSLIT
ARLAAEQGRE VFAIPGSIHS PVARGCHKLI KQGAKLVETA TDILEELGSF NAAPAADIPS
DKADEGPILT ALGHDPCSLD DLVERTTMSA DQLLPELLTL ELCGLIATLP GNRYQRLN
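
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own procedure: it strips the 10-base block spacing, translates with the amino-acid assignments of NCBI translation table 11, and computes a GC percentage. It assumes the input starts at the initiation codon (so the first codon, here GTG, is reported as M), that the sequence is given on the coding strand, and that it contains only A, C, G, and T. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino-acid assignments in TCAG order; for translation table 11 these are
# the same as the standard code (only the set of start codons differs).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for block formatting and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first codon up to the first in-frame stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        # Assumption: position 0 is the initiation codon, which is read as Met
        # even when it is an alternative start such as GTG.
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGAGCCATA ACGACAGTCT GGCCGCCTGG TTGCGGCTGA CCCTGATCCC GGGCATCGGC"
print(translate_cds(first_line))  # MSHNDSLAAWLRLTLIPGIG
```

The printed residues match the first two blocks of the protein sequence listed above. Passing the full gene sequence to `translate_cds` and `gc_percent` would extend the same check to the whole record.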