Gene Saro_2656 details


Gene Information

Locus tag: Saro_2656
Symbol:
ID: 3918430
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2893230
End bp: 2894366
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 67%
IMG OID: 640445433
Product: DNA polymerase IV
Protein accession: YP_497926
Protein GI: 87200669
COG category: [L] Replication, recombination and repair
COG ID: [COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair
TIGRFAM ID:
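
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Number of residues, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(2893230, 2894366)
print(length_bp)                  # 1137
print(protein_length(length_bp))  # 378
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.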


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCCTC GATCGGCAGA TGGCGAACCG GATCGCAGCG CGACCGGGCT GCGCAAGGTC 
ATCCATGTCG ACATGGACGC CTTCTTCGCC AGCGTCGAGC AACGCGACAA TCCGGACCTG
CGCGGAAAGC CGGTCGCGGT CGGCGGTTCC TCCGGGCGCG GGGTCGTTGC CGCGGCAAGT
TACGAGGCCC GCAAGTTCGG CGTACGATCG GCTATGCCTT CGGCGAGGGC CATCGTGCTG
TGTCCCGACC TGATCTTCTG CAGGCCGCGA TTCGATGTCT ATCGCTCCGT TTCCCAGCAG
ATCCGCGCCA TCTTCCTCGA CTACACGCCC CACGTCGAAC CCCTGTCGCT CGACGAGGCC
TATCTCGACG TGACCGATGA CGTGCGCGGC ATCGGTTCCG CCACGCGGAT CGCCGAACTC
ATCCGCCGGC GGATCAAGGC CGACACCGGG CTGACCGCCA GTGCCGGTGT GTCCTACAAC
AAGTTCCTCG CCAAGATCGC GAGCGACCAG AACAAGCCCG ACGGCATGTG CGTGATCCGG
CCCGGCGAGG GCGCGCAGTT CGTCGCCAGC CTTCCGGTGC GGCGCTTCCA CGGCATCGGA
CCGCGCGGTG CGGAAAAGAT GGCGGCGCTC GGGATAGAGA CGGGCGCGGA TCTGCGTGCC
AGGGACCTGC CTTTCCTGCG CCAGCATTTC GGCAGCCTCG CGGACTATCT CTACCGGGCG
GTGCGGGGCA TCGACCTGCG CCAGGTGAAG GCCGACAGGC CGCGCAAGTC GGTCGGCGCG
GAGCGGACGT TCGAGCGTGA CATTTCGTCC GGCCCGGCCT TGCGCGAAAC GCTGGAGCGC
ATCCTGGAGA TCGTGTGGGA TCGGATCGAG CGCAGCGGGG CCAGCGGTCG GACGGTCACC
CTCAAGATGA AATTCAACGA CTTCACCCCA ATTACCCGTG CCCGCTCCCT GCCGCGCCCG
ATCGCAGACA AGGAGGAATT TGCCCGGCTG TCGCGTGAAC TGCTCGATGC GCAACTGCCG
CTTGCCAAGC CGATCAGGCT GATGGGGCTG ACGCTGTCCG CTCTCGAGGG CGAGGAGCCG
GAAGAGGCCG AGGACGGTCC CTCCGGCGCA GCGCTTCAAG CAGAACTGCC CTTCTGA
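
The GC content field can be recomputed from the gene sequence once the 10-base grouping is removed. This is a minimal sketch, assuming GC content is the fraction of G and C bases over the full sequence length; the helper names are hypothetical, and only the first sequence line is used here for brevity.

```python
def clean_sequence(text):
    """Strip the spaces and line breaks of the 10-base grouped layout."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGGACCCTC GATCGGCAGA TGGCGAACCG "
    "GATCGCAGCG CGACCGGGCT GCGCAAGGTC"
)
seq = clean_sequence(first_line)
print(len(seq))                      # 60
print(f"{gc_fraction(seq):.0%}")     # GC content of this line only
```

Passing the complete gene sequence to `clean_sequence` and `gc_fraction` gives the value to compare against the GC content field.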
 
Protein sequence
MDPRSADGEP DRSATGLRKV IHVDMDAFFA SVEQRDNPDL RGKPVAVGGS SGRGVVAAAS 
YEARKFGVRS AMPSARAIVL CPDLIFCRPR FDVYRSVSQQ IRAIFLDYTP HVEPLSLDEA
YLDVTDDVRG IGSATRIAEL IRRRIKADTG LTASAGVSYN KFLAKIASDQ NKPDGMCVIR
PGEGAQFVAS LPVRRFHGIG PRGAEKMAAL GIETGADLRA RDLPFLRQHF GSLADYLYRA
VRGIDLRQVK ADRPRKSVGA ERTFERDISS GPALRETLER ILEIVWDRIE RSGASGRTVT
LKMKFNDFTP ITRARSLPRP IADKEEFARL SRELLDAQLP LAKPIRLMGL TLSALEGEEP
EEAEDGPSGA ALQAELPF
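
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes the sequence is given in the coding orientation and starts in frame. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits, so the standard assignments are used here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate in-frame DNA, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove 10-base grouping
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# Start of the gene sequence from the record.
print(translate("ATGGACCCTC GATCGGCAGA TGGCGAACCG"))  # MDPRSADGEP
# Last line of the gene sequence, which begins on a codon boundary.
print(translate(
    "GAAGAGGCCG AGGACGGTCC CTCCGGCGCA GCGCTTCAAG CAGAACTGCC CTTCTGA"
))  # EEAEDGPSGAALQAELPF
```

Both outputs match the corresponding ends of the protein sequence listed above.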