Gene Saro_0642 details


Gene Information

Locus tag: Saro_0642
Symbol:
ID: 3918067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 681134
End bp: 682756
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 62%
IMG OID: 640443373
Product: Phage or plasmid primase P4-like
Protein accession: YP_495923
Protein GI: 87198666
COG category: [R] General function prediction only
COG ID: [COG3378] Predicted ATPase
TIGRFAM ID: [TIGR01613] phage/plasmid primase, P4 family, C-terminal domain
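
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 681134
end_bp = 682756
gene_length_bp = 1623
protein_length_aa = 540

# Assumption: coordinates are 1-based and inclusive,
# so the length is end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not
# counted as a residue in the protein length.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length, computed_protein_length)
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields.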


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0349968
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCGGC TTTCCTTGCT CGAGCGTTCG CGCCTTCCGA CCAATGATCT TGGCAATGCG 
CGCCGGCTGT TCGAGGCAGC AAACGGACGC CTGTTGTGGT TGGCCGATGG CGCCGGCGGC
AAGGGTTGCT GGATCGCTTT CGATGGCATC CGCTGGTCGG CCGACGAAGG TCCGATGCGC
GCGCTGGCAT TTGCCCAGAA GGCAGCGGTC GAGATCTGCG ACGAGGCGCA CGCCTTGCGC
GAATGCACGG CTGACGAGTT GGCCGAGGTC TATGGGCGCA AGTTCTCCAA AGAGATGGCC
GAGGAAAGAG CGGGACAGCT TTGGACCTGG TCGATCAAGT CGGGCGATTC CGCCAAGACG
ACTGCAATGC AGAACCAGTT CAAGGGGCTC CGCGACGGTG ATGAGGGTCC GTTCGTCACA
CAGGTATGGC AGCGCGACTT CGATGCGCAG CCCATGGCCT ACCACTGCAG CAACGGAACC
TTGCGGTTCG TCCAGGATGA CGCCGGGACT TGGAGCCACG TTTTCGAGAA GGGGCACCGG
CCAGACGATC GGTTCATGCA GGTGGCAAAC GTCGCCTATG ATGCCGCAGC CAAGGCGAAG
GCATGGATCG AGCGCATGGA AGTGATGCAC CATGATCCGG TGCAGCGGAC TGCCCTGCAG
CGGATCTACG GCATGACGCT GACTGCGCTG ATATCCGACC AGGCGTTCTA CATCTTCCAA
GGCAAGGGGC AGGACGGCAA GTCCGTCACC AATGACGTCG TCTGCCAGCT CCATGGCATG
TACGCCCGCA AGGCCGACCC GAAGACCTTC CTCGAGGGAC CGACTCAGCA GAGCAGCGGG
CCTCAGAGCG ACATCGTGCG TCTTGCCGGC GACGTCCGCC TGGTGGTGAT GGACGAACCA
AAGAAGAACA GCACGTGGGA CGGCCAGAAG ATCAAGCAGG CCACGGGTAG CGAGATGATC
GCGCGTGGCG TGCATGCGAC CACGGAATTG AGCTTTACGC CGCACTGGCA GCTCATCGCC
GAGTGCAACG GCCTGCCCAA GGCACCGAGC GACGATCGCG GCTTCAGGCG TCGCTTCAAG
CTCTATCCTT GGGTTGTCCA GTTCGGCGTC ACGCCTGGTG TTGCTGATGA GCCGGTGCAC
CTGGTGAAGG CACGCCTGAT CGGAGAAGGA TCGGGCGTTC TCAACTGGAT GATCAAGGGC
TGCGTCGAAT GGCTGAATGA ACGCGTTGTG CCGGAGCCGG AGGCCGCCAA GCGCGCGACG
GCGAGCTTCT GGTCTGCCAG CTCGGCCATG GGCGAGTGGA TCGCCTCACA CTGCGACCTG
TCCGATCCTG AAGCCCGCGA GGAAGCGACG CCGCTGTACA AGGCGTTCCG GCAATTCTGC
ATAGATCGCG GCGACGATGA AACGAAGATC ATCACCCAGA CGACTTTTGG GCGGCAGTTG
AACGATGCGC AGATCTATCG CGTTCCGAAC AATTCGACCG GCAAGGTTGA GCGTGTGGGC
ATCCGCCTCA AGCGCGTTGA TGAGCTGGGT GGCGGTGCGC TGTCCACCAG TGGCCGCGAC
ACCTTCGACG ATGATGTCGC GCGCTTCGAC GCTGACAATC GCGACCCCTT CGGCGCGCCG
TGA
 
Protein sequence
MKRLSLLERS RLPTNDLGNA RRLFEAANGR LLWLADGAGG KGCWIAFDGI RWSADEGPMR 
ALAFAQKAAV EICDEAHALR ECTADELAEV YGRKFSKEMA EERAGQLWTW SIKSGDSAKT
TAMQNQFKGL RDGDEGPFVT QVWQRDFDAQ PMAYHCSNGT LRFVQDDAGT WSHVFEKGHR
PDDRFMQVAN VAYDAAAKAK AWIERMEVMH HDPVQRTALQ RIYGMTLTAL ISDQAFYIFQ
GKGQDGKSVT NDVVCQLHGM YARKADPKTF LEGPTQQSSG PQSDIVRLAG DVRLVVMDEP
KKNSTWDGQK IKQATGSEMI ARGVHATTEL SFTPHWQLIA ECNGLPKAPS DDRGFRRRFK
LYPWVVQFGV TPGVADEPVH LVKARLIGEG SGVLNWMIKG CVEWLNERVV PEPEAAKRAT
ASFWSASSAM GEWIASHCDL SDPEAREEAT PLYKAFRQFC IDRGDDETKI ITQTTFGRQL
NDAQIYRVPN NSTGKVERVG IRLKRVDELG GGALSTSGRD TFDDDVARFD ADNRDPFGAP
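
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. This is a hedged example, not the database's own method. It assumes the sense strand is shown 5' to 3', and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons (table 11 differs mainly in which codons may act as start codons, which does not matter here because the gene starts with ATG). The function name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna):
    """Translate a DNA string, ignoring the spaces and line breaks
    used to split the sequence into blocks of 10 bases."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAGCGGC TTTCCTTGCT CGAGCGTTCG CGCCTTCCGA CCAATGATCT TGGCAATGCG"
print(translate(first_line))
```

The 60 bases on the first line translate to the first 20 residues of the protein sequence (its first two blocks). Passing the full gene sequence to the same function should reproduce the whole protein, stopping at the terminal TGA.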