Gene Saro_0444 details

Gene Information

Locus tag: Saro_0444
Symbol:
ID: 3918312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 485699
End bp: 487273
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 66%
IMG OID: 640443173
Product: protein of unknown function DUF853, NPT hydrolase putative
Protein accession: YP_495726
Protein GI: 87198469
COG category: [R] General function prediction only
COG ID: [COG0433] Predicted ATPase
TIGRFAM ID:
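
The reported gene and protein lengths follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (which matches the reported 1575 bp) and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(485699, 487273)
print(length)                  # 1575
print(protein_length(length))  # 524
```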


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0693836
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGATA TCTACCTCGG TCTGGGTTCG AACGGCGAAC GGCAGGTGCT GCGCCTGGGC 
CGGGCAAACC GCCACGGGCT CATCGCGGGG GCGACCGGCA CCGGCAAGAC CGTGACCTTG
CAAGGCATTG CCGAACAGTT TTCGGCGCGC GGCGTACCCG TGTTCATGGC CGACGTGAAA
GGCGACCTTG CGGGCATAGC CATGCCTGGC AGCCCCACGT TCAAGCATGC CGCCTCGCTG
GAGGCGCGCG CGAAGGAACT GGGCATCGCC GACTACGCCT ATTCCGACAA TCCGGCGGTG
TTCTGGGATC TCTACGGCGA AAGCGGGCAC CCCATCCGCA CCACCATTTC GGAAATGGGG
CCATTGCTGC TGGCGCGCCT GATGGGGCTC AACGAGACGC AGGAAGGCGT GCTCAACATC
GCGTTCCGCT ATGCCGACGA CAACGGCTTG CTGCTGATCG ACCTTGCGGA TCTGCAATCG
GTGCTGGTTG CCTGCGCCGA GAACGCGAGC GAGCTTGCCA CGCGCTACGG CAATGTCTCG
AAGGCCAGCG TGGGCACGAT CCAGCGCCAG CTCCTTGCCT TCGAAAGCCA GGGGGCAGAT
CGCTTCTTTG GCGAGCCTGC CTTCGAGATC AACGATTTCC TCAAGGTGGA CGAGCAGGGA
CGCGGCATGG TCAACGTGCT GGCCGCCGAG AAGCTGATGC AGAGCCCCAA GCTCTACGCG
ACTTTCCTGC TGTGGCTTCT GTCCGAACTG TTCGAGGCGC TGCCCGAAGT GGGCGACCCG
GAGAAGCCGT GCCTCGTGTT CTTCTTCGAC GAGGCGCACC TCCTGTTCGA GGACGCGCCA
CAGGCGCTGA TGGAAAAGGT CGAGCAGGTC GTCCGCCTGA TCCGCTCGAA GGGCGTCGGT
GTGTTCTTCG TCACGCAGAA CCCGATCGAC ATTCCCGAGA AGATCGCGGG CCAGCTCGGC
AACCGGGTGC AGCACGCCCT GCGCGCCTTC ACCCCGCGCG ACCAGAAGGC GATCCGCGCC
GCCGCCGAGA CTTTTCGCAT CAACAAGGAT CTCGATGTCG AGACGGTCAT CACCGAACTG
AAGGTGGGCG AGGCGCTGGT TTCCACGCTG GAGGAGGACG GTGCGCCCTC GGTGGTCCAG
CGCACGCTGA TCGCGCCGCC CCGCTCGCGC CTCGGACCGC TGACGCCGAA GGAGCGCGCA
ATCATCCAGT CCGTCAGCCC GTTCGATGGA AAGTACGACA CGGCGGTGAA CCGCGAATCC
GCCGCAGAGA TTCTCGCACG CAAGGCATCT GACGCCGCCG CCGCCGCGCA GCAGATCGAG
GCCGAGGGCG AGGACAGCCA GCGCGCCTCG GCGCGGAGGT CGCCTTCGAT GTGGGAGCGC
GCCGGAAAGG CCGCCGCAGG GGCAGTGGCA TCGTCGGCCG GCGCGGTCAT CGCGGCGCAG
ATCACCGGCA AGAAATCGCG CGCGGCGCCG ATGGCTTCGG GGATCACGGC GATGGCGGGC
TCTATCGCCT CTTCAATCGG CGGCGAGGCC TTCGGGCGAT TTGCCCGCGG AATCCTGGGC
GGGCTGCTGC GCTAG
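
The gene sequence above is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before computing a length or GC content. The following is a minimal sketch, with hypothetical helper names; it is shown on the first row only, and the full sequence would be passed in the same way to compare against the reported 1575 bp and 66% GC:

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First row of the gene sequence, copied as displayed.
first_row = "ATGGACGATA TCTACCTCGG TCTGGGTTCG AACGGCGAAC GGCAGGTGCT GCGCCTGGGC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.1%}")
```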
 
Protein sequence
MDDIYLGLGS NGERQVLRLG RANRHGLIAG ATGTGKTVTL QGIAEQFSAR GVPVFMADVK 
GDLAGIAMPG SPTFKHAASL EARAKELGIA DYAYSDNPAV FWDLYGESGH PIRTTISEMG
PLLLARLMGL NETQEGVLNI AFRYADDNGL LLIDLADLQS VLVACAENAS ELATRYGNVS
KASVGTIQRQ LLAFESQGAD RFFGEPAFEI NDFLKVDEQG RGMVNVLAAE KLMQSPKLYA
TFLLWLLSEL FEALPEVGDP EKPCLVFFFD EAHLLFEDAP QALMEKVEQV VRLIRSKGVG
VFFVTQNPID IPEKIAGQLG NRVQHALRAF TPRDQKAIRA AAETFRINKD LDVETVITEL
KVGEALVSTL EEDGAPSVVQ RTLIAPPRSR LGPLTPKERA IIQSVSPFDG KYDTAVNRES
AAEILARKAS DAAAAAQQIE AEGEDSQRAS ARRSPSMWER AGKAAAGAVA SSAGAVIAAQ
ITGKKSRAAP MASGITAMAG SIASSIGGEA FGRFARGILG GLLR
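
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an assumption-laden illustration rather than the database's own method: it builds the codon table from the compact NCBI amino-acid string, whose assignments are the same for translation tables 1 and 11, and it ignores the alternative start codons of table 11 (not needed here, since this gene starts with ATG). The name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGACGATA TCTACCTCGG TCTGGGTTCG"))  # MDDIYLGLGS
print(translate("GGGCTGCTGC GCTAG"))                  # GLLR
```

The two calls use the first 30 bases and the final 15 bases of the gene sequence; their output matches the start and end of the protein sequence listed above.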