Gene Saro_1824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1824 
Symbol 
ID3918384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1926312 
End bp1928192 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content64% 
IMG OID640444566 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_497098 
Protein GI87199841 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.484584 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGACA TCAACTCCAA GCTGGAAATC GGCGTCACCA CCGGGCCGAT CCGCGGCAGC 
CGCAAGATCC ACGTGGAAAG CGCACGCTTT CCCGGCTTGA CGGTGGCGAT GCGCGAAATC
CAGCTCGAGC CTTCGAGCGG CGAGCCGCCG GTACGCGTCT ACGATACGTC CGGTCCGTAC
ACCGACCCCA AAGTGACCAT CGACATCGCG GCGGGCCTGC CCACTCTGCG TCGCGACTGG
ATCATGGCGC GCGGCGACGT GGAAGAATAC GACGCGCGCG AAGTGAAGCC CGAGGACAAT
GGCCTCAAGG GTCCGGATCG TTCTGCCGGG GTGCCGCCGT TCCCCAACGT CGTGAAGCGC
CCTTTGCGTG CAAAGGCGGG CCAGAACGTC AGCCAGATGC ACTATGCCCG CCGCGGGATC
ATCACGCCCG AGATGGAATA CGTGGCGATC CGCGAGAACC TTGGCCGCAA GCAGGCCAAG
GAAGCGATGA TCCGTGATGG GCAGGACTGG GGTGCGTCGA TCCCCGATTA CGTGACGCCC
GAATTCGTGC GCGACGAAGT GGCGCGTGGG CGGGCGATCA TCCCGTCGAA CATCAACCAC
CCTGAAAGCG AACCGATGGC GATCGGCCGC AACTTCCTGG TGAAGATCAA CGCCAACATC
GGCAATTCGG CTGTGGCATC CGACGTCGCG AGCGAAGTCG ACAAGATGGT CTGGTCGATC
CGCTGGGGTG CGGATACCGT GATGGACCTT TCGACGGGGC GGAACATCCA CGACACCCGC
GAATGGATCC TGCGCAATTC GCCGGTGCCG ATCGGCACGG TGCCGATCTA CCAGGCGCTC
GAAAAAGTCG GCGGCGTGGC CGAGGACCTG ACCTGGGAAG TGTTCCGCGA CACGCTGATC
GAACAGGCCG AGCAGGGCGT GGACTACTTC ACCATCCATG CCGGCGTGCG CCTTCCCTAC
ATCCCGCTTG CCGCCAAGCG CATGACCGGG ATCGTCAGCC GTGGCGGTTC GATCATGGCC
AAGTGGTGCC TTGCGCATCA CAAGGAGTCG TTCCTTTACG AGAACTTCGA CGAGATCACC
GAGATCATGA AGGCCTATGA CGTGGCCTAT TCGCTGGGCG ATGGCCTGCG TCCCGGATCG
ATCTACGACG CCAACGACGA AGCACAGTTC GCCGAACTCT ACACGCTGGG CGAACTGACC
AAGCGTGCCT GGGAACAGGA CGTGCAGGTG ATGATCGAGG GCCCCGGCCA CGTGCCGATG
CACAAGATCA AGGAGAACAT GACGAAGCAG CTCGAGGCGT GCGGCGAAGC GCCGTTCTAC
ACGCTCGGGC CGCTCGTCAC CGACATCGCG CCGGGATACG ACCACATCAC CAGCGGCATC
GGTGCGGCAC AGATCGGCTG GTACGGCACG GCGATGCTCT GTTACGTCAC GCCCAAGGAG
CATCTCGGCC TGCCGGATCG CGATGACGTG AAGGTCGGCG TGGTGACCTA CAAGCTGGCA
GCCCACGCCG CCGACCTCGC CAAGGGCCAC CCTGCGGCAC AGGCGCGCGA TGACGCGCTG
AGCAAGGCGC GCTTCGAGTT CCGCTGGCGC GACCAGTTCA ACCTGTCGCT CGATCCGGAA
ACCGCCGAGC AGTACCACGA CCAGACGCTT CCGGCGGAAG GCGCAAAGAC CGCGCACTTC
TGCTCGATGT GCGGTCCGAA GTTCTGCTCG ATGAAGATCA GCCAGGAAGT TCGCGACTTT
GCCGCCAAGC AGAATGCCGG CATCGAGACG TTTGTCGCCA ACGAGGCCGA AGCCGAGGCC
GGCATGAAGG CGATGAGCGA CAAGTACGAC GAGATGGGCC GCGAACTGTA CATCGGCGCG
GGCGGGCGCG AGCACGACTG A
 
Protein sequence
MADINSKLEI GVTTGPIRGS RKIHVESARF PGLTVAMREI QLEPSSGEPP VRVYDTSGPY 
TDPKVTIDIA AGLPTLRRDW IMARGDVEEY DAREVKPEDN GLKGPDRSAG VPPFPNVVKR
PLRAKAGQNV SQMHYARRGI ITPEMEYVAI RENLGRKQAK EAMIRDGQDW GASIPDYVTP
EFVRDEVARG RAIIPSNINH PESEPMAIGR NFLVKINANI GNSAVASDVA SEVDKMVWSI
RWGADTVMDL STGRNIHDTR EWILRNSPVP IGTVPIYQAL EKVGGVAEDL TWEVFRDTLI
EQAEQGVDYF TIHAGVRLPY IPLAAKRMTG IVSRGGSIMA KWCLAHHKES FLYENFDEIT
EIMKAYDVAY SLGDGLRPGS IYDANDEAQF AELYTLGELT KRAWEQDVQV MIEGPGHVPM
HKIKENMTKQ LEACGEAPFY TLGPLVTDIA PGYDHITSGI GAAQIGWYGT AMLCYVTPKE
HLGLPDRDDV KVGVVTYKLA AHAADLAKGH PAAQARDDAL SKARFEFRWR DQFNLSLDPE
TAEQYHDQTL PAEGAKTAHF CSMCGPKFCS MKISQEVRDF AAKQNAGIET FVANEAEAEA
GMKAMSDKYD EMGRELYIGA GGREHD