Gene Saro_0105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0105 
Symbol 
ID3915991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp107549 
End bp109435 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content68% 
IMG OID640442830 
Productheparinase II/III-like 
Protein accessionYP_495388 
Protein GI87198131 
COG category[S] Function unknown 
COG ID[COG5360] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAACA GCGGCTTCCC CGTCATCGGA AGCCGCCAGG CGGCCAAGGC GCCGCACGAG 
TCGGCGATTC CGCTGGCAGG CTCGAGGGAG GAACCGCTGG TCGATATCGC TGACAGCTAT
CTTCCCCTCG ACGAGGACCA GACCGCCGCT CCCATCGCCG ATCCGACCGC GGTGGAGCCG
GGACGTTCGC TCGCGCTTGC CGATTTCGCC CCGCCCGCAC TTGGCGCGGG AGACCGGCTG
GTCCGGCTCG CCTATCGGAT GGGCCTTCCC GCGAGCGCGA TCCATCCCTT CCGCAAGCGG
GCGAAGACAA GGCTGACGGC AACAGTTACG CCGCCCCTCC CCGGAGATCC GGCGGCCGGC
AAGGCACTGC GTGCGGGGCA TTTCCTAGTC CACGGCTTCA AGTCGCCAAT TGCCGACACC
GCATTTTCGG GCCCGCGCCT GCCACCGCCG TTCGAACGGA TGGTTCACGG CTTTCGGTGG
TTGCGCGATC TCGAGTCGGG CGGGACGCGG GCGCAGTGCA CGCAAGTGGC CGAACGCATC
CTGGCAACCT GGCTGAAGGC AAACCCCAAG CCGAACCCGA CGCCCGCATG GGATGTGGGC
AACGTCGGCC ATCGCCTGCT CAACTGGATG ATCCATGCGC CGCTCGTCCT GTCTGGCCAG
GACCGCGGCT TCCGCAGCCG CATGCTGCAC ACGATCGAGG ATACCGCACG CTGGCTCGAC
CGGCACGTCG CCAAGGCTGA CGACCGGCTA GGTGAAGTGG CGGGCTGGTG CGCCATCGTT
GCCACCGGCC TGCTCATGGC CGACGGAAAG CCACGCCGCC TCTATGGCGA AGCCGGCCTA
GTACGCGCGC TGGGCGAACT GGTCAGCGAC GATGGCGGCG TATTGTCGCG CAGCCCGCTC
TGCCAGATCG AGGCGATAGA ACTGCTCGTC AGCCTGCGCG CCTGCTACGA CGCGATACGG
TCGGAACCGC TGCCGCAGAT CGGGACCATG CTGAACCTGC TGGTTCCGCC GCTGCTAGCG
CTGCTGCATG GCGATGGCGG ACTGGGCAAC TGGCAGGGCG CAGGGGCCAT CGAGGCTGAC
CGGATCGAGG AACTGGTCCG GGCAACCGGC GTGCGCACCC GTCCGCTGCG CGATGCACGC
CAGTGGGGTT ACCAGCGCGC AACGGCAGGC AAGGCCGTGC TCCAGTTCGA TGCAGGGCCA
CCACCCGTGG CCCGCCACGC GCGCGACGGA TGCGCTTCTA CCCTGGCTTT CGAGTTCAGC
CATGGACCGG ACCGGCTCAT CGTCAACTGC GGCGGCGCGG CGTTTGCCGG CGGACTGATT
CCCCTGCGGC TTGAGCAGGG CCTGCGCGCG ACGGCTGCGC ATTCGACGCT GACCATCGAC
GATTTCAACT CAACCGCAGT CCTCATCAAC GGTCGCCTCG GTTCGGGCGT TTCGGAGGTC
GAGGTCGACA GGCGTACGCT TTCCGCCGAC GGCAACGGTC CGGGCGCGAC GCGCATCGAG
GCCAGCCACA ACGGCTATGT GGGGCGCTAT GGCCTGACCC ATCGCCGCAT CCTGATCCTG
CGCGACGATG GCAGCGAACT GCGCGGCGAA GACCTTCTGG TGCCAGCAGG GCGCAAGGGC
AAGCGCGGAA CCATCGGCGT CGCCTTGCGA TTCCATCTCG GTCCGCATAT CGAGCTCGCC
ACCAGTGCGG ACGGGAAAGG CGTGACGCTC GCCCTGCCCG ACGGGAGCCT GTGGCAGTTC
CGCTCGGGCC GCGATGCGGT GTCGGTCGAG GAAAGCCTCT GGGCAGACGG GCAGGGACGC
CCGCTGGCAA CGCGCCAGCT TGTCGTCACA GCCAAGGTTC CACGCAGCGG AGAGAGCTTC
TCCTGGCTGC TCAAGAAGAT GAGATAG
 
Protein sequence
MENSGFPVIG SRQAAKAPHE SAIPLAGSRE EPLVDIADSY LPLDEDQTAA PIADPTAVEP 
GRSLALADFA PPALGAGDRL VRLAYRMGLP ASAIHPFRKR AKTRLTATVT PPLPGDPAAG
KALRAGHFLV HGFKSPIADT AFSGPRLPPP FERMVHGFRW LRDLESGGTR AQCTQVAERI
LATWLKANPK PNPTPAWDVG NVGHRLLNWM IHAPLVLSGQ DRGFRSRMLH TIEDTARWLD
RHVAKADDRL GEVAGWCAIV ATGLLMADGK PRRLYGEAGL VRALGELVSD DGGVLSRSPL
CQIEAIELLV SLRACYDAIR SEPLPQIGTM LNLLVPPLLA LLHGDGGLGN WQGAGAIEAD
RIEELVRATG VRTRPLRDAR QWGYQRATAG KAVLQFDAGP PPVARHARDG CASTLAFEFS
HGPDRLIVNC GGAAFAGGLI PLRLEQGLRA TAAHSTLTID DFNSTAVLIN GRLGSGVSEV
EVDRRTLSAD GNGPGATRIE ASHNGYVGRY GLTHRRILIL RDDGSELRGE DLLVPAGRKG
KRGTIGVALR FHLGPHIELA TSADGKGVTL ALPDGSLWQF RSGRDAVSVE ESLWADGQGR
PLATRQLVVT AKVPRSGESF SWLLKKMR