Gene Information |
Locus tag | Saro_0105 |
Symbol | |
ID | 3915991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 107549 |
End bp | 109435 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640442830 |
Product | heparinase II/III-like |
Protein accession | YP_495388 |
Protein GI | 87198131 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
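The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS includes its stop codon (the gene sequence in this record ends in TAG); the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes its stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(107549, 109435)
print(length)                     # 1887
print(protein_length_aa(length))  # 628
```

Both results agree with the Gene Length (1887 bp) and Protein Length (628 aa) fields of this record.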
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAACA GCGGCTTCCC CGTCATCGGA AGCCGCCAGG CGGCCAAGGC GCCGCACGAG TCGGCGATTC CGCTGGCAGG CTCGAGGGAG GAACCGCTGG TCGATATCGC TGACAGCTAT CTTCCCCTCG ACGAGGACCA GACCGCCGCT CCCATCGCCG ATCCGACCGC GGTGGAGCCG GGACGTTCGC TCGCGCTTGC CGATTTCGCC CCGCCCGCAC TTGGCGCGGG AGACCGGCTG GTCCGGCTCG CCTATCGGAT GGGCCTTCCC GCGAGCGCGA TCCATCCCTT CCGCAAGCGG GCGAAGACAA GGCTGACGGC AACAGTTACG CCGCCCCTCC CCGGAGATCC GGCGGCCGGC AAGGCACTGC GTGCGGGGCA TTTCCTAGTC CACGGCTTCA AGTCGCCAAT TGCCGACACC GCATTTTCGG GCCCGCGCCT GCCACCGCCG TTCGAACGGA TGGTTCACGG CTTTCGGTGG TTGCGCGATC TCGAGTCGGG CGGGACGCGG GCGCAGTGCA CGCAAGTGGC CGAACGCATC CTGGCAACCT GGCTGAAGGC AAACCCCAAG CCGAACCCGA CGCCCGCATG GGATGTGGGC AACGTCGGCC ATCGCCTGCT CAACTGGATG ATCCATGCGC CGCTCGTCCT GTCTGGCCAG GACCGCGGCT TCCGCAGCCG CATGCTGCAC ACGATCGAGG ATACCGCACG CTGGCTCGAC CGGCACGTCG CCAAGGCTGA CGACCGGCTA GGTGAAGTGG CGGGCTGGTG CGCCATCGTT GCCACCGGCC TGCTCATGGC CGACGGAAAG CCACGCCGCC TCTATGGCGA AGCCGGCCTA GTACGCGCGC TGGGCGAACT GGTCAGCGAC GATGGCGGCG TATTGTCGCG CAGCCCGCTC TGCCAGATCG AGGCGATAGA ACTGCTCGTC AGCCTGCGCG CCTGCTACGA CGCGATACGG TCGGAACCGC TGCCGCAGAT CGGGACCATG CTGAACCTGC TGGTTCCGCC GCTGCTAGCG CTGCTGCATG GCGATGGCGG ACTGGGCAAC TGGCAGGGCG CAGGGGCCAT CGAGGCTGAC CGGATCGAGG AACTGGTCCG GGCAACCGGC GTGCGCACCC GTCCGCTGCG CGATGCACGC CAGTGGGGTT ACCAGCGCGC AACGGCAGGC AAGGCCGTGC TCCAGTTCGA TGCAGGGCCA CCACCCGTGG CCCGCCACGC GCGCGACGGA TGCGCTTCTA CCCTGGCTTT CGAGTTCAGC CATGGACCGG ACCGGCTCAT CGTCAACTGC GGCGGCGCGG CGTTTGCCGG CGGACTGATT CCCCTGCGGC TTGAGCAGGG CCTGCGCGCG ACGGCTGCGC ATTCGACGCT GACCATCGAC GATTTCAACT CAACCGCAGT CCTCATCAAC GGTCGCCTCG GTTCGGGCGT TTCGGAGGTC GAGGTCGACA GGCGTACGCT TTCCGCCGAC GGCAACGGTC CGGGCGCGAC GCGCATCGAG GCCAGCCACA ACGGCTATGT GGGGCGCTAT GGCCTGACCC ATCGCCGCAT CCTGATCCTG CGCGACGATG GCAGCGAACT GCGCGGCGAA GACCTTCTGG TGCCAGCAGG GCGCAAGGGC AAGCGCGGAA CCATCGGCGT CGCCTTGCGA TTCCATCTCG GTCCGCATAT CGAGCTCGCC ACCAGTGCGG ACGGGAAAGG CGTGACGCTC GCCCTGCCCG ACGGGAGCCT GTGGCAGTTC CGCTCGGGCC GCGATGCGGT GTCGGTCGAG GAAAGCCTCT GGGCAGACGG GCAGGGACGC 
CCGCTGGCAA CGCGCCAGCT TGTCGTCACA GCCAAGGTTC CACGCAGCGG AGAGAGCTTC TCCTGGCTGC TCAAGAAGAT GAGATAG
|
Protein sequence | MENSGFPVIG SRQAAKAPHE SAIPLAGSRE EPLVDIADSY LPLDEDQTAA PIADPTAVEP GRSLALADFA PPALGAGDRL VRLAYRMGLP ASAIHPFRKR AKTRLTATVT PPLPGDPAAG KALRAGHFLV HGFKSPIADT AFSGPRLPPP FERMVHGFRW LRDLESGGTR AQCTQVAERI LATWLKANPK PNPTPAWDVG NVGHRLLNWM IHAPLVLSGQ DRGFRSRMLH TIEDTARWLD RHVAKADDRL GEVAGWCAIV ATGLLMADGK PRRLYGEAGL VRALGELVSD DGGVLSRSPL CQIEAIELLV SLRACYDAIR SEPLPQIGTM LNLLVPPLLA LLHGDGGLGN WQGAGAIEAD RIEELVRATG VRTRPLRDAR QWGYQRATAG KAVLQFDAGP PPVARHARDG CASTLAFEFS HGPDRLIVNC GGAAFAGGLI PLRLEQGLRA TAAHSTLTID DFNSTAVLIN GRLGSGVSEV EVDRRTLSAD GNGPGATRIE ASHNGYVGRY GLTHRRILIL RDDGSELRGE DLLVPAGRKG KRGTIGVALR FHLGPHIELA TSADGKGVTL ALPDGSLWQF RSGRDAVSVE ESLWADGQGR PLATRQLVVT AKVPRSGESF SWLLKKMR
|
| |
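The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the record could be checked, the code below strips that whitespace, computes a GC fraction, and translates codons using the amino-acid assignments of translation table 11, which are the same as those of the standard code. All function names are hypothetical. Alternative start codons, which table 11 permits, are not handled; this gene begins with ATG, so that does not matter here. Only the first and last few codons of the gene are used as input to keep the example short.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon or the end of input."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # "X" for unknown codons
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGGAAAACA GCGGCTTCCC CGTCATCGGA"))  # MENSGFPVIG
print(translate("CTCAAGAAGA TGAGATAG"))               # LKKMR
```

The two outputs match the start (MENSGFPVIG) and the end (LKKMR) of the listed protein sequence. Passing the full gene sequence to `gc_content` would give the value to compare against the GC content field.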