Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_1942 |
Symbol | |
ID | 5200343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 2175734 |
End bp | 2177461 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640581487 |
Product | heparinase II/III family protein |
Protein accession | YP_001262440 |
Protein GI | 148554858 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACG CGGGCGGGTC CGGCTTCGAC GATCGTGTGG AGCCCGGTCG GCGCCTGATC CGCGTCGAGC ATGACCGCAG CCATTCGCTC GCCGAGCGCC TCGCCGGCCG CTTCCATGCG CTGGCCTGGC GCACCCCGGT CCACGGCCTG CGGCTGCGCG GACGCTACCC GCTCAAGCTC TACGACGTGC CGCCCGACCC GATCGAGGGG CTCGTCCGGC TCGGCGGCGC GATGCTCGAC GGCGAGATAT TGTGGCAGGG CGAAAGCGTC GCGATCGAAA GCTATGATTT CCGCCCGCGC GCGATGTCGG CGGCGTTCAG CGACCATCTC CAGAGCTTCG CCTGGCTGCG CGACCTCAAC GCCGCCGGGC CGCGCCAGCG CGTCGCGCCG ATCGCCGAGC TGCTGACCGG CCGCTGGCTC GGCGCGTTCG GCAAGCAGAT CCACGAAGCC GCCTGGCGTC CCGACCTGTG GGGCCGCCGC ATCCTGTTCT GGGGCTGCCA CGCGCCGCTG ATCATGTCGT CGGACGAGCT GCGCGCGCCG GTGCTCAACG CGCTCGCCCG CGGCGCCAAG CATCTCGACC GCAGCGCCGA CAAGGCGGCG CCCGGCCTGC CCCGCGTCGC CGCCTGGGCC GGGGTGATCG CGGCCGGGCT GCTGATCCCC GGCGGCGAGC CGCGCCAGCT TCATGGCGAG AAGGGCATGG AACGGGCGCT GGCCGGCGCG CTGCACGGCG ACGGCGGTAT CGTCAGCCGC TCGCCGGTCG AGCAGATGGA CCTGATCGGC CTGCTCGCCA TGCTGCGCCG CTATTACGAG ATGCGCGGCC AGCGGCTGCC CGCCCCGATC GGCGACGCCA TCGCCCGCGC CGCGCCGCCG CTGCTGGGCC TCACCCTGGG CGACGGCGGC CTGTCGAGCT GGCAGGGCGC CGCCCCGATC GACGGCGGCC GGGTCGACGC GATCGTCGTC GCGTCGGGCG TCCGCGCCCG GCCGCTGCGC CAGTCGCGCG ACTGGGGCTA TCAGCGGCTG TCGGTCGGCC ATAGCCGGCT GGTCGCCGAC GCCGCGCCGC CGCCGGTCTC GCGCTTCGCG ACCCATGCCT GCGCCTCGAC CCTCGCCTTC GAGATGTCGG ACGGGCCGTG GCGGCTGATC GTCAATTGCG GGGGCGGCCG GGGCGCCAAC AACGCGCTCC ATCCCGACCT GGCGCAGGCG CTGCGCAGCA CCGCCGCCCA TTCGACGCTG GTGCTGGCCG ACAGCAACTC GACCGCGATC CATGGCGACG GCACGCTCGG CAAGGGCGTG ACCGAGGTCG AGGTCGAGCG GCAGGAGGAC ATGCACGGCA GCAGCATCGA CATGCGCCAC GACGGCTATG TCCGCCGCTT CGGCTTCAGC CATCGCCGCC GCCTCGTCAT CGGCGCGGGC GGGCGCGAGG TGCGCGGGCA GGACATGCTG ATCCCCGAGG GACGCCGCCC GCGCGGCGAC GGCGCCGACT ATGCGATCCG CTTCCACCTG GCGCCCGAGG TCGACGTCTC GGCGACGGCC GACGGCAGCG GCGCGCTGCT GCGGATCGCC GGCGGCGCGC TGTGGCGCTT CCGCGTCACC GGCGGCGAGG TCGGGATCGA GGACAGCCTG TGGATCGACT CGCGCGGGCG GCCGCAGGCG ACCCGCCAGC TCGTCATCGC CGGCACCGCC GCGGCCGACG GGATCGACGT GAACTGGGCG CTCGCCCGGT CCGAATAG
|
Protein sequence | MSDAGGSGFD DRVEPGRRLI RVEHDRSHSL AERLAGRFHA LAWRTPVHGL RLRGRYPLKL YDVPPDPIEG LVRLGGAMLD GEILWQGESV AIESYDFRPR AMSAAFSDHL QSFAWLRDLN AAGPRQRVAP IAELLTGRWL GAFGKQIHEA AWRPDLWGRR ILFWGCHAPL IMSSDELRAP VLNALARGAK HLDRSADKAA PGLPRVAAWA GVIAAGLLIP GGEPRQLHGE KGMERALAGA LHGDGGIVSR SPVEQMDLIG LLAMLRRYYE MRGQRLPAPI GDAIARAAPP LLGLTLGDGG LSSWQGAAPI DGGRVDAIVV ASGVRARPLR QSRDWGYQRL SVGHSRLVAD AAPPPVSRFA THACASTLAF EMSDGPWRLI VNCGGGRGAN NALHPDLAQA LRSTAAHSTL VLADSNSTAI HGDGTLGKGV TEVEVERQED MHGSSIDMRH DGYVRRFGFS HRRRLVIGAG GREVRGQDML IPEGRRPRGD GADYAIRFHL APEVDVSATA DGSGALLRIA GGALWRFRVT GGEVGIEDSL WIDSRGRPQA TRQLVIAGTA AADGIDVNWA LARSE
|
| |