Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_1100 |
Symbol | |
ID | 5198221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 1236880 |
End bp | 1240029 |
Gene Length | 3150 bp |
Protein Length | 1049 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640580647 |
Product | hypothetical protein |
Protein accession | YP_001261604 |
Protein GI | 148554022 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATCGC AACGCCCTAA ACCTCGGCCC CGGCCGTTTC GCTACCTGAC CATGAGCGAC CTCAGGCCGG TGTGGCTGAA AAGGACCGCG CGCGGTCTCG TCGTCCTGGT CGGCCTGGTG TTGATCCTGC TCCTGGCGGG GTCCACGGCC CTTTGGTTCA CGGTGCAAGA CCGGTTGAAG ACAGCGGACT TCGCGCCGGC GAAACCGCTT TCGTCCTACG CAGACGAGGC GGCCCCTGCG CCTCGGCCCG CGCCCGCGCC GGCCCCGTTC GACTGGGCCG CGATCGACGC CCCGCCAACC TCGGCCTTGC CCTGGGTGCG GTGGTGGTGG CCAGGCGGCA ACGTCGAGCC GAGCGAGTTG CGACACGAGC TCGATCAATT GAAGGCCGCG AACTTCGGCG GTGCCGAGGT CCAGCCCTTC GCCTTCGGCG TCAAGGCGGT GACGGACAAG GACCCCGCGG CGCGGGCGCG CATCGCCGCC TTCGACACGC CTGCCTATTT CGCGATCCTG CGCGGCGTGA TGAAGGACGC CGCCGATCGC GGCCTGCGGA TCGACCTCAC CCACTATAGC GGCTGGCCGG CGGGCAGCCC CGCCGTGGGG GTCGCGGACG GACTGCAGAG CCTCGCCTGG TCGGAGCGTC GCTTCCGCGG CGGACGCCAG GTCGAGATCG CCTTGCCGCG CCCGAAGCCC AGCCTCAACG CCTTGCTGCT GGCGGCCTCG TCACTGATCT CGCCGATGGG CGATATCAGC GATTTCGATC CCTCGCGCGC CCAGTTGCTC AGCGTCCTCG TGGCCCGGCC AAAGGGCGGG GGCCATAGCC TGTTCTCGAC AAAGGACACG CTCCGCCTCG ACGCGGCTTC GGTCCAGGTC GTGGACGCTC AGGTGAAGGA CGGCAAGCTC GTCTGGGACG CGCCGCCGGG TCGCTGGGTG CTGGTCGCCT CGTGGATCCT GCCGGCCGGC CAGCCGCCGA TGTTCAGCGC GCCCGAGCCG CCGGGCTATG CGGTCGACGT CTTGCGCGCG GCCAATGTTC GGGCGCACTA CAACTACGCC TTCGGCCGGC GCACGGGCCT GGACGCCGAA GCCGGCCATG CCTTCCGCGG GATCTTCAAC GACAGCCTGG AATTCGCCGT CGATCGCCTG GGTTCCGCCG ACATCCTGGC CGAGTTCAGG CGTCGGCGCG GCTACGACCT GCGCCCTCAC CTGCCGGTCG TCTTCGTCGA CGCCGCCGAC AGCTTCTATG TGAGCGAATT GCTGCCCTCG CGGAAACCGA ACTTCACCCT GGGCGAGATG GATGACCGCA TCCGCCACGA CTACCAGCTG ACCCTCTCGG ACCTCGTCGT GGAGCGCTTC CTGGACGAGA CGCGGACGTG GGCCGCGGCG CGAGGCCTCA AGTCCCGCGG CCAGGGCTAC GGCATGGACA TCGACGTCCT GCGCGCGCTC GGCGCCAACG ACATACCCGA GACCGAGCAG CTCTACGCCG GCGGCGGCGA GGCCTTCCTG CGGATGGCCG GATCGGCCGC GGCGCTCTAC GGGCGCGACC TGGTGAGCGC CGAGGCCTTC GTCTGGGCCG ACCAGGACTA TGCGTCCACG CCCGCCAAAC TGAAAGCCGC GGCCGACAAG CTGTTCTTGT CCGGGGTCAA CCACGTCATC TACCACGGCT TCCCCTACGA CTGGCGCGCC GGGGATCGCG ACCGCTGGTT CGGGGATCAG GGCTGGGCCC CCTTCGCCAG CGATCCGATG GCCGTCTTCT CGGACAACTA TTCGCCACGC AATCCGCTGT GGGCCGATCT GCCGGCGCTC AACGCCTATA TCGGCCGTTC GCAGAGCCTC CTTCGGCAAG GACGCCAGGC GGCTGACGTC CTGATCTACT ATCCGTTCCT GGGCTATCGT GCGACGGGCT ACGGCCCCGA CGAGCTCAAG GAGCCGCTGT TCCTCGGGGC CTTCCCCTCC GCCGCCCCCG CAGGCCCGCC GCCGAGGACG GGGATGCAAG GAACAGACGA GCGGATCCTC TGGTTGCGCA AGATGGCGCC GGTGTTCGAC GCGCTGAACC GGCGCGGCCT CACCTGGGGC TGGGTGAATG GCGACGGCCT GCGCAACCAA CTTCTGCCCG ATGGCCGCAT GAGGTCCGGC GCGTCCTACG GCGCCATCCT GCTGGCCGAG GTCGAGGCCA TGGCGCCCGA AGACCTCGCC GCCGTCCAGG CGCTGGCGGC GCACAAGGTC CCGGTGGCCC TGTACGGCCG CACGCCCGTC CGCCAGCCGG GCTATCTGGA CGCCAAGGCC GGTGACGCCC GTGTGCGCGC ATCCAGCGCG GCGCTGGCGG CCGCAGCATC GAGCGCCCGG ACGCCCGCGG CTCTCGTGGA CGCTATCGTC GCGGCGAGCA GGCCCGATCT CCGGTTCGCG GGCCGCAGCG CGTTGCAGCG CTACTCCCGC ATCCTGGCGA ACGGCGGCCG GATCGACTTC CTGGCCAATC CCGGCGCCAG CGAGGCGAGC ACGCTGATCT CCACCCAGGA CCACGCGCAG GCCTGGTGGT TCGACGCGCG GACCGGCCGA GCGGCGCCGG CCGTGCGGGA CGCCGAGGGC CGCATCGCCC TGACGCTCCA GGCCTATGAC TCGCGGTTCC TCATCCGAGG CGTCCCCATG CCGGCCCACC TCGCCTCGTC CGCCATGACA ACCAGCCCGG TCACGCGCGC ATGGCCGTTG AAGGGCTGGA GCCTGCGCGT CGGCGGCGAC GTCCGAACGT CGGGGCTCTT CGACTGGCGC AGCGACGAGG CGCTGCGCCA TTCCAGGGCC GAAGGCGTCT ACAGCGCCAA GGTCACCCTG CCCGAGTTGG CGCCCGGGGC GCGTCGCGGC CTGCGCCTCG GCATCGTTCC AGGCATGGCC ACGGTGAGGA TCAACGGCCG CGACGCTGGC CGAGCCAGCC TGCCGCCCGG CGAATTGGAC GTCAGCGGCC TGCTGAAGCC CGGCGTGAAC CGCATCGAGA TCGTCTACCG GCCGACGCTG CGCAACTGGA TGATCGGCCG CGCGGCGGAG GGCGACAAGC GCGCCGCGGC GTTCCGGTCG CGGACCGAGG CGCTTACGCC CGCGGGCCTC CAGCAGCCGA TCAGCATCGT GGAGCATGGC GCGCCCGACG CGCGCGGCCA TACGAAGTGA
|
Protein sequence | MPSQRPKPRP RPFRYLTMSD LRPVWLKRTA RGLVVLVGLV LILLLAGSTA LWFTVQDRLK TADFAPAKPL SSYADEAAPA PRPAPAPAPF DWAAIDAPPT SALPWVRWWW PGGNVEPSEL RHELDQLKAA NFGGAEVQPF AFGVKAVTDK DPAARARIAA FDTPAYFAIL RGVMKDAADR GLRIDLTHYS GWPAGSPAVG VADGLQSLAW SERRFRGGRQ VEIALPRPKP SLNALLLAAS SLISPMGDIS DFDPSRAQLL SVLVARPKGG GHSLFSTKDT LRLDAASVQV VDAQVKDGKL VWDAPPGRWV LVASWILPAG QPPMFSAPEP PGYAVDVLRA ANVRAHYNYA FGRRTGLDAE AGHAFRGIFN DSLEFAVDRL GSADILAEFR RRRGYDLRPH LPVVFVDAAD SFYVSELLPS RKPNFTLGEM DDRIRHDYQL TLSDLVVERF LDETRTWAAA RGLKSRGQGY GMDIDVLRAL GANDIPETEQ LYAGGGEAFL RMAGSAAALY GRDLVSAEAF VWADQDYAST PAKLKAAADK LFLSGVNHVI YHGFPYDWRA GDRDRWFGDQ GWAPFASDPM AVFSDNYSPR NPLWADLPAL NAYIGRSQSL LRQGRQAADV LIYYPFLGYR ATGYGPDELK EPLFLGAFPS AAPAGPPPRT GMQGTDERIL WLRKMAPVFD ALNRRGLTWG WVNGDGLRNQ LLPDGRMRSG ASYGAILLAE VEAMAPEDLA AVQALAAHKV PVALYGRTPV RQPGYLDAKA GDARVRASSA ALAAAASSAR TPAALVDAIV AASRPDLRFA GRSALQRYSR ILANGGRIDF LANPGASEAS TLISTQDHAQ AWWFDARTGR AAPAVRDAEG RIALTLQAYD SRFLIRGVPM PAHLASSAMT TSPVTRAWPL KGWSLRVGGD VRTSGLFDWR SDEALRHSRA EGVYSAKVTL PELAPGARRG LRLGIVPGMA TVRINGRDAG RASLPPGELD VSGLLKPGVN RIEIVYRPTL RNWMIGRAAE GDKRAAAFRS RTEALTPAGL QQPISIVEHG APDARGHTK
|
| |