Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_3456 |
Symbol | |
ID | 5200975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 3792229 |
End bp | 3794046 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640583004 |
Product | hypothetical protein |
Protein accession | YP_001263940 |
Protein GI | 148556358 |
COG category | [S] Function unknown |
COG ID | [COG4805] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.203936 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.029844 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAGAC GTCAATTCAT CGGCTCGACC GGTCTCGCCC TTCTCGCCGG CATGATGCCG CGCCTGGCCG AGGCCGCGGC GACCGGCACC GCGGCCGACA AGCAGCTCGC CGCGATGGTC GAGAGGATGT TCGAGAAGCA GCTCGACGAC AGCCCCGAGA CCGCGACCAG CCTCGGCCTC GACAAGGGCG CCCGCGCCGG ACAGAAGAGC AAGCTCGACG GCTACACCAA GGCCGACCAG GCCAGGACCC TCGCCGAGGA CAAGGCGTCG CTCGCCGCGC TGCGCAAGCT CGACCGCGCC GCGCTGTCGC CCGCCGCGCA GCTCAACTAC GACGTCGTCG TCTATATGGT CGAGCAGACC ACCAGGGGCG GCTCCAAATA TCCCTATGGC AGCAGCAGCG AATATTATGT CCCCTATGCG ATCAGCCAGC TCAGCGGCCC CTATGCCTCG GTCCCGGACT TCCTCGATTC GCAGCATGTG ATCGAGACGA AGGACGACGC CGACGCCTAT GTCGCGCGCC TGCACGAATT CGCCCGCGTC CTCGACGAAA GCACCCGCTT CCAGCAGGAC GACGTCCGCT ACGGCGCCTT CGCCCCCGAT TTCTGCCTCG ATCTCGCGCT CGGGCAGCTC CGGGCGCTGC GCGACCAGCC GGCGGCGAAG ACCGTCCTCG TCGACTCGCT CGTCCGCCGG ACGGCCGAGA AGAAGATCGC CGGCGACTGG GGCGCCCGGG CCGAGACGAT CGTCGCCGCC GAAATATTCC CCGCGCTCGA CCGGCAGATC GCGCTCGTCA CCGGCCTGCG CAAGGCGGCC AGCCATGACG CGGGTTGCTG GCGCCTGCCC AAGGGCGACG AATATTATGC CGACGCGCTG ATGAACGCGA CGACGACGAC GCTCTCGCCC GAGGAGGTCC ACCAGATGGG CCGCGAGCAG GTCGCCGAGA TCAGCGGCCA GATCGATGCG ATCCTGAAGA AGGAGGGCAT GACCAAGGGG ACCGTCGGTG AGCGGCTGAC CGCGCTCAAC AACGATCCGA AGCAGCTCTT CGCCAACACC GACGCGGGCC GGGCCGAGCT GATCGACTAT ATCAACGGGC TGGTGAAGGC GATGGAGGTG AAGCTGCCCG AGGCGTTCGC GACCCTGCCC AAGGCGCCGC TCGAGGTGAA GCGGGTGCCG CCCTTCATCC AGGACGGCGC GCCCAACGGC TATTACAACT CGGCCGCGCT CGACGGTTCG CGCGGGGCGA TCTACTATAT CAACCTCAAG GATACCGCCG ACTGGCCGCG CTACGGCCTG CCCAGCCTGA CCTATCATGA GGGCACGCCG GGCCATCACC TCCAGATCAG CCTCGCCCAG GAGGCGAAGG ACCTGCCGAT GCTGCGCAAG GTCGCGCCGT TCGGCGCCTA TGTCGAGGGC TGGGCGCTCT ATGCCGAGCA GCTCGCCGAC GAGATGGGCG TCTATGAAAA GGACCCGATC GGCCGCGCGG GCTTCCTGCA GAGCTTTCTG TTCCGCGCGG TCCGGCTGGT CACCGACACC GGCATCCACT TCAAGCGCTG GAGCCGCGAG CAGGCGACCG ACTACATGGT CGAGGCGACC GGCTTCGCCC GCCCCCGCAC CCAGCGCGAG GTCGACCGCT ACTGCATCTG GCCGGGCCAG GCGTGCAGCT ACAAGGTCGG CCATATGAGC TGGGTCAAGG CGCGCGAGAA GGCGAAGGCG ATCAAGGGCG CGGCGTTCGA CCTGCGCCAG TTCCACGAGG TGCTGCTCGA AGGCGCGCTG CCGCTGACCA TCCTCGAACA GGTCACCGAG GCGCGGGCGA AGGCATAA
|
Protein sequence | MDRRQFIGST GLALLAGMMP RLAEAAATGT AADKQLAAMV ERMFEKQLDD SPETATSLGL DKGARAGQKS KLDGYTKADQ ARTLAEDKAS LAALRKLDRA ALSPAAQLNY DVVVYMVEQT TRGGSKYPYG SSSEYYVPYA ISQLSGPYAS VPDFLDSQHV IETKDDADAY VARLHEFARV LDESTRFQQD DVRYGAFAPD FCLDLALGQL RALRDQPAAK TVLVDSLVRR TAEKKIAGDW GARAETIVAA EIFPALDRQI ALVTGLRKAA SHDAGCWRLP KGDEYYADAL MNATTTTLSP EEVHQMGREQ VAEISGQIDA ILKKEGMTKG TVGERLTALN NDPKQLFANT DAGRAELIDY INGLVKAMEV KLPEAFATLP KAPLEVKRVP PFIQDGAPNG YYNSAALDGS RGAIYYINLK DTADWPRYGL PSLTYHEGTP GHHLQISLAQ EAKDLPMLRK VAPFGAYVEG WALYAEQLAD EMGVYEKDPI GRAGFLQSFL FRAVRLVTDT GIHFKRWSRE QATDYMVEAT GFARPRTQRE VDRYCIWPGQ ACSYKVGHMS WVKAREKAKA IKGAAFDLRQ FHEVLLEGAL PLTILEQVTE ARAKA
|
| |