Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_3192 |
Symbol | |
ID | 5198883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 3507290 |
End bp | 3510400 |
Gene Length | 3111 bp |
Protein Length | 1036 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640582738 |
Product | TPR repeat-containing protein |
Protein accession | YP_001263677 |
Protein GI | 148556095 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.080394 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGCGGA TCAACCGTCT CTCCATGGGG CTCGGCCTGC TCGCGGCAGC GGCGCTCGCG CTTCCCTCGG CGGCGCCGGC GGCGGGCCAG CCCGGCAGCC TGTCGTTGCG CAACAGCTTC CGGATCGGGT CGACCGGCGT GCTCTGCACC GCGCAGATCC GCTCGGCGAG CCCGGTGCTG TCGACGATGT TCGACCGCGG CTATCGGGTG GTGTGCCGCG ACGCCGCGGC TCCGGTCGGC CGGCTGTTCG CGCTGCGCAA GGGCGGCGAC GACGCGGTCG CGCGGGCGAT CGCCCATGCC GAAACCCCGC TGACCTGCCA GGCGCCGGCC AATGCCTCCA TCCAGGGCCT CGGCAACGTC CAGGTCCGCG ACTGCGTCGA CGCGAGCCAG CTCGCCTACA AGGTCTATGC CTATTCCAGG GGGCGGACGG TCTATGTCGC CGACGGCCTG GGCGGTTATG ACAGCGCCCT CCAGCTCGGC CTGCGGACGA TCGTCGCCGA TGCGATCGTC AAGGGCGAGG TGCAGGTCGC GACGACCTCG GCCGGCGATC CCGCCGCCTT CGCGCGCGTC CAGGCGGGCG CGCTCGACAG CGACAGCGCG CTGAGCGAGG CCTATGCCCG CAACAATGAA GGCTCCTACG CCGAGGCGGC CGAGTTCTTC GAGGCGCTGG CCGAGCGCGA CGCGTCGGGC ACGAGTACGG ATTCGCACCT GCCCGAATAT CTCGCCAACC AGGCGTTGCA GGAATCGAAC CAGCGCAGCT TCAGCACCGC CGACGCGCTG TTCGCGCGCG CCGCGACCCC GTCCGCGCTC GCCGATCCGG TGGTCGGGCG GATATTGCGC AACTCGCATG CGCTCCACTA TCTCAACCAG AGCAAGGTGC AGCAGGCGAT CGCCGAGCTG AACAAGCCGG TCGTCGAGAT CGAGAGCAGC GCTGTCGCCG ACGCCAGGCT GAGCCGGGGC GAGATCAGCG ACGAGATCGC CGCCGACCTC AACCGCGAAG GCAGCGCGAT GCGCCAGCTC GGCGCGCTCG ACGGCCAGCT CCAGCCGTTC GAGAAGGCGC AGATCCTCGA TGCGCAGGCG TTGCAGCTTC GCGGCGTCGC CTATCGGATC GACGGCAATT TCCCGAAGGC CAAGGCGGCC TTCGCCGAAG CGATCAACGC GATGCAGGCG ATCCGCGAGG GCCGGATGAA CAGCACCGGC TGGCTGCGGG CGAGCATCCA GTCCGAGCTG GGGCTGATCG CCGAGGCCGA GGGCAATGTC GGCGAGGCCG AGCGCCTCTA TGCCGACGCG CTCCAGGTCG TCGAGATCCA GCATCCGAAC TCGGCGGTGT CGCTGGGCGC CAAGGCGCGC TTCGCGGGCT TCCTCAGCCG CCACGGCCAG ACCGACCGGG CGGCGACGAT GTACGGCCAG GTCGTGTCCG AGGCGGAGAG CCTGCCCGGC GCGATGGCGT CGATCCGGAC CCTGCTGCGG CCCTATTTCC GCCTGCTGGC GGCGCGCGCG GCGACCGATC CGACCGCGGT GACGCGGATG TTCGGCGCGA GCCAGGTGAT GCTGCGCCCC GGCATCGCCC AGACCCAGGC GCTGCTCGCC CGCGAGCTGT CGGGCGGCGA CGACGAGGCG GCCAGCCTGT TCCGCCAGTC GGTGACGCTG TCGCGCTACA TCGCCCGCGC CACCGGCGAG ATCGCGCGGA TGACGGCCAC CCCGAACCCG GCCGAGCGCC CCGCCCTGGC GGAGGCGCAG GCCCGTCTCG CCCGCTATTC GCGCGACCAG ACCGCGTTGC AGGCGAAGCT GGCCCAGTTC CCGCGCTACC GCGTCCTGTC GCCGCAGACC ATGTCGGTCG AGGAGCTGCA GAAGGCGCTC CGGCCCGGTG ACGGCTATTA CAAGGTCACG CTGGTCGGCG ACGACATCTA TGCGATGTTC GTCGCCCAGG GCGTCGCGCG CGCCTGGAAG CTCGACATGA CGGCCAAGCA GCTCGCCGAC GTCGTCGCGC AGATTCGCGA TTCGGTGGTG AAGATCGAGA ATGGGCAGGT CGCGACCTAT CCGTTCGACG TCGTCCTCGC CCGCAAGCTC TACGTCACCC TGATGGGGCC GGTCGACGCG GACATGCACA AGGTCACCAA CCTGATCTTC GAGCCCGACG GCCCGCTGCT CCAGCTTCCG GCCAACCTGC TGCCGATCGA CCAGGCCGGC GTCGACGCCT ATCTGGGCCG GCTGAAGAAG CCGAACGCCG ACGATTTCGA CTTCCGCGGG GTCAACTGGC TGGGCCGCGA CCGCGACATC TCGACCGTGG TCAGCCCGCG CTCGTTCGTC GACGGCCGCG ACGTCGCCAG CTCGAAGGCG ATCAAGGCCT ATCTCGGCCT GGGCGAGAAT GCCCGGCCGG CGCTCAACCC GCTGTTCGTC CCGCCGCCGG CGATGGCCGA TCCCTGCGCC TGGCCGCTCG GCAACTGGAA CAATCCGATT TCCGCGTCCG AGCTTTATCT GGCGAGCGGG ATCATCGGGG CGGACAAGTC GCGGCTGGTG ATCGACGCGG CCTTCTCCGA CTCGACGATC ATGGGGATGA GCGACCTCAA CGAATATCGC ATCATCCACT TCGCGACCCA CGGCCTCGTC ACCGCGCCGC GCCCCGAATG CCCGGCGCGC CCGGCGCTGC TGACCAGCTT CGGCGGCGGC ACCTCGGACG GGCTGCTGTC GTTCAAGGAG ATCTTCGACC TCAAGCTCGA CGCCGACGTG GTGATCCTGT CGGCCTGCGA CACGGCCGGC ATGGCGACCG TCTCGGCGAC CCGCGAGGCG GGCATCACGA CCGGCGGCAA TTTCGCGCTC GACGGCCTCG TCCGCGCCTT CGTCGGCGCC GGCGCCCGCA CCGTGATCGC CAGCCACTGG CCGGTGCCCG ACGACTATAA CGCGACCAAG CGGCTGATCA GCGGGCTGTT CACGGCGCCG CCGGGAACGC CGATGGCGAC CGCGATGCGG CAGGCGCAGC TCGGCCTGAT GGACGACGCC AACACCTCGC ATCCCTATTA CTGGTCGGCC TTCGCGATCG TCGGCGACGG CGAGCGGCCG TTGCTGCCGA CCGGCGCCGC GCCGGGCGCC GCCGTCCAGG CGACCCGCTG A
|
Protein sequence | MTRINRLSMG LGLLAAAALA LPSAAPAAGQ PGSLSLRNSF RIGSTGVLCT AQIRSASPVL STMFDRGYRV VCRDAAAPVG RLFALRKGGD DAVARAIAHA ETPLTCQAPA NASIQGLGNV QVRDCVDASQ LAYKVYAYSR GRTVYVADGL GGYDSALQLG LRTIVADAIV KGEVQVATTS AGDPAAFARV QAGALDSDSA LSEAYARNNE GSYAEAAEFF EALAERDASG TSTDSHLPEY LANQALQESN QRSFSTADAL FARAATPSAL ADPVVGRILR NSHALHYLNQ SKVQQAIAEL NKPVVEIESS AVADARLSRG EISDEIAADL NREGSAMRQL GALDGQLQPF EKAQILDAQA LQLRGVAYRI DGNFPKAKAA FAEAINAMQA IREGRMNSTG WLRASIQSEL GLIAEAEGNV GEAERLYADA LQVVEIQHPN SAVSLGAKAR FAGFLSRHGQ TDRAATMYGQ VVSEAESLPG AMASIRTLLR PYFRLLAARA ATDPTAVTRM FGASQVMLRP GIAQTQALLA RELSGGDDEA ASLFRQSVTL SRYIARATGE IARMTATPNP AERPALAEAQ ARLARYSRDQ TALQAKLAQF PRYRVLSPQT MSVEELQKAL RPGDGYYKVT LVGDDIYAMF VAQGVARAWK LDMTAKQLAD VVAQIRDSVV KIENGQVATY PFDVVLARKL YVTLMGPVDA DMHKVTNLIF EPDGPLLQLP ANLLPIDQAG VDAYLGRLKK PNADDFDFRG VNWLGRDRDI STVVSPRSFV DGRDVASSKA IKAYLGLGEN ARPALNPLFV PPPAMADPCA WPLGNWNNPI SASELYLASG IIGADKSRLV IDAAFSDSTI MGMSDLNEYR IIHFATHGLV TAPRPECPAR PALLTSFGGG TSDGLLSFKE IFDLKLDADV VILSACDTAG MATVSATREA GITTGGNFAL DGLVRAFVGA GARTVIASHW PVPDDYNATK RLISGLFTAP PGTPMATAMR QAQLGLMDDA NTSHPYYWSA FAIVGDGERP LLPTGAAPGA AVQATR
|
| |