Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_4107 |
Symbol | |
ID | 5199306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 4512321 |
End bp | 4513979 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640583663 |
Product | TPR repeat-containing protein |
Protein accession | YP_001264588 |
Protein GI | 148557006 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAATGA AGCGCGGACG GGTCGCCATC GCCGCGGGGC TGCTGCTGGC CTGGCCGGCC ATGGCCCCCG CCATGGTCAA CGACGCCTCG GCGGAGATGC GGCAATATGT CCACGCCCGG CTCGCCGATG CCGCCGGCAT GCCCGAAGCG GCCGCTGCCA GCTATGCGCG CCTGCTCCAG GCGTCGCCGC AGGACAAGCG GCTGGCGCTG CGCACCTATC GCCAGGCGTT GACCGCGGGC AATTACAAGC TCGCCGGGCT CGCCACCGCG CAGCTCGACC GGCTCGGCGC CCTGCCCCCC GACGGCACGC TGTTGCTGTT CGCCGAGGCG GTCACCGCGC ACGACTGGAA GCGCGCCAAT GCGACGATCG GGCGGATCGA GCGCGAACAG GTGTTCGGCT TCCTCGCGCC GGTGATGCGC GGCTGGGTCG CCTATGGCCG CCAGTCGTCG GACGCCGCCC GGCTCGCCGC GCCGGCCGGC GGCTCGCAGC TTTCGAACGC CTATTCGCGC GACCATTATA TGCTGATCGC GCTGGCGATG GGCCGCCATG AGGTGCTGGC CGACCTGCGG CGGCTGATCG CGGCGCACGA CGTGCGCTCG CTGCGCCTTC AGCTCGCCGC CGCGGCGCTG CTCGCCAAGC GCGGCGACAT GGCCGATGCC CGCGCCATCC TCGACGGCCA GACGCCCGAG CTGATCCAGG CCCGCGCCAC GCTCGACGCC GGCAAGCCGC TCGGCGGCGC GATCGACACG CCCGAGCTGG GCCTGTCCGA CCTGTTCGCG CAGCTCGCGA TCGACGTGAA GGGCGACGGC CGCTCCCCCG TCTCGCTGCA ACTCGCCCGC ATCGCCGGCT ATCTGGCGCC GACCAACGCC GCGGCGATCA TCGCCACCGC CGACCTGCTG ACCGCCAACG GCTATCATGA CGCGGCGCTC GCCCTGCTCG ACAAGGTGCC CGCCGACGAT CCGCTCCGCG AAGCCGCGCG GCAGGAACGC AGCGGCATCC TGCTGTCGAT GGGCAATCGC CAGGCCGCGC TCGCCGATGC GCAGAAGGCC GCCGCGCAAC CCGATGCCAG CGCCGCGACC CTGGTCGAGC TGGGCGGCAT CCTTTCCGAT CTCGACCGCC CGGCCGAAGC GGTGAAGGCC TATCAGCGCG CGATCGACAT CGATACGGCG CAGGGTATCC CGAACTGGGC GCATCTGTTC CTGCAGGCGG GCGCGCTCGA CCGCGCCGGC GACTGGGAGG GCGCCAAGGA CCGGCTGCGC CAGGCCGGCA AGCTGGCGCC GGGCCAGGCG GTGATCCTCA ACTATCTCGG CTATGGCATG CTCGATCGCG GCGAGAACCT GCCCGAGGCG CAGGCCTATA TCGAGCGCGC GAGCAGCCTC GATCCCAACG ACGCGGCGAT CGCCGATTCG CTCGGCTGGC TCTATTACAA GCGCGGCAAC TATCCGGGCG CGATCGCCGC CCTCGAACGC GCGGTGGCGG GCGAGCCGGG CCAGTCGGTG ATCAACGAGC ATCTCGGCGA CGCCTATTGG GCGGTCGGCC GGCGGATCGA GGCGCGCTAC GCCTGGCGCG CGGCGCTGGT CCAGGCCGGC AAGACCGACA GCGAGCGGAT CAAGCGCAAG ATGGCCGACG GCCCCGGCGA CCGGCTCAGC GCGAACTGA
|
Protein sequence | MGMKRGRVAI AAGLLLAWPA MAPAMVNDAS AEMRQYVHAR LADAAGMPEA AAASYARLLQ ASPQDKRLAL RTYRQALTAG NYKLAGLATA QLDRLGALPP DGTLLLFAEA VTAHDWKRAN ATIGRIEREQ VFGFLAPVMR GWVAYGRQSS DAARLAAPAG GSQLSNAYSR DHYMLIALAM GRHEVLADLR RLIAAHDVRS LRLQLAAAAL LAKRGDMADA RAILDGQTPE LIQARATLDA GKPLGGAIDT PELGLSDLFA QLAIDVKGDG RSPVSLQLAR IAGYLAPTNA AAIIATADLL TANGYHDAAL ALLDKVPADD PLREAARQER SGILLSMGNR QAALADAQKA AAQPDASAAT LVELGGILSD LDRPAEAVKA YQRAIDIDTA QGIPNWAHLF LQAGALDRAG DWEGAKDRLR QAGKLAPGQA VILNYLGYGM LDRGENLPEA QAYIERASSL DPNDAAIADS LGWLYYKRGN YPGAIAALER AVAGEPGQSV INEHLGDAYW AVGRRIEARY AWRAALVQAG KTDSERIKRK MADGPGDRLS AN
|
| |