Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_3191 |
Symbol | |
ID | 5198882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 3502698 |
End bp | 3507293 |
Gene Length | 4596 bp |
Protein Length | 1531 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640582737 |
Product | filamentous haemagglutinin outer membrane protein |
Protein accession | YP_001263676 |
Protein GI | 148556094 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0810] Periplasmic protein TonB, links inner and outer membranes |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.670297 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAGCA GCGTTTCCGA CATCTTCGCC GCCGAGCGGC GCCGCCAGCG GTCGATGCCG CGCACGATGA CCTCGATGGC GGTGCTCGCC ACCGTCCTCG CCATGCCGGT CAAGGAAAGC CTCGCGGCGA ACAGCTACGG CTTCCTGGGC ATCCCGGACG GCTGCGTGAC CGCGACCTGC TCGGGCACCG GCTACACCAT CGACCGGACC TTCGCGACGA GCACCGACAC GGTGACGGTC AACGCGCCTT CCGCGCTCAT CACCTGGGTC CCCAACGACA CCAACGGCGG CACCGCCTCC GTCCAGGACG CGATCAACTT CCTGCCCTCC GGCAACACCG TCAACTATGT CGCCAATGGC GCTTTGCCCA ACGGCGAATA TACGATCCTC AACCGGATCG TTCCCCAGCC CAACGCCGAC GGCCGGTCGA TCGCGCTGAA CGGCACGATC ACCACCGACA CCGCGAAGGG CAATGTCTGG TTCTACAGCC CCAACGGCAT CGTCGTCGGT TCCACCGCCT CGTTCGATGT CGGCGGCCTT CTCCTGACGA CGTCGAACAT CACCGCCGGC AACGTGACGC TCGATGGCGA CGGCTACGTC ACCGCGGTCA ACACCCAGAC CAGCATCGCC GGCGCGGCGG TCACGATCCA GTCGGGCGCC GACATCACGG CCGGCAATTA TTTCGGCATC CTCGCGCCGA AGATCGATCA GGGCGGCTCC GTCGAATCGG ATGGCGGCAT CGCCTATATC TCGGCGGAGC AGGCGAAGCT GTCGATCGAC GACGCGAGCG GCCTGTTCGA CATCGAGGTC GGGGTCGGTG CCGACTCGAT GGGCGGCATC CTCCACGGCG GCACCAGCCG GGGCGTGATC GCCACCGCCG ACGGCCTCGT CCACCCGATC TATTTCGTCG CGGTGCCGAA GAACGACGCG ATTTCGATGC TGCTCGGCGG CAATATCGGC TTCCAGGTGG CGCAGGGCGC GGAAGCCACC GAGCACGGCA TCGTCCTGTC GGCCGGTCGC GACATCGCCT TCGGGTCGCT CAATGCGGGC GGCGGCCTCT CCCAGGGGGC TGGCACCGGA TCGGTCAGCA TTTCCGGCCT GCCGACGCTC AACGCCGGCA CCCTCATCCA GGCGGGCGTT CCGCTGACCG TCTCCGGCAA CGTGCAGATC GCGGCCAATG GTGCGCCCGT CACCATCGAC GCGGCGAGCG ACATCAACTT CACCGGCACC GTTTCGATCG ATACCTCCGT CCTCGGCAAT CCCGTCGCCC CGGTGCAGGG CGGCGCGATC TCGCTCCTCG CGCAGGGCGG CACGATCGAC TTCGGCAGCG ACGTCACGCT GCTGGCGAAC GCCCAGGCTT TCGACGAGCA GAACGGCACC GGCGGCGCGA TCCTGATCTC CGCGCTGGGA CCGAACAGCG GGATCGCCTT CCATGGCGGC CTCACCGGGG CGGCGGACGG CAATGGCGGC CTGCCCAGTT CGGTCGGTGA TGGCGGCGAC GGCTTCGGCG GCACCATCCT GGTCACCGCC GGCGGCGGCG GATCGATCAC CGCCGACACC GGCATGAGCC TGACGGCGGT CGGCCGGGGT GGCGAGACCT TTGAGGGCAC GGGCGGCAAC GGCCAGGGCG GCTCCGTCGC GCTGGTCGCC AATCCCGGCG GCACCATCAC CGTGCCGCAG GGCTCGATCA CCCTCGACGC GAGCTCGGTC GGCGGCAATG TGTATCAGAC GGGCACCGGC GGCAACGCCC TGTCGGTGCT GCCCGGCAGC GGTTCGCCGA AGGCCTATGT CGCGTTGCAG GCCAATGGCG GCGTGATCGA CCTTGGCAGC GACCAGGTTC CGGTCGATCT CTACCTGCGC GCCAACGGCA CGGCCGGAGA CGGTCCGACC GGCGGCACCG CCCATGGCGG CGCCATCGAG ATATTGGCGT CGGGCGCGGG CGGCGGCATC AACGCCAACA GCGCGGTGAG CGCCCAGGCC TATGCCCGGG TCGGCAGCGA CGGCAGCTTC GGCGGCGGCA ACGCCACCGG CGGCCTGTTC AACATGGTGT CGAGCGGTGG CTTGATCACC CTCAATTCGC TCGATGCCCA GGTCTATGGC CAGGGCGGCG ACAACAATAA TAGCGGCGCG GGCGGCATCG GCACCGGCGG CAAGGCCTAT ATCGAGGCGC AGGGCGGCAC GGCCCTCGCG ATCGAGGGCA GCACCTTTCT CGATGCGAGC GGCACCGGCG GGCAGGGCAT AACGGCGGGC GGCGCCGGCA CCGGCGGCGC CGCGCAGATC TTCGCCAATG GCGGGGACAT CACCCTGGGC GGCGGGCAGA CCATCATCAA CCTCGATGCG AGCGGCACCG GCGGCGGCAG CGAGGTCGAC GGCCTTGGTT CCGGCGTCGG CGGCACCGGC ACCGGCGGGG TCGCCGGCTT CGAGGCCAAT GGCGCCACGC TTACCCTGGA CGGCTCGGCG GTCGCCAATG CCAGCGGCAC CGGCGGCAAC GGCCGAACCG GCGGCAACGG TGTGGGCGGT CGTCCGCCGG GCCTGGAAAC GCAGGATTAT GATCCCTTCT ACGGCGTCTA TGCGCAGGCG AGCGACGGCG CGATCAACGC GACGGTCAGC CTGACCCTGG TGGCGGACGG CTATGGCGGC AGTACTGCCT CCGACGGCTC CAATGTGGGG CATGCGGGCA ACGGTACCGG CGGCTATGCC GAAATGCTCG CCACCAACAT CGGTGGCCCC ACCTCCTTGG CGACGATCAG CGCCCCTTCG GTGCAGATGT CGGCGGCGGG CGCGGGCGGT CACGGCGGCG ATCCCGGCGC GGACGGGGTC GGCAGCTCCG GCGGCGTCGG CCAGGGCGGC GTCGTCGTCA TGGGCGCCTA TGCCGGCCGG GGCCAGCTTT CGCTGGGTAC CGTTTCGCTC TTTGCCACGG GTGTTGGCGG CGACGGCGCC GACGGCGAAT CGCAGAGCAC GGGTAACGGC GGCACCGGCG GCTCCGGCGG CGCCGGTATC GGCGGCAATA TCCAGGCCGG CGTCTTCAGC GGCCCGCAGA CCGCCGAGAA CAACGGCTCG GCCGACATCC AGAGCCTCCA AATGGATGCG TCGGGCCTGG GCGGCGCGGG CGGCAGCGGC GGGTCGTCGG GCGAGGGCAC GGGCGGCAAT GGCGGCGCGG GCGGCGCGGG AATCGGCGGC GGTGGTACCG ACGCCGGCAG CCTTGCCATC CTGGCGCGCG GCGCGCCGGT GACGATCGGT TCGGCCACGC TCAATGCCTT CGCCCAGGGC GGGGTGGGCG GCAGCGGCGG CAGCGGCTTG AGCACCTCGG GCACCAACGG TGCCGATGGT GCCGGCAAGG GCGGCCATTT CGCGCTCCTC TCGACCTATC GCTATCAGAC CACGACCGTG CCGGGATCGC TCTCGATCGA CGTGCTCAAT GCCGATGTTT CCGGTGCCGG AAACACCGAA GCGCCGAGCG GGCCGACCAC AGCCGGCAAC TTCGTGATCG CGACGAACGG CGGCGACCTG ACGATCTCCA GCGGGGACGT GTATGCCAAT GGCCAGCTCG CCCCGACCAA CCAGTTCCAG AACGTATTCC TCGACGTGAC GACGGTGAGC GAGGTCTCGG CGGGCAACGG CACCGTCACG CTCGGCACGT CTCCCAACAC TTTCAGGATC CACACCGAGG CGGCGCAGGG CTTCCATCCG GTCAACTTCT ACACCGATCC GGGCGGCCAG TTCGCGGCCA ACCTGGCCAA TTGCACGCTC AATGGCGTGG TCTGCCAGTC GGTTGCGCGG GCGGTGTCTC CGCCGCCGCC CCCACCCCCG CCGCCGCCGG TGTCGCCGCC CCCGCCGCCA CCCCCGCCGC CGGTGTCTCC GCCGCCGCCA CCGCCGCCTC CGCCGGTGTC GCCGCCCCCG CCGCCTCCGC CGGTGTCGCC ACCTCCCCCG CCGCCGCCAC CGCCGCCGGT GATCGAGGAT CCGGCGGTGA ACGAGACGGT CACCACCGAG ACGACGAACA TCACGTCGAG CATCCAGTCG ACGCTGACCG GCTCGCGCAC CGCGGGCGGC AGCATCAAGA GCACCGAGAC GACGGGAACC ACCGCGCCCG GTTCGGACCC GGCCGGCGGC TCGGCCGCGT CGGCGGGCGA TGACGAGGGC GGCGATGGCG ACGGCAGCGA CGATGCCGGC GGCTCGACCT CGAGCGGCGG CAGCGTCGGC GGCCCGAACC TGCTGATCGA CACGAGCAAC ATCGGGTCGG ACGCGACGCA GATCGACACG CCGGTGCTCA GCTCGGGCAA TAGCAGCCTG TGGCCGGGGG CCGACGGCCT CACCGACACG GGCGGCGCGG GCGATGGTCC GTCGCCCGCC GGTGCCGGCG GTGCGTCCCC GGGCGGATTG CCGGTCGGTG CGCCGGGCGC GGTCCCGGGC GGCAGCGATG CGACGCCGTC TTTCGGCAGC ATCGGCAACC AGATCTTCGA CGAGGGCGCC GCGGGCGCCT CGGGCGACGG GAGCGGTGGC CAGTCCTCGG GCGGGGGCGG CCAGGCGCCG GCGCCGGGCG GCTCGTCGCC GTCGTCCGAC AAAGACCGAT CCTCGGATGG AGGGAAGCAG CAGTGA
|
Protein sequence | MRSSVSDIFA AERRRQRSMP RTMTSMAVLA TVLAMPVKES LAANSYGFLG IPDGCVTATC SGTGYTIDRT FATSTDTVTV NAPSALITWV PNDTNGGTAS VQDAINFLPS GNTVNYVANG ALPNGEYTIL NRIVPQPNAD GRSIALNGTI TTDTAKGNVW FYSPNGIVVG STASFDVGGL LLTTSNITAG NVTLDGDGYV TAVNTQTSIA GAAVTIQSGA DITAGNYFGI LAPKIDQGGS VESDGGIAYI SAEQAKLSID DASGLFDIEV GVGADSMGGI LHGGTSRGVI ATADGLVHPI YFVAVPKNDA ISMLLGGNIG FQVAQGAEAT EHGIVLSAGR DIAFGSLNAG GGLSQGAGTG SVSISGLPTL NAGTLIQAGV PLTVSGNVQI AANGAPVTID AASDINFTGT VSIDTSVLGN PVAPVQGGAI SLLAQGGTID FGSDVTLLAN AQAFDEQNGT GGAILISALG PNSGIAFHGG LTGAADGNGG LPSSVGDGGD GFGGTILVTA GGGGSITADT GMSLTAVGRG GETFEGTGGN GQGGSVALVA NPGGTITVPQ GSITLDASSV GGNVYQTGTG GNALSVLPGS GSPKAYVALQ ANGGVIDLGS DQVPVDLYLR ANGTAGDGPT GGTAHGGAIE ILASGAGGGI NANSAVSAQA YARVGSDGSF GGGNATGGLF NMVSSGGLIT LNSLDAQVYG QGGDNNNSGA GGIGTGGKAY IEAQGGTALA IEGSTFLDAS GTGGQGITAG GAGTGGAAQI FANGGDITLG GGQTIINLDA SGTGGGSEVD GLGSGVGGTG TGGVAGFEAN GATLTLDGSA VANASGTGGN GRTGGNGVGG RPPGLETQDY DPFYGVYAQA SDGAINATVS LTLVADGYGG STASDGSNVG HAGNGTGGYA EMLATNIGGP TSLATISAPS VQMSAAGAGG HGGDPGADGV GSSGGVGQGG VVVMGAYAGR GQLSLGTVSL FATGVGGDGA DGESQSTGNG GTGGSGGAGI GGNIQAGVFS GPQTAENNGS ADIQSLQMDA SGLGGAGGSG GSSGEGTGGN GGAGGAGIGG GGTDAGSLAI LARGAPVTIG SATLNAFAQG GVGGSGGSGL STSGTNGADG AGKGGHFALL STYRYQTTTV PGSLSIDVLN ADVSGAGNTE APSGPTTAGN FVIATNGGDL TISSGDVYAN GQLAPTNQFQ NVFLDVTTVS EVSAGNGTVT LGTSPNTFRI HTEAAQGFHP VNFYTDPGGQ FAANLANCTL NGVVCQSVAR AVSPPPPPPP PPPVSPPPPP PPPPVSPPPP PPPPPVSPPP PPPPVSPPPP PPPPPPVIED PAVNETVTTE TTNITSSIQS TLTGSRTAGG SIKSTETTGT TAPGSDPAGG SAASAGDDEG GDGDGSDDAG GSTSSGGSVG GPNLLIDTSN IGSDATQIDT PVLSSGNSSL WPGADGLTDT GGAGDGPSPA GAGGASPGGL PVGAPGAVPG GSDATPSFGS IGNQIFDEGA AGASGDGSGG QSSGGGGQAP APGGSSPSSD KDRSSDGGKQ Q
|
| |