Gene Swit_3191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_3191 
Symbol 
ID5198882 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp3502698 
End bp3507293 
Gene Length4596 bp 
Protein Length1531 aa 
Translation table11 
GC content71% 
IMG OID640582737 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_001263676 
Protein GI148556094 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0810] Periplasmic protein TonB, links inner and outer membranes 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.670297 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAGCA GCGTTTCCGA CATCTTCGCC GCCGAGCGGC GCCGCCAGCG GTCGATGCCG 
CGCACGATGA CCTCGATGGC GGTGCTCGCC ACCGTCCTCG CCATGCCGGT CAAGGAAAGC
CTCGCGGCGA ACAGCTACGG CTTCCTGGGC ATCCCGGACG GCTGCGTGAC CGCGACCTGC
TCGGGCACCG GCTACACCAT CGACCGGACC TTCGCGACGA GCACCGACAC GGTGACGGTC
AACGCGCCTT CCGCGCTCAT CACCTGGGTC CCCAACGACA CCAACGGCGG CACCGCCTCC
GTCCAGGACG CGATCAACTT CCTGCCCTCC GGCAACACCG TCAACTATGT CGCCAATGGC
GCTTTGCCCA ACGGCGAATA TACGATCCTC AACCGGATCG TTCCCCAGCC CAACGCCGAC
GGCCGGTCGA TCGCGCTGAA CGGCACGATC ACCACCGACA CCGCGAAGGG CAATGTCTGG
TTCTACAGCC CCAACGGCAT CGTCGTCGGT TCCACCGCCT CGTTCGATGT CGGCGGCCTT
CTCCTGACGA CGTCGAACAT CACCGCCGGC AACGTGACGC TCGATGGCGA CGGCTACGTC
ACCGCGGTCA ACACCCAGAC CAGCATCGCC GGCGCGGCGG TCACGATCCA GTCGGGCGCC
GACATCACGG CCGGCAATTA TTTCGGCATC CTCGCGCCGA AGATCGATCA GGGCGGCTCC
GTCGAATCGG ATGGCGGCAT CGCCTATATC TCGGCGGAGC AGGCGAAGCT GTCGATCGAC
GACGCGAGCG GCCTGTTCGA CATCGAGGTC GGGGTCGGTG CCGACTCGAT GGGCGGCATC
CTCCACGGCG GCACCAGCCG GGGCGTGATC GCCACCGCCG ACGGCCTCGT CCACCCGATC
TATTTCGTCG CGGTGCCGAA GAACGACGCG ATTTCGATGC TGCTCGGCGG CAATATCGGC
TTCCAGGTGG CGCAGGGCGC GGAAGCCACC GAGCACGGCA TCGTCCTGTC GGCCGGTCGC
GACATCGCCT TCGGGTCGCT CAATGCGGGC GGCGGCCTCT CCCAGGGGGC TGGCACCGGA
TCGGTCAGCA TTTCCGGCCT GCCGACGCTC AACGCCGGCA CCCTCATCCA GGCGGGCGTT
CCGCTGACCG TCTCCGGCAA CGTGCAGATC GCGGCCAATG GTGCGCCCGT CACCATCGAC
GCGGCGAGCG ACATCAACTT CACCGGCACC GTTTCGATCG ATACCTCCGT CCTCGGCAAT
CCCGTCGCCC CGGTGCAGGG CGGCGCGATC TCGCTCCTCG CGCAGGGCGG CACGATCGAC
TTCGGCAGCG ACGTCACGCT GCTGGCGAAC GCCCAGGCTT TCGACGAGCA GAACGGCACC
GGCGGCGCGA TCCTGATCTC CGCGCTGGGA CCGAACAGCG GGATCGCCTT CCATGGCGGC
CTCACCGGGG CGGCGGACGG CAATGGCGGC CTGCCCAGTT CGGTCGGTGA TGGCGGCGAC
GGCTTCGGCG GCACCATCCT GGTCACCGCC GGCGGCGGCG GATCGATCAC CGCCGACACC
GGCATGAGCC TGACGGCGGT CGGCCGGGGT GGCGAGACCT TTGAGGGCAC GGGCGGCAAC
GGCCAGGGCG GCTCCGTCGC GCTGGTCGCC AATCCCGGCG GCACCATCAC CGTGCCGCAG
GGCTCGATCA CCCTCGACGC GAGCTCGGTC GGCGGCAATG TGTATCAGAC GGGCACCGGC
GGCAACGCCC TGTCGGTGCT GCCCGGCAGC GGTTCGCCGA AGGCCTATGT CGCGTTGCAG
GCCAATGGCG GCGTGATCGA CCTTGGCAGC GACCAGGTTC CGGTCGATCT CTACCTGCGC
GCCAACGGCA CGGCCGGAGA CGGTCCGACC GGCGGCACCG CCCATGGCGG CGCCATCGAG
ATATTGGCGT CGGGCGCGGG CGGCGGCATC AACGCCAACA GCGCGGTGAG CGCCCAGGCC
TATGCCCGGG TCGGCAGCGA CGGCAGCTTC GGCGGCGGCA ACGCCACCGG CGGCCTGTTC
AACATGGTGT CGAGCGGTGG CTTGATCACC CTCAATTCGC TCGATGCCCA GGTCTATGGC
CAGGGCGGCG ACAACAATAA TAGCGGCGCG GGCGGCATCG GCACCGGCGG CAAGGCCTAT
ATCGAGGCGC AGGGCGGCAC GGCCCTCGCG ATCGAGGGCA GCACCTTTCT CGATGCGAGC
GGCACCGGCG GGCAGGGCAT AACGGCGGGC GGCGCCGGCA CCGGCGGCGC CGCGCAGATC
TTCGCCAATG GCGGGGACAT CACCCTGGGC GGCGGGCAGA CCATCATCAA CCTCGATGCG
AGCGGCACCG GCGGCGGCAG CGAGGTCGAC GGCCTTGGTT CCGGCGTCGG CGGCACCGGC
ACCGGCGGGG TCGCCGGCTT CGAGGCCAAT GGCGCCACGC TTACCCTGGA CGGCTCGGCG
GTCGCCAATG CCAGCGGCAC CGGCGGCAAC GGCCGAACCG GCGGCAACGG TGTGGGCGGT
CGTCCGCCGG GCCTGGAAAC GCAGGATTAT GATCCCTTCT ACGGCGTCTA TGCGCAGGCG
AGCGACGGCG CGATCAACGC GACGGTCAGC CTGACCCTGG TGGCGGACGG CTATGGCGGC
AGTACTGCCT CCGACGGCTC CAATGTGGGG CATGCGGGCA ACGGTACCGG CGGCTATGCC
GAAATGCTCG CCACCAACAT CGGTGGCCCC ACCTCCTTGG CGACGATCAG CGCCCCTTCG
GTGCAGATGT CGGCGGCGGG CGCGGGCGGT CACGGCGGCG ATCCCGGCGC GGACGGGGTC
GGCAGCTCCG GCGGCGTCGG CCAGGGCGGC GTCGTCGTCA TGGGCGCCTA TGCCGGCCGG
GGCCAGCTTT CGCTGGGTAC CGTTTCGCTC TTTGCCACGG GTGTTGGCGG CGACGGCGCC
GACGGCGAAT CGCAGAGCAC GGGTAACGGC GGCACCGGCG GCTCCGGCGG CGCCGGTATC
GGCGGCAATA TCCAGGCCGG CGTCTTCAGC GGCCCGCAGA CCGCCGAGAA CAACGGCTCG
GCCGACATCC AGAGCCTCCA AATGGATGCG TCGGGCCTGG GCGGCGCGGG CGGCAGCGGC
GGGTCGTCGG GCGAGGGCAC GGGCGGCAAT GGCGGCGCGG GCGGCGCGGG AATCGGCGGC
GGTGGTACCG ACGCCGGCAG CCTTGCCATC CTGGCGCGCG GCGCGCCGGT GACGATCGGT
TCGGCCACGC TCAATGCCTT CGCCCAGGGC GGGGTGGGCG GCAGCGGCGG CAGCGGCTTG
AGCACCTCGG GCACCAACGG TGCCGATGGT GCCGGCAAGG GCGGCCATTT CGCGCTCCTC
TCGACCTATC GCTATCAGAC CACGACCGTG CCGGGATCGC TCTCGATCGA CGTGCTCAAT
GCCGATGTTT CCGGTGCCGG AAACACCGAA GCGCCGAGCG GGCCGACCAC AGCCGGCAAC
TTCGTGATCG CGACGAACGG CGGCGACCTG ACGATCTCCA GCGGGGACGT GTATGCCAAT
GGCCAGCTCG CCCCGACCAA CCAGTTCCAG AACGTATTCC TCGACGTGAC GACGGTGAGC
GAGGTCTCGG CGGGCAACGG CACCGTCACG CTCGGCACGT CTCCCAACAC TTTCAGGATC
CACACCGAGG CGGCGCAGGG CTTCCATCCG GTCAACTTCT ACACCGATCC GGGCGGCCAG
TTCGCGGCCA ACCTGGCCAA TTGCACGCTC AATGGCGTGG TCTGCCAGTC GGTTGCGCGG
GCGGTGTCTC CGCCGCCGCC CCCACCCCCG CCGCCGCCGG TGTCGCCGCC CCCGCCGCCA
CCCCCGCCGC CGGTGTCTCC GCCGCCGCCA CCGCCGCCTC CGCCGGTGTC GCCGCCCCCG
CCGCCTCCGC CGGTGTCGCC ACCTCCCCCG CCGCCGCCAC CGCCGCCGGT GATCGAGGAT
CCGGCGGTGA ACGAGACGGT CACCACCGAG ACGACGAACA TCACGTCGAG CATCCAGTCG
ACGCTGACCG GCTCGCGCAC CGCGGGCGGC AGCATCAAGA GCACCGAGAC GACGGGAACC
ACCGCGCCCG GTTCGGACCC GGCCGGCGGC TCGGCCGCGT CGGCGGGCGA TGACGAGGGC
GGCGATGGCG ACGGCAGCGA CGATGCCGGC GGCTCGACCT CGAGCGGCGG CAGCGTCGGC
GGCCCGAACC TGCTGATCGA CACGAGCAAC ATCGGGTCGG ACGCGACGCA GATCGACACG
CCGGTGCTCA GCTCGGGCAA TAGCAGCCTG TGGCCGGGGG CCGACGGCCT CACCGACACG
GGCGGCGCGG GCGATGGTCC GTCGCCCGCC GGTGCCGGCG GTGCGTCCCC GGGCGGATTG
CCGGTCGGTG CGCCGGGCGC GGTCCCGGGC GGCAGCGATG CGACGCCGTC TTTCGGCAGC
ATCGGCAACC AGATCTTCGA CGAGGGCGCC GCGGGCGCCT CGGGCGACGG GAGCGGTGGC
CAGTCCTCGG GCGGGGGCGG CCAGGCGCCG GCGCCGGGCG GCTCGTCGCC GTCGTCCGAC
AAAGACCGAT CCTCGGATGG AGGGAAGCAG CAGTGA
 
Protein sequence
MRSSVSDIFA AERRRQRSMP RTMTSMAVLA TVLAMPVKES LAANSYGFLG IPDGCVTATC 
SGTGYTIDRT FATSTDTVTV NAPSALITWV PNDTNGGTAS VQDAINFLPS GNTVNYVANG
ALPNGEYTIL NRIVPQPNAD GRSIALNGTI TTDTAKGNVW FYSPNGIVVG STASFDVGGL
LLTTSNITAG NVTLDGDGYV TAVNTQTSIA GAAVTIQSGA DITAGNYFGI LAPKIDQGGS
VESDGGIAYI SAEQAKLSID DASGLFDIEV GVGADSMGGI LHGGTSRGVI ATADGLVHPI
YFVAVPKNDA ISMLLGGNIG FQVAQGAEAT EHGIVLSAGR DIAFGSLNAG GGLSQGAGTG
SVSISGLPTL NAGTLIQAGV PLTVSGNVQI AANGAPVTID AASDINFTGT VSIDTSVLGN
PVAPVQGGAI SLLAQGGTID FGSDVTLLAN AQAFDEQNGT GGAILISALG PNSGIAFHGG
LTGAADGNGG LPSSVGDGGD GFGGTILVTA GGGGSITADT GMSLTAVGRG GETFEGTGGN
GQGGSVALVA NPGGTITVPQ GSITLDASSV GGNVYQTGTG GNALSVLPGS GSPKAYVALQ
ANGGVIDLGS DQVPVDLYLR ANGTAGDGPT GGTAHGGAIE ILASGAGGGI NANSAVSAQA
YARVGSDGSF GGGNATGGLF NMVSSGGLIT LNSLDAQVYG QGGDNNNSGA GGIGTGGKAY
IEAQGGTALA IEGSTFLDAS GTGGQGITAG GAGTGGAAQI FANGGDITLG GGQTIINLDA
SGTGGGSEVD GLGSGVGGTG TGGVAGFEAN GATLTLDGSA VANASGTGGN GRTGGNGVGG
RPPGLETQDY DPFYGVYAQA SDGAINATVS LTLVADGYGG STASDGSNVG HAGNGTGGYA
EMLATNIGGP TSLATISAPS VQMSAAGAGG HGGDPGADGV GSSGGVGQGG VVVMGAYAGR
GQLSLGTVSL FATGVGGDGA DGESQSTGNG GTGGSGGAGI GGNIQAGVFS GPQTAENNGS
ADIQSLQMDA SGLGGAGGSG GSSGEGTGGN GGAGGAGIGG GGTDAGSLAI LARGAPVTIG
SATLNAFAQG GVGGSGGSGL STSGTNGADG AGKGGHFALL STYRYQTTTV PGSLSIDVLN
ADVSGAGNTE APSGPTTAGN FVIATNGGDL TISSGDVYAN GQLAPTNQFQ NVFLDVTTVS
EVSAGNGTVT LGTSPNTFRI HTEAAQGFHP VNFYTDPGGQ FAANLANCTL NGVVCQSVAR
AVSPPPPPPP PPPVSPPPPP PPPPVSPPPP PPPPPVSPPP PPPPVSPPPP PPPPPPVIED
PAVNETVTTE TTNITSSIQS TLTGSRTAGG SIKSTETTGT TAPGSDPAGG SAASAGDDEG
GDGDGSDDAG GSTSSGGSVG GPNLLIDTSN IGSDATQIDT PVLSSGNSSL WPGADGLTDT
GGAGDGPSPA GAGGASPGGL PVGAPGAVPG GSDATPSFGS IGNQIFDEGA AGASGDGSGG
QSSGGGGQAP APGGSSPSSD KDRSSDGGKQ Q