Gene Swit_5067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_5067 
Symbol 
ID5195938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009508 
Strand
Start bp181099 
End bp184008 
Gene Length2910 bp 
Protein Length969 aa 
Translation table11 
GC content60% 
IMG OID640579499 
Producttransposase Tn3 family protein 
Protein accessionYP_001260447 
Protein GI148551017 
COG category[L] Replication, recombination and repair 
COG ID[COG4644] Transposase and inactivated derivatives, TnpA family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.0388929 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTTG GCGTTTCGAG TGGGGATCTG ATCGGCTCCT GGAGCCTGAG TTTTTCGGAC 
ATCGCGTTCG TGACCGGCAA AGCGGAGACG GCACGACTGG GATTGGCGGT TCAGCTTAGA
TTTTTCGCCG GGCATGGCTT CTTTGTGCCG GATCATGCGT CGATACCCTC CGACGGTGTC
TTGTATCTGG CGGAGCAACT TGGTCTCGAT GCCAAATCCG TGAACCACTA TGATTTTTCC
GGGCGCACCG CCCGCCGGCA TTGCGCGGAG ATTTTGCGGC ATCTCGGGTT CCGCCGTATG
ACGCAGACGG ATCGCAGGGC GTTGTCGAGG TGGATTTCCG ACGATCTTTG TGCGGGCGGG
CAGCCGATCA ATGCCATGCT CGAGCATGTT TTCCTGTGGT GCCGCGACCG CCGTATCTAT
GGGCCGTCGC GCAAGGAGCT GGAACGCCTC GTCCGTTCGC AACGACACCT CTATCTGGAG
GCCCTGTTGG CCCGAGTCCG CGATCGGCTT GCGCCGGATG CGGTCGCCTT GCTGGAAGCC
TCGCTCGCCG ATCCCGATGG CCCGACCGGC TTCAACACGA TGAAGGGGGA TGCAGGTCAG
GCGACGCTCG AAAACATTCT TGGCGTGACC GCCAAACTCG CCTTTATCCA ACGGCTTGCT
CTTCCCCGAG ATTTCCTATC GGTCACGGGC AAGGCATGGG TCGATCAGAT CGTTCGCCGG
GTTGCCGGCG AGAAAGCCTC GGAGATGCGC CGGCATGTAC CGGCGCGCCA GCTCGGGCTC
TATGCCGTTT ATCTGATGGC GCGGGAAGCT CAGCTTACGG ATGCGATGGT CGACCTGCTG
ATCGAGACGG TCCATAAGAT CGGATCGCGC TCGAAACGCA AGGTGGTGGG CGATATCGCG
AAAGACATCG AGCGGGTCTA TGGCAAGGAG CGACTCCTGG TCGAGATTGC CAGCGCTTCG
ATCGACGATC CATCCGGGCG CATCTGCGAT GTCATTTTCC CAATCGCCGG CAAGGACAAA
CTGGCGGCGA TCATCAAGGA AAGCCAGGCA AAGGGCGCCT TGGATCGGCG GATCTACAAG
GTGATGCGGA GGTCATGGGC CAATCATTAT CGCCGTATGC TGCCAAGCCT GCTTTCGGCA
CTGGAGTTCC GGTCGAACAA CGCCGTGTGG CGTCCGGTGC TGGCGGCCCT GGACTGGATC
AGAAGCAAAG TGGATGATGG ATGCCGCTAC GTGCCGCCGC ACGCAGTGCC GGTCGACGAG
GTCATTCCGG CGAGATGGCG CAGTTCCGTC ATTGATGAAG AGGGGCGCGT AAACCGGATC
AGTTATGAGC TTTGTGTCCT CGCGCAACTG CGCGATCGCA TCCGTTCTAA GGAAATCTGG
GTTGTCGGGG CGGACCGATA CCGCAATCCC GATGACGATC TTCCCAAGGA CTTCGATGCG
CGGCGAGAAG CATATTACAC AGGATTGAAC CTGACGGCGG ATGCGCGTGC ATTTTCAAGC
GCCATCCGGG AAGAGCTTGC TCAGGAACTG TTGCTCCTCA ATGCCAATAT TCCCCGGAAC
GACAAGGTTC GGCTGCTGTG GCGCGGCGAG AACCGTATAT CTCTCACCCC GTTCAAACCC
TTGCCCGAAC CCAGGGGTCT CGCCTCGATC AAGACCGAGA TCGGCCAACG CTGGCCGATG
ACCGGGCTGC TCGACGTACT GAAGGAGGCT GCCCTTGATA CGGGACTTCT CGAAGCGTTC
GAAACATCGG CCTCGCGTGT TGCACTGCCG AAAACCGCGC TGGATCAACG TCTCCTGCTA
TGCCTCTACG GCCTGGGAAC GAATGCCGGG CTCAAGCGGA TCGCCGGCGC CACCCCCGAT
GTCAGCTATG AAGAGCTGCT GCATGTCCAT CGCCGCTTCG TTCATGCCGC GGCGCTCAAG
GAGGCGTGTG CCAGGGTTGC GAATGCGACC CTGGCAATCC GCAATGCTGC AGTCTGGGGG
GACGCCGGCA CGGCCTGTGC GTCAGATTCC ACAAAGTTCG GAGCCTGGGA TCGCAACCTG
ATGACGGAAT GGCATGCGCG TTATGGTGGA CGGGGCGTCA TGATCTACTG GCATGTCGAA
CGACGCGCGA CATGCGTCTA TTCCCAGCTC AAGCGCTGCT CTTCCTCCGA GGTCGCCTCC
ATGATCGAGG GCGTGCTGCG CCATTGCACC GACATGGAAA TCCAGCGACA ATATGTTGAT
AGTCATGGCC AAAGCGCGGT TGGCTTTGCA TTTTGCCGGC TTCTCGGATT TGAGCTTGCA
CCCCGCCTGA AAGCGATCGC TCGCCAGAAG CTGGCTCTTC CCGATGTCGG CATGCGAACG
CGGCTTCCCC ACTTGCAGCC GATCCTCTCC AGTCCGATCA ACTGGGATGA GATCGAGCAG
CAATATGACG AGATGGTCAA ATATGCAGCC GCGATGCAGA CAAAAACCGC CGACCCGGAG
GCGATCCTGC GCCGGTTTAG CCGCTCCGAG GTGATGCACC CGACCTACAA GGCGTTGAGT
GAGCTGGGCC GCGCGGTCAA GACGATCTTC CTGTGCCGGT ATCTGCGCGA GGAGTCCTTC
CGCCGCGAAA TTCATGAAGG CCTGAATGTC GTTGAAAACT GGAACAGTGC CAATGGGTTC
GTTTTCTTCG GCAAGGGCGG CGAGATCGCC ACTAACCGCA TCGATGAGCA GCAGCTCTCG
GTCCTGGCGC TACATTTGCT GCAAGCGTCG CTTGTCTATG TGAACACCCG AATGCTTCAG
AGCGTGCTGG TGGAACCGAA ATGGACGGGC CGGATGACGC CGGATGATTA TCGCGGCCTC
ACACCGCTGA TTTACAGCCA CGTCAATCCT TATGGCCGCT TCGACCTCGA TCTGAATAGC
CGGATCGATT TTGGGCGGCT TGCTGCCTGA
 
Protein sequence
MSLGVSSGDL IGSWSLSFSD IAFVTGKAET ARLGLAVQLR FFAGHGFFVP DHASIPSDGV 
LYLAEQLGLD AKSVNHYDFS GRTARRHCAE ILRHLGFRRM TQTDRRALSR WISDDLCAGG
QPINAMLEHV FLWCRDRRIY GPSRKELERL VRSQRHLYLE ALLARVRDRL APDAVALLEA
SLADPDGPTG FNTMKGDAGQ ATLENILGVT AKLAFIQRLA LPRDFLSVTG KAWVDQIVRR
VAGEKASEMR RHVPARQLGL YAVYLMAREA QLTDAMVDLL IETVHKIGSR SKRKVVGDIA
KDIERVYGKE RLLVEIASAS IDDPSGRICD VIFPIAGKDK LAAIIKESQA KGALDRRIYK
VMRRSWANHY RRMLPSLLSA LEFRSNNAVW RPVLAALDWI RSKVDDGCRY VPPHAVPVDE
VIPARWRSSV IDEEGRVNRI SYELCVLAQL RDRIRSKEIW VVGADRYRNP DDDLPKDFDA
RREAYYTGLN LTADARAFSS AIREELAQEL LLLNANIPRN DKVRLLWRGE NRISLTPFKP
LPEPRGLASI KTEIGQRWPM TGLLDVLKEA ALDTGLLEAF ETSASRVALP KTALDQRLLL
CLYGLGTNAG LKRIAGATPD VSYEELLHVH RRFVHAAALK EACARVANAT LAIRNAAVWG
DAGTACASDS TKFGAWDRNL MTEWHARYGG RGVMIYWHVE RRATCVYSQL KRCSSSEVAS
MIEGVLRHCT DMEIQRQYVD SHGQSAVGFA FCRLLGFELA PRLKAIARQK LALPDVGMRT
RLPHLQPILS SPINWDEIEQ QYDEMVKYAA AMQTKTADPE AILRRFSRSE VMHPTYKALS
ELGRAVKTIF LCRYLREESF RREIHEGLNV VENWNSANGF VFFGKGGEIA TNRIDEQQLS
VLALHLLQAS LVYVNTRMLQ SVLVEPKWTG RMTPDDYRGL TPLIYSHVNP YGRFDLDLNS
RIDFGRLAA