Gene Swit_1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_1100 
Symbol 
ID5198221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp1236880 
End bp1240029 
Gene Length3150 bp 
Protein Length1049 aa 
Translation table11 
GC content71% 
IMG OID640580647 
Producthypothetical protein 
Protein accessionYP_001261604 
Protein GI148554022 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATCGC AACGCCCTAA ACCTCGGCCC CGGCCGTTTC GCTACCTGAC CATGAGCGAC 
CTCAGGCCGG TGTGGCTGAA AAGGACCGCG CGCGGTCTCG TCGTCCTGGT CGGCCTGGTG
TTGATCCTGC TCCTGGCGGG GTCCACGGCC CTTTGGTTCA CGGTGCAAGA CCGGTTGAAG
ACAGCGGACT TCGCGCCGGC GAAACCGCTT TCGTCCTACG CAGACGAGGC GGCCCCTGCG
CCTCGGCCCG CGCCCGCGCC GGCCCCGTTC GACTGGGCCG CGATCGACGC CCCGCCAACC
TCGGCCTTGC CCTGGGTGCG GTGGTGGTGG CCAGGCGGCA ACGTCGAGCC GAGCGAGTTG
CGACACGAGC TCGATCAATT GAAGGCCGCG AACTTCGGCG GTGCCGAGGT CCAGCCCTTC
GCCTTCGGCG TCAAGGCGGT GACGGACAAG GACCCCGCGG CGCGGGCGCG CATCGCCGCC
TTCGACACGC CTGCCTATTT CGCGATCCTG CGCGGCGTGA TGAAGGACGC CGCCGATCGC
GGCCTGCGGA TCGACCTCAC CCACTATAGC GGCTGGCCGG CGGGCAGCCC CGCCGTGGGG
GTCGCGGACG GACTGCAGAG CCTCGCCTGG TCGGAGCGTC GCTTCCGCGG CGGACGCCAG
GTCGAGATCG CCTTGCCGCG CCCGAAGCCC AGCCTCAACG CCTTGCTGCT GGCGGCCTCG
TCACTGATCT CGCCGATGGG CGATATCAGC GATTTCGATC CCTCGCGCGC CCAGTTGCTC
AGCGTCCTCG TGGCCCGGCC AAAGGGCGGG GGCCATAGCC TGTTCTCGAC AAAGGACACG
CTCCGCCTCG ACGCGGCTTC GGTCCAGGTC GTGGACGCTC AGGTGAAGGA CGGCAAGCTC
GTCTGGGACG CGCCGCCGGG TCGCTGGGTG CTGGTCGCCT CGTGGATCCT GCCGGCCGGC
CAGCCGCCGA TGTTCAGCGC GCCCGAGCCG CCGGGCTATG CGGTCGACGT CTTGCGCGCG
GCCAATGTTC GGGCGCACTA CAACTACGCC TTCGGCCGGC GCACGGGCCT GGACGCCGAA
GCCGGCCATG CCTTCCGCGG GATCTTCAAC GACAGCCTGG AATTCGCCGT CGATCGCCTG
GGTTCCGCCG ACATCCTGGC CGAGTTCAGG CGTCGGCGCG GCTACGACCT GCGCCCTCAC
CTGCCGGTCG TCTTCGTCGA CGCCGCCGAC AGCTTCTATG TGAGCGAATT GCTGCCCTCG
CGGAAACCGA ACTTCACCCT GGGCGAGATG GATGACCGCA TCCGCCACGA CTACCAGCTG
ACCCTCTCGG ACCTCGTCGT GGAGCGCTTC CTGGACGAGA CGCGGACGTG GGCCGCGGCG
CGAGGCCTCA AGTCCCGCGG CCAGGGCTAC GGCATGGACA TCGACGTCCT GCGCGCGCTC
GGCGCCAACG ACATACCCGA GACCGAGCAG CTCTACGCCG GCGGCGGCGA GGCCTTCCTG
CGGATGGCCG GATCGGCCGC GGCGCTCTAC GGGCGCGACC TGGTGAGCGC CGAGGCCTTC
GTCTGGGCCG ACCAGGACTA TGCGTCCACG CCCGCCAAAC TGAAAGCCGC GGCCGACAAG
CTGTTCTTGT CCGGGGTCAA CCACGTCATC TACCACGGCT TCCCCTACGA CTGGCGCGCC
GGGGATCGCG ACCGCTGGTT CGGGGATCAG GGCTGGGCCC CCTTCGCCAG CGATCCGATG
GCCGTCTTCT CGGACAACTA TTCGCCACGC AATCCGCTGT GGGCCGATCT GCCGGCGCTC
AACGCCTATA TCGGCCGTTC GCAGAGCCTC CTTCGGCAAG GACGCCAGGC GGCTGACGTC
CTGATCTACT ATCCGTTCCT GGGCTATCGT GCGACGGGCT ACGGCCCCGA CGAGCTCAAG
GAGCCGCTGT TCCTCGGGGC CTTCCCCTCC GCCGCCCCCG CAGGCCCGCC GCCGAGGACG
GGGATGCAAG GAACAGACGA GCGGATCCTC TGGTTGCGCA AGATGGCGCC GGTGTTCGAC
GCGCTGAACC GGCGCGGCCT CACCTGGGGC TGGGTGAATG GCGACGGCCT GCGCAACCAA
CTTCTGCCCG ATGGCCGCAT GAGGTCCGGC GCGTCCTACG GCGCCATCCT GCTGGCCGAG
GTCGAGGCCA TGGCGCCCGA AGACCTCGCC GCCGTCCAGG CGCTGGCGGC GCACAAGGTC
CCGGTGGCCC TGTACGGCCG CACGCCCGTC CGCCAGCCGG GCTATCTGGA CGCCAAGGCC
GGTGACGCCC GTGTGCGCGC ATCCAGCGCG GCGCTGGCGG CCGCAGCATC GAGCGCCCGG
ACGCCCGCGG CTCTCGTGGA CGCTATCGTC GCGGCGAGCA GGCCCGATCT CCGGTTCGCG
GGCCGCAGCG CGTTGCAGCG CTACTCCCGC ATCCTGGCGA ACGGCGGCCG GATCGACTTC
CTGGCCAATC CCGGCGCCAG CGAGGCGAGC ACGCTGATCT CCACCCAGGA CCACGCGCAG
GCCTGGTGGT TCGACGCGCG GACCGGCCGA GCGGCGCCGG CCGTGCGGGA CGCCGAGGGC
CGCATCGCCC TGACGCTCCA GGCCTATGAC TCGCGGTTCC TCATCCGAGG CGTCCCCATG
CCGGCCCACC TCGCCTCGTC CGCCATGACA ACCAGCCCGG TCACGCGCGC ATGGCCGTTG
AAGGGCTGGA GCCTGCGCGT CGGCGGCGAC GTCCGAACGT CGGGGCTCTT CGACTGGCGC
AGCGACGAGG CGCTGCGCCA TTCCAGGGCC GAAGGCGTCT ACAGCGCCAA GGTCACCCTG
CCCGAGTTGG CGCCCGGGGC GCGTCGCGGC CTGCGCCTCG GCATCGTTCC AGGCATGGCC
ACGGTGAGGA TCAACGGCCG CGACGCTGGC CGAGCCAGCC TGCCGCCCGG CGAATTGGAC
GTCAGCGGCC TGCTGAAGCC CGGCGTGAAC CGCATCGAGA TCGTCTACCG GCCGACGCTG
CGCAACTGGA TGATCGGCCG CGCGGCGGAG GGCGACAAGC GCGCCGCGGC GTTCCGGTCG
CGGACCGAGG CGCTTACGCC CGCGGGCCTC CAGCAGCCGA TCAGCATCGT GGAGCATGGC
GCGCCCGACG CGCGCGGCCA TACGAAGTGA
 
Protein sequence
MPSQRPKPRP RPFRYLTMSD LRPVWLKRTA RGLVVLVGLV LILLLAGSTA LWFTVQDRLK 
TADFAPAKPL SSYADEAAPA PRPAPAPAPF DWAAIDAPPT SALPWVRWWW PGGNVEPSEL
RHELDQLKAA NFGGAEVQPF AFGVKAVTDK DPAARARIAA FDTPAYFAIL RGVMKDAADR
GLRIDLTHYS GWPAGSPAVG VADGLQSLAW SERRFRGGRQ VEIALPRPKP SLNALLLAAS
SLISPMGDIS DFDPSRAQLL SVLVARPKGG GHSLFSTKDT LRLDAASVQV VDAQVKDGKL
VWDAPPGRWV LVASWILPAG QPPMFSAPEP PGYAVDVLRA ANVRAHYNYA FGRRTGLDAE
AGHAFRGIFN DSLEFAVDRL GSADILAEFR RRRGYDLRPH LPVVFVDAAD SFYVSELLPS
RKPNFTLGEM DDRIRHDYQL TLSDLVVERF LDETRTWAAA RGLKSRGQGY GMDIDVLRAL
GANDIPETEQ LYAGGGEAFL RMAGSAAALY GRDLVSAEAF VWADQDYAST PAKLKAAADK
LFLSGVNHVI YHGFPYDWRA GDRDRWFGDQ GWAPFASDPM AVFSDNYSPR NPLWADLPAL
NAYIGRSQSL LRQGRQAADV LIYYPFLGYR ATGYGPDELK EPLFLGAFPS AAPAGPPPRT
GMQGTDERIL WLRKMAPVFD ALNRRGLTWG WVNGDGLRNQ LLPDGRMRSG ASYGAILLAE
VEAMAPEDLA AVQALAAHKV PVALYGRTPV RQPGYLDAKA GDARVRASSA ALAAAASSAR
TPAALVDAIV AASRPDLRFA GRSALQRYSR ILANGGRIDF LANPGASEAS TLISTQDHAQ
AWWFDARTGR AAPAVRDAEG RIALTLQAYD SRFLIRGVPM PAHLASSAMT TSPVTRAWPL
KGWSLRVGGD VRTSGLFDWR SDEALRHSRA EGVYSAKVTL PELAPGARRG LRLGIVPGMA
TVRINGRDAG RASLPPGELD VSGLLKPGVN RIEIVYRPTL RNWMIGRAAE GDKRAAAFRS
RTEALTPAGL QQPISIVEHG APDARGHTK