Gene Swit_1301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_1301 
Symbol 
ID5200529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp1455465 
End bp1458434 
Gene Length2970 bp 
Protein Length989 aa 
Translation table11 
GC content71% 
IMG OID640580848 
Producthypothetical protein 
Protein accessionYP_001261804 
Protein GI148554222 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAAAC ACGTCGTTCT GGCGGTGACG ATGGCGATGG CGGGCGTTCT CGCGACATCG 
GCGCCCGCCT TGTCCGCAGC GCGGCCGGCC AGCGCCGCCG GCCCGTCGCC GGCCGAAAAG
GCGCGGCTGG AGAAGATGCG CAAGGCCGCC GACGACCTGG ACGAAGGCGC CGATCCGGCG
GCCTATCGCA AGGCGTGGGA AGCGGTGCTG GCCTATGGCG AGACGCTCTA TCCCGCCGGA
CACCCCGAGC TGGCCTGGCT GGAGGGCGAA CTCGTCACCG CCGATTATCT GCAAGGCGAC
GTCACCGGCG CGCTCGCACG GGCCGAACGC CTGGGCGCGC GGCTCGAAGC CGGCGGCCCC
GAATATCGCG ACCGGCGGAT GGAGCTGGCC AATGCCCAGG TCGTGATCCT GATGACGCTC
AGCGACCATG ATCGCGCGCG CAAGCTCGCC GCCCAGGTCC TCGAATGGCG GATCGCCAGG
AGCGAGGGCA AGCCGAGCAG CAACGTCGCC GCGGCCTATT CCAACTACGC CAATGCGGAG
TTCGAGTTCG GCAATTTCGA CAAGGCGATC GAGCTGGTCC GCAAGGCGAA TGCCGAGGAC
CGCCGTCACG AGACCATCCC GGTGAACGCC GCGCCGCGCT TTGCCAACCT GCCGGTCTTC
CTCCTGCAGT CGGGCCGGCT GGAAGAGGCG ATCGAGGAAG CGCGCGCGAC GCAGGCGACG
CTGGAGGGCT TCATGCCCAA GGGCCATCCC TATCTGGCGA CCAACCTCAA TACCCTCGCC
CGGATATTGC TCACGCTCGG TCGCCCCGGC GAGGCGGAGA CGGTGTCGCG CAAGGCCGTG
GACATCGCCG TGGCACGCTT CGGCCAGAGC CAGCAGGCGG TGAGCTACAT GGTCACGGTC
GCGCAGGCGC TGATCGCCCA GGGCAAGACC GACGAGGCGA TCGCCACCGC GCGCCAGGCG
GCGGACATCC TGACCAAGGG GATCGGCCCG GACGCCCAGC GCACGCTGAT GGCGCGCGAA
AGCTATGCCG ATGCGATCGC CGCCTCGGGC GATCGCGAGC AGGCCATCGC GATCCTGCGC
GAGGTGAACG CGACGCGCGA CCGCAAGCTG CCGCCCTTCC ACCGCGACCG CATCACCGGC
CGCGACCAGC TCGCGATCTT CTCGTTCGAG CAGGGCGACA TGGCGGCGGC GAAGACCGCC
CAGACGGAGG CGCAGGCCCT GCGCCGCTCG ACCTTCCCCG CCGAAGATAT CGCCAACCTC
ACCGGCGAGG CGCGGCTGGG CGCGATCGAA GTGCATGGCG GCGACAAGGC GGGCGGGCTG
AAACGCGTCG AGGCGGCGGC GGCGCTGCTC GACGAACGGC TCGCCCGGCT GCGTGCGGCC
GGCACCCGGC GATCGGGCCT CGAGCTCGAG ATCCGGGCCG CCTATGGCTG GGCGCTCGAT
GCGGCGGTGA CCGCCGGCGA CGAGGCGCTG GCCTTCCGCC TGGCCCAGCG GATGATGGAG
AGCTCGGCGG GGCGCGCCGT ACAGGAGGAG GAAGCGCGAT CGGTAGCCGG CGATCCGCAA
CTGGCCGAGC TCATCCGCGA GCGCCAGGAC GCCGCGATCG AGCTAGAAGC CCTGCTCGAC
CGCCAGCTTC GCCTCGCGGG ACGTGGCGCC GACACGGCGA CGATCGAAGC GGTGGCCGGC
CAGCGCAAGG CGGCCGCCGT GCGGCTGGAA CAGCGCACCG CCGCCCTGGC CGCCCGCGCG
CCCCAACTCG TCGCGCCGGC GGGCTCGGAG CCGCTCACGC TCGAAAGCGT GCGCGAGGCG
CTCCGATCCG ACGAGGCATT GTTGATCGCC GGGGTGTCCG AAAGCCGCAC CACCCTGTTC
GCCATCACGC GGGATCGCGT GTCGATGGCC ATGTCCCCGG CCACCGCGCA ACAGATCGCG
ACGCTGGTGA CCAGGCTCCG CACGGGCATC AGCCTACAGG CAGCGGCAGG GGCCGGACGA
AGCTTCGCCC TCCGCGCCGC ATCGCCGACC CGCGCCGCGA TCTCGGATGA CCGGACGGCA
ACGGGTTTCG ATTTCGATGC CTCGGCACGA CTGCACGACA TCCTGTTTCC CAAGGCCATT
CGGAGCACGA TAGCGAACAG GTCCCGCCTG CTGGTCGCGG CCAATGCGAG CCTGACGACG
CTTCCCCTCG CCGTGCTCGC GCCGCAACGG AGCGATCCGA CCCTGCGCAA TGCGCATTGG
CTGATCCGGG ACCATGTCCT CGTCACCCTG CCGTCCATCG CCAGCGTCAC GTCCGCCCGC
TCGACCGGCA CCGGACGGAA GGTGCACAGC TTCTTCGCGG TCGGCGCGCC GGAGCTGGCG
CCCGCGGGAA CGGCCATGGC GTTCCGGTCG GCCGACATGG CCCGGCAGGT CCGGGACCTG
CCCGCGCTGC CCGCCACGGA GCCCGAGCTT CGCACCGTCG GCCGCGCGCT CGCGGCGCCC
GAGCAATCAA TCCTCACCGG CAGCCGCGCG ACCGAGCAGG CGATCCGCAC CGCCGACCTG
ACGCGCACCA ACGTCCTCGC CTTCGCCACC CACGGCCTGA TGGCCGGCGA TCTCGACGGG
CTCGACGAGC CCGCGCTGGT GATGACCCCG GGCGGCTCCG ACGACGGGCT GCTCACCGCC
AGCGAGATCA TGCGCCTGCG CCTGGCCGCG GACTGGGTGA TCCTGTCCGC CTGCAACACG
GCGGCGGGCG GCAGCGGCGA CGACAGCGGC CTGGCCGGCA TCGCCCGGGC CTTCCTCTAT
GCGGGGGGCC GCAACCTGCT GGCGTCGCAC TGGGCCGTCC GCGACGATGC CGCGGCCTAT
CTCTCGGTCG GAACCGTCCG GAAATATGGA CGCGGCGAAG ACCCCGCGCG GGCGCTGCGC
GAGGCGATGC TGCGGATGAT CGACAAGCGG CCGTTCGAAG GAGCCGAGCA GCCGGTCAAC
TGGGCGCCCT TCGTCTTCGT CGGGCGTTAG
 
Protein sequence
MRKHVVLAVT MAMAGVLATS APALSAARPA SAAGPSPAEK ARLEKMRKAA DDLDEGADPA 
AYRKAWEAVL AYGETLYPAG HPELAWLEGE LVTADYLQGD VTGALARAER LGARLEAGGP
EYRDRRMELA NAQVVILMTL SDHDRARKLA AQVLEWRIAR SEGKPSSNVA AAYSNYANAE
FEFGNFDKAI ELVRKANAED RRHETIPVNA APRFANLPVF LLQSGRLEEA IEEARATQAT
LEGFMPKGHP YLATNLNTLA RILLTLGRPG EAETVSRKAV DIAVARFGQS QQAVSYMVTV
AQALIAQGKT DEAIATARQA ADILTKGIGP DAQRTLMARE SYADAIAASG DREQAIAILR
EVNATRDRKL PPFHRDRITG RDQLAIFSFE QGDMAAAKTA QTEAQALRRS TFPAEDIANL
TGEARLGAIE VHGGDKAGGL KRVEAAAALL DERLARLRAA GTRRSGLELE IRAAYGWALD
AAVTAGDEAL AFRLAQRMME SSAGRAVQEE EARSVAGDPQ LAELIRERQD AAIELEALLD
RQLRLAGRGA DTATIEAVAG QRKAAAVRLE QRTAALAARA PQLVAPAGSE PLTLESVREA
LRSDEALLIA GVSESRTTLF AITRDRVSMA MSPATAQQIA TLVTRLRTGI SLQAAAGAGR
SFALRAASPT RAAISDDRTA TGFDFDASAR LHDILFPKAI RSTIANRSRL LVAANASLTT
LPLAVLAPQR SDPTLRNAHW LIRDHVLVTL PSIASVTSAR STGTGRKVHS FFAVGAPELA
PAGTAMAFRS ADMARQVRDL PALPATEPEL RTVGRALAAP EQSILTGSRA TEQAIRTADL
TRTNVLAFAT HGLMAGDLDG LDEPALVMTP GGSDDGLLTA SEIMRLRLAA DWVILSACNT
AAGGSGDDSG LAGIARAFLY AGGRNLLASH WAVRDDAAAY LSVGTVRKYG RGEDPARALR
EAMLRMIDKR PFEGAEQPVN WAPFVFVGR