Gene Swit_3859 details


Gene Information

Locus tag: Swit_3859
Symbol:
ID: 5197979
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingomonas wittichii RW1
Kingdom: Bacteria
Replicon accession: NC_009511
Strand:
Start bp: 4245259
End bp: 4246299
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 73%
IMG OID: 640583414
Product: AraC family transcriptional regulator
Protein accession: YP_001264342
Protein GI: 148556760
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
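
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4245259
END_BP = 4246299
STATED_GENE_LENGTH = 1041    # bp
STATED_PROTEIN_LENGTH = 346  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1041 346
```

Under those assumptions, both computed values agree with the stated Gene Length and Protein Length.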


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTCCCC CCGCTCCCTT TCCATCGCTG ACCGTCGACG CGGTCCGCCC CCTGCTCGGC 
GTCATGGAAG CGGCCGGACT CGCCCCGGCG CATGTCCTGC GGCGCGCGGG CCTCCCGACC
GACCTGTTCG CGGGGCCGGG AAACGGCGCG CTGCGCCTGT CCGACTATTT CCGCATCTGC
GAGCAGATGG CGCTGCTCGG CGGCGACGAG AGCTGCCATG TCTCGCTGCG GCCGCTGATG
GTCGGCACGT CCGAGCTGGT CCAGGCCCGG CTGCGCGGCT GCACGACGAT GGCGGAGGTG
ATGGAGGTGC TGGCCAACAG CTACAACATC ATCCACGGCC ATCGCTACAA CCAGGTCCAG
CGGCGCGGGC CGTTGATCAG CTACGCGATC GACGACGCCG ACTTCCCCTA TGCGTTCGAC
CCGGACGACG CCTTCGTCAT CCTCTCGCTC GAATGCCTGC TCGTCTACGT CCATGTCCTG
CTGCTATCGC TCGCGCCGGG CGCCGGGCCG ATCCCGCTCC GTTCGGTCCG CACCCGCGGC
CCCGCCGCCG GCCGCAGCCA CCTCGCCTTC CTCGGCGTGC CGGTGAAGGC CTCGGCCGGC
CTGTTCGGGC TCGACTATGA CGCGGCGCTC GAAGGCGTCG GCGTCGCCCC GGCGCAGAGC
CCGGTGCTGT CGGCGCGCAC CATCTATGGC GGGGTCGCCG ACATGCTCGA CCGGATCGGG
CCGGTCGCGG CGGACGCGCC CGACGTCATC GGCCGGGTCG AGCGCGAGCT CGCGCGCGGG
CGGCTCGACC AGGCCGAGGT CGCGTCGGCG CTGGGGATGA GCGTCGCCTC GCTCCGCCGC
CGGCTCGCCG AGGCCGGGCT CGCCTTCCGC GACCTGCGCG CGCGCTATCT GAACAGCATC
GCGCGGGCGG CGCTGGAGGA CGGCGGCAGC ATCGCCGACA TCGCCGAAAC CCTCGGCTTC
TCGGACGGAC GCAGCTTCGC GCGCGCCTTC CGCCAGTGGA ACGGCGTCGC GCCGGGGGAC
TATCGCCGCA GCACCGACTG A
 
Protein sequence
MSPPAPFPSL TVDAVRPLLG VMEAAGLAPA HVLRRAGLPT DLFAGPGNGA LRLSDYFRIC 
EQMALLGGDE SCHVSLRPLM VGTSELVQAR LRGCTTMAEV MEVLANSYNI IHGHRYNQVQ
RRGPLISYAI DDADFPYAFD PDDAFVILSL ECLLVYVHVL LLSLAPGAGP IPLRSVRTRG
PAAGRSHLAF LGVPVKASAG LFGLDYDAAL EGVGVAPAQS PVLSARTIYG GVADMLDRIG
PVAADAPDVI GRVERELARG RLDQAEVASA LGMSVASLRR RLAEAGLAFR DLRARYLNSI
ARAALEDGGS IADIAETLGF SDGRSFARAF RQWNGVAPGD YRRSTD
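
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record: it assumes the gene sequence is given 5' to 3' on the coding strand, builds the codon table from the standard NCBI layout (translation table 11 assigns the same amino acids as the standard code but accepts additional start codons), and translates an initial start codon such as GTG as methionine. The helper names `clean`, `translate_cds`, and `gc_percent` are hypothetical.

```python
# Codon table in NCBI order (first, second, third base each iterate over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons accepted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str, has_start: bool = True) -> str:
    """Translate whole codons until the first stop codon.

    If has_start is True and the first codon is a table 11 start codon,
    it is translated as M regardless of its usual meaning.
    """
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and has_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 and last 21 bases of the gene sequence listed above.
FIRST_BASES = "GTGAGTCCCC CCGCTCCCTT TCCATCGCTG"
LAST_BASES = "TATCGCCGCA GCACCGACTG A"

print(translate_cds(FIRST_BASES))                   # MSPPAPFPSL
print(translate_cds(LAST_BASES, has_start=False))   # YRRSTD
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence. Passing the full gene sequence to `translate_cds` and `gc_percent` gives values to compare against the Protein sequence and GC content fields.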