Gene Swit_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_2047 
Symbol 
ID5197398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp2298592 
End bp2299644 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content62% 
IMG OID640581591 
ProductAraC family transcriptional regulator 
Protein accessionYP_001262544 
Protein GI148554962 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAAGC ACAAACATGT TTCGCGCCTG CGCTGCGAGG AGGTGCCGGT CGATCTCCTC 
ATCGCCCTGC TCGGCTGCGT CGCGAAGGCC GGCGGCAGTG CGGACGGCAT CCTCCGCGAC
GCGGGCGCCA CCTATCGGTT TCGCGAACTG AAGCGGCGAC GCAACTGCAC GATCGCGGAG
TTCACGCGGG CCAATCGCCT GTGCAACGAA TATCTGCGCG GCCACATTCT CGCCACCACC
GGCTGCCCGA CGCTGAACGA ACAGCAATTC TACCTGCTCG CCAAGTGCCT CGCCGCCTGT
TCCGATCTGG AGGAGGCATT GCGCACGACA GCCGCCTTCT TCGCGATGTT CGAGGGGAGG
ATCGGCGAAG CCCATTTCGA GGTGCGCGGC GAGCGGGTAC ATCTTCACAT CAACCCTCCC
CGTCGCGAGA AGAACGAGGC CGGCTTCCTC GTCGACATAT ATGGCTATGC GATCCTCCAG
ATATTCCTCG GCTGGCTGCT GGACGAGCAG CCGATCTTCG ATGCGGTCGA TCTCATCTAT
CCGCAGCCGG CTCGGGAGAG CGTCCATCTC GGGCTTTTCG ACTGCCCCAT CCGTTTCGGC
CAGCCGAGCA ATCGCTTCAG CTTCAGGAGC GATCTCCTCT CGCAGCCGGT CGTTCGCGAT
CAGGCGAGCC TCATGAAGCT GCTCGCGGAT TTCCCGTTCA ACCTGATGCT CGACGAGGAG
CAGCGCAAGC TGTGCGACCG GGTCTATACC GCAATGATGA ATAGCTACAT GAGAAGCCAT
ATGCTGCCGA CCATAGATGA TGTGGCGAAG CTGTTCAGAA CCTCGACCTG GACCTTGCGC
CGCCGCCTGA CCGAGGAGGG CACCGCCTAT TCCTCGATCA AGAAGAAGTG CCAGCTCAAC
CTCGCGACCG AGTTCCTCAA GCGATCCGAG ATGACGATCG ACGAGATCGC GGATATCGCC
AATTTCAGCG ACGCCAACGC CTTCCGCCGC GCCTTCCACC AATGGACCGG CTGTTCGCCG
ACCGCCTATC GCAAGGAACT CCTCGCCGTT TAA
 
Protein sequence
MGKHKHVSRL RCEEVPVDLL IALLGCVAKA GGSADGILRD AGATYRFREL KRRRNCTIAE 
FTRANRLCNE YLRGHILATT GCPTLNEQQF YLLAKCLAAC SDLEEALRTT AAFFAMFEGR
IGEAHFEVRG ERVHLHINPP RREKNEAGFL VDIYGYAILQ IFLGWLLDEQ PIFDAVDLIY
PQPARESVHL GLFDCPIRFG QPSNRFSFRS DLLSQPVVRD QASLMKLLAD FPFNLMLDEE
QRKLCDRVYT AMMNSYMRSH MLPTIDDVAK LFRTSTWTLR RRLTEEGTAY SSIKKKCQLN
LATEFLKRSE MTIDEIADIA NFSDANAFRR AFHQWTGCSP TAYRKELLAV