Gene Swit_4254 details


Gene Information

Locus tag: Swit_4254
Symbol:
ID: 5199299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingomonas wittichii RW1
Kingdom: Bacteria
Replicon accession: NC_009511
Strand:
Start bp: 4688135
End bp: 4689166
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 65%
IMG OID: 640583808
Product: AraC family transcriptional regulator
Protein accession: YP_001264732
Protein GI: 148557150
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
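
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 4688135
end_bp = 4689166


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1032 343
```

Both results agree with the reported Gene Length (1032 bp) and Protein Length (343 aa).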


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value: 0.998994
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATATCCT ATCGAGAAAA CCCCGCGAAC GAGCCATCGA GCATAGCCTC CTATTGCCTG 
ATGCTCGCCG ACCTGTTCGA GGCCCAGGGG CTCGACGCCG GCCGGATATT CCTCAACGCG
GGCCTCAGCC TGTCGGCGCT GCGCGGATCG ACGGGGCGCG TTCCCATCTT CCAGATCAGG
CTGTTGTGGC AGCAGGCGGT CGCGCTCAGC GGCAATCCCT CGATCGCGCT GGGCACGAGC
CGGGTCCTGT CGCCGATGAG CTACAGCTCG ATGTCGATCG CCGCGCTGAG CAGCGGATCG
CTGCGCGGCG CCATCGACAA ATATGTCCGC TATTCGCGCA TCGTCACCGA CGCGATCGAC
ATCAGGGTCG AGGACAGCGG AGATTTCACC TCGGTGATCG TCAAGAACTA TAACGAGTTC
AGGGCTCCGG AGGCGATGGA ATGCTGCCTG ACGAGCATCC TGACCCACTG CCGGCAACTG
CTTCCGAACG ACGGGATCGC GCCGGCCGTC ATCGAACTGG AGCGGGACAA GCCCTCCAAC
GAGAATCTCT TCTCCTCGCA GATGCAGTCG AAGATCCGCT TCTCGGCGCC CGGCAACCGG
CTGATCTTCC GGAACCGCGA TATCGCCAAA CCCATTCCGG GCTATTGCCG CGAGCTCGAG
ACGCAGATCG TCCAGCATTG CGACATCATG CTGGGCCAGC GCGCGAGCGT GAGCTGGTCG
ACGAGGGCCC GGCAGCAGAT ATTGCGGCAC CTGCACGTCG GCACGGTCAG CGAACGATCG
GTGGCCGGCG GCCTGCACAC CAGCGTCGAC ACGCTGCGCA GGCGGCTGAA CGGCGAGGGG
ACCTCCTATC GGGCGCTGCT CGACGACGTC CGGCGGACGC TGGCCGAGCG ATATCTCGAA
AATCGGGATC TCTCGATCAA GCAGATTTCC GGGACGCTGG GCTTCGCCAA CAGCAGCGCC
TTCGTGCGGG CGTTCCGGCG GTGGACGGGG CACGCGCCCG GTATCCACCG CCAGGAAGGC
GCACTCGGCT GA
 
Protein sequence
MISYRENPAN EPSSIASYCL MLADLFEAQG LDAGRIFLNA GLSLSALRGS TGRVPIFQIR 
LLWQQAVALS GNPSIALGTS RVLSPMSYSS MSIAALSSGS LRGAIDKYVR YSRIVTDAID
IRVEDSGDFT SVIVKNYNEF RAPEAMECCL TSILTHCRQL LPNDGIAPAV IELERDKPSN
ENLFSSQMQS KIRFSAPGNR LIFRNRDIAK PIPGYCRELE TQIVQHCDIM LGQRASVSWS
TRARQQILRH LHVGTVSERS VAGGLHTSVD TLRRRLNGEG TSYRALLDDV RRTLAERYLE
NRDLSIKQIS GTLGFANSSA FVRAFRRWTG HAPGIHRQEG ALG
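
The protein sequence can be re-derived from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is listed 5' to 3' on the coding strand and that sense codons follow the standard assignments, which translation table 11 shares with the standard code. All names (`CODON_TABLE`, `clean`, `translate`, `gc_percent`) are hypothetical helpers introduced for illustration.

```python
# Codon table built in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence above (60 bp, i.e. 20 codons).
first_row = "ATGATATCCT ATCGAGAAAA CCCCGCGAAC GAGCCATCGA GCATAGCCTC CTATTGCCTG"
print(translate(first_row))  # MISYRENPANEPSSIASYCL
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein and the reported GC content to be checked in the same way.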