Gene Sala_3122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_3122 
Symbol 
ID4082708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3272042 
End bp3274363 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content67% 
IMG OID638011507 
ProductSARP family transcriptional regulator 
Protein accessionYP_618158 
Protein GI103488597 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGGGT CGACATGTGG GTCACCCACG TTTCGGTTGC TGGGGGATTT TGCGGTTGAA 
CCGGCGCTGC TCGCGCCGCA TGGCCGCAAG GCCAGAGCGC TTCTGGCGCG ACTCGCTCTG
GCCGACGGCC CGCTTTCGCG CGACCGGCTC AGCGCGCTCC TGTGGAGCCA TCGGGCCGAG
CCCCAGGCGC ACGCCAGTCT GCGGCAATGC CTGATGGAAC TGAAAGCCTG GACGCGTGCC
TCGCCGCCGC TGCTCCATGC CGACCGCGAC GCGATCGCGA TCGATCATGC GCATGTCGCT
GACGATGTCT CCCTGCTGCT TTCGGCCTGT GCATGCGACG ATGCCGACGC GGTGCTGCGC
TGGCTGCCCG CTGACGAGGT TGCGCTGCTC GCCGATCTCG ACGGGGTCGA CGAAGCCTTT
GACGACTGGC TGGCCGCCGA ACGGGCGCGC AACACCGAGG TGGTCGTCCG CGAGGTGCTG
GCCATGGCCG AACGACTGCT GGTCGCCGGC GATATCGACG GATCGCTGCG CGTTGCCGAC
CGGATGACCC GATTCGATCC GTGCAGCGAA CATGCCGCTC GTCTGGTCAT GCAGGCGCGC
TGGCAGGCGG GGGACGATGA TGGCGTGCGC CACGCATGGC ACCGGCTCGA GGATGCGGTC
GCGCGGGGGC TCGACGGGCG ACCATCGCCG GAAACCGCCA GACTGTATCA TCGGTTGATG
GACGAACCTT CATGCCGGTC GAACGAGGTG GCGCTCCAGA CGGGCAAAAA ACCGCCGCCC
GACCGGCGGA CGCATCGCAT TGCGACCGTT GCGGCAACCG TTGCGGCGTT CGGCCTGATG
AGCGCTGATG CGCCGCGTGA GGGAGGCGGG GAGCCCGCGA TGATGACGCC GACAGTACGG
ATCGAACCGG TCGTTTCGCG CAACCATGGC ATGATTGAAC TTGGTTTCGC CGATGCGCTT
GCAGGCGATT TTGTCCGGCT GGCCAACGCG TCAGGCGGCC TCGTCCGCAT TCTCGATGGA
GAGGCCGTCG GCAGAGGCGA CTTCATCGTT CGTGTCGCGA TCAATCGCGA TGACGGCAAT
TTGACAAGCG AATCGCGCGT CGTCGATGCG CGGAGCGGCG CGATTCTCTG GTCCAGTCGG
CTTTCGGGGT CACCGCGCGA CCTGACGCGC CTGCGCGAAC AGACGGCGGT GTCGGTGGCT
GCGGTGATTG ATTGCGCGCT GCGTCTCAAC GACGGACAGG CGGCGCTTGC CGCCGATGCC
GACCGGCGCG CGCAAGTGTT CGCGATCTGC GACGCTCATG ACGATCAAGA CGGCGCACGC
GCCGTCGCCC TGCTCGAACG GTTTGCAGCG CGTTGGCCCG GCGACAGGGT CGCCCTCGGC
CAGCTAGCGC TGGCCCGCGC AAAGCTCATA TTCGGCGAGG CCGAAACGGC CGAGCAGGAA
CGCCAGCGCC AGCTCGCGAT TCGCGCCGCG CGCCGCGCGC TGGCGAACGA CCCCGCCAAT
GTCTATGCAA TGGTGGCGCT TTCGCAGGCC GGGCGACGCG AACGCTACAT GGTCGACGGC
CTTCCCATGC TGAAGCGCGC GTTGTCCGTC GATCCGCGAT TTTCGACCGC ATTGATGCTC
CAGGCGACAG GACTGTTTCA GGCCGGCTAT GTCGCGGCCA GCGTCCGGCC GTCGATTGAC
GCCGCCAACG CCGATCCCAC CTCGATCGTC AGGGCGCTTG CGGTCGTTCG GCGGCTGGCC
GCGGCGGGGC GGATGAAAGA AGCCTGGGAC CGGCTCGACG CGGTCGCCGC CGTCTGGCCG
GACCATCCCG ATCTCGTCGA GCATCGCTAT CGACTGACGC TCGAACAGCC TGATAGGGTC
CGCGCCGCGG CGCTTGTCGC GTCGAGCGGC GTCGATGACC GATGGCGGCT CGATCGTCTC
ATCCTCCAAC AGGTCGCAAC CGGACGGCAG GACAGCGCCG CACTCGATGC CGCGGCCGAG
ATTGAATATT CGCGATTGCC GGCCGCCGCT TATCAACTCG CCGCGCTCTA CACGCGGATC
GGCGACATCC GGCGGGCGCT GATCTGGCTC GACCGGGCGC CGGTTCGCCA AACCAGCGGT
CAATGGTCGT TACTCTATTG GCCATCGGTG GCGCCGCTCC GCCGCGAACC GCGCTTTTTC
GCCAAGATGG CGCAGCTTGG GCTTGTCGAT TATTGGCGGC GCGAGAATCG CTGGCCTGAT
TTCTGCCGCG AACCAGGGCT CCGCTACGAT TGCCGACGCG AAGCTGACAG ACTGGTTGCC
GCCGGGCGCG CGGTCAATGC CGGAAATGCC GCATTCCGGT GA
 
Protein sequence
MPGSTCGSPT FRLLGDFAVE PALLAPHGRK ARALLARLAL ADGPLSRDRL SALLWSHRAE 
PQAHASLRQC LMELKAWTRA SPPLLHADRD AIAIDHAHVA DDVSLLLSAC ACDDADAVLR
WLPADEVALL ADLDGVDEAF DDWLAAERAR NTEVVVREVL AMAERLLVAG DIDGSLRVAD
RMTRFDPCSE HAARLVMQAR WQAGDDDGVR HAWHRLEDAV ARGLDGRPSP ETARLYHRLM
DEPSCRSNEV ALQTGKKPPP DRRTHRIATV AATVAAFGLM SADAPREGGG EPAMMTPTVR
IEPVVSRNHG MIELGFADAL AGDFVRLANA SGGLVRILDG EAVGRGDFIV RVAINRDDGN
LTSESRVVDA RSGAILWSSR LSGSPRDLTR LREQTAVSVA AVIDCALRLN DGQAALAADA
DRRAQVFAIC DAHDDQDGAR AVALLERFAA RWPGDRVALG QLALARAKLI FGEAETAEQE
RQRQLAIRAA RRALANDPAN VYAMVALSQA GRRERYMVDG LPMLKRALSV DPRFSTALML
QATGLFQAGY VAASVRPSID AANADPTSIV RALAVVRRLA AAGRMKEAWD RLDAVAAVWP
DHPDLVEHRY RLTLEQPDRV RAAALVASSG VDDRWRLDRL ILQQVATGRQ DSAALDAAAE
IEYSRLPAAA YQLAALYTRI GDIRRALIWL DRAPVRQTSG QWSLLYWPSV APLRREPRFF
AKMAQLGLVD YWRRENRWPD FCREPGLRYD CRREADRLVA AGRAVNAGNA AFR