Gene Sala_0944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0944 
Symbol 
ID4082443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp963812 
End bp965200 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content72% 
IMG OID638009305 
ProductXRE family transcriptional regulator 
Protein accessionYP_615995 
Protein GI103486434 
COG category[R] General function prediction only 
COG ID[COG3800] Predicted transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAGC GGCGGGTGTT TGCGGGGCCG GCGGTGCGGC GGGCGCGGCG CGCGGCGGGG 
ATGACCCAGG CCGCGATGGC CGACGCGCTG GACATATCGC CGAGCTACCT CAACCTGATC
GAGAACGGCC AGCGCCCGCT GTCGGCCACA GTCCTGGTAA AGCTGGCCGA ACGCTTTGCG
TTCGACGCCG CGACCTTGGG CGGCGAGGCG GTGGCGGGGG GCGCGGCGGG GCTCAAACGG
CGGATCGCCG ACCCGCGCTT CGCCGACCTG GGCATCGGCG CCGACGAGGT CGAGGCGTGG
CTCCAGACCG CGCCCGCGAC CGCCGCGGCC TTCGCGCGAC TGTTCGACGC GGCCCCCGAG
GCGGCGGCGG AAGGTGACGG CGACGCACCC GAAGTGGCCG CGGTGCGCCG CGCGATCGAA
AAATGGCGCA ACCATTTCGC CGACCTCGAC GCCCGCGCCG AGGAACTGGC CGACGAGTTG
CGGCTCGCGG GGGGCGACCT CTATGGCACG ATTTCGGAGC GTTTGCGCAC GCGCCACCAG
CTCGGCATCC GCATCCTGCC CAGCGACGTG ATGCCCGACC GCCTCCGCTG GCTCGACTGG
CACGCGCGGC AACTGATGCT GAGCGAACTT CTACGCCCCG CCTCGCGGAC CTTTCAGGCG
GCGGCGACGC TGGCGCAGAT CGAGGCGAAG GGCGAGATCG ACGCGCTGGT CGCGGGCGCC
GAATTCGCCG AAGCCGCCGC GGCGCGGCTG TTCGAGCGGC ACCTGATCCA TTATTTCGCG
GCGGCGCTGA TGATGCCTTA CGGCCGTTTC CTGCGCGCCT GCGATGCGAC GGGATATGAT
CTGCTGCTGC TCCAGCGGCG CTTTGGCGCG GGTTTCGAGC AGGTCGCGCA CCGGCTGACG
ACGCTCCAGC GCGTCGGCGC GCGCGGGCTG CCCTTCTTCA TGCTGCGTAT CGACCGCGCG
GGGCAGGGGA GCAAGCGCTA CGCCGGGGCG AGCCAGTCGC CGCTGACCGA CGGTGACGCG
CGCTGCCCGT TGTGGGGAAT CCACGAAGCG TTCGCGCGGC CGGGCGAGGT GATCGCCGAT
CTGGTCGAGC TGGAGGACGG GACGCGCTGG TTCACGCAAA GCCGCAGCGT CGCCGCCCCC
GGCGCGACCG GGAGCGGCAC GCCCGCGCGC TTCGCGGTAT GCGTCGGCGT CGATGCCAAG
GTCGCGGCGC CCTTGATCGC CGCGCGCGGC ATCGACCTGA TGCGATCGCC CGCAACCCCT
ATCGGTCTGG GATGTCGCCG CTGCACGCGC ACCGGATGCG TGCAGCGGTC AATGCCGCCA
CGCGGCCGCC CGCTGCGGTT CCGCGACGGC GAGCGCGGGG TGAGCGCGTT CGATTTCGCC
GGGGATTGA
 
Protein sequence
MIERRVFAGP AVRRARRAAG MTQAAMADAL DISPSYLNLI ENGQRPLSAT VLVKLAERFA 
FDAATLGGEA VAGGAAGLKR RIADPRFADL GIGADEVEAW LQTAPATAAA FARLFDAAPE
AAAEGDGDAP EVAAVRRAIE KWRNHFADLD ARAEELADEL RLAGGDLYGT ISERLRTRHQ
LGIRILPSDV MPDRLRWLDW HARQLMLSEL LRPASRTFQA AATLAQIEAK GEIDALVAGA
EFAEAAAARL FERHLIHYFA AALMMPYGRF LRACDATGYD LLLLQRRFGA GFEQVAHRLT
TLQRVGARGL PFFMLRIDRA GQGSKRYAGA SQSPLTDGDA RCPLWGIHEA FARPGEVIAD
LVELEDGTRW FTQSRSVAAP GATGSGTPAR FAVCVGVDAK VAAPLIAARG IDLMRSPATP
IGLGCRRCTR TGCVQRSMPP RGRPLRFRDG ERGVSAFDFA GD