Gene Sala_1024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1024 
Symbol 
ID4082307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1057627 
End bp1058655 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content68% 
IMG OID638009384 
ProductLacI family transcription regulator 
Protein accessionYP_616074 
Protein GI103486513 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACAT TGAGGGACGT CGCCCGCGAA GCAGGAGTTT CGGTCGCGAC CGCATCGCGC 
GCGATCAACG GTCTGGGCAA TGTCACCGCG CCGACGCGCG CGGCGGTGAT GGCGGCGGTA
AAGAAGCTCA ATTTCGTGCC GCACAGCGGC GCGCGCAGCC TGACGCGGCG CAGGACCGAC
ACTGTCGGGG TCATCCTGCC CGACCTGTTC GGCGAGTTCT TCTCCGAGAT CATTCGCGGT
ATCGACCTCG TTGCGCATGA ATCGGGGATG CACCTGCTGC TCGGCAACAT GCACGGCAGC
ACGCACGAAA CCGCCGCTGC GATCGCGGCG ATGCGCGGCC GCGTCGATGG CCTGCTCGTG
ATGCCGCCGG ACCTCAAGCC CGAACTGCTC GCCGACTATC TCGACCCCGC GCTGCCGACG
GTGCTGCTCA ATTACGACGC CGGGCCGCTC GATCTTCCCT TCGTCGCGGT CGACAATTAT
CAGGGCGCCT ATCGGATGAC CGAAGCGCTG CTGGCGCGCG GCGCGAAACA GGTGGTCCAC
ATCGCCGGCC CCAAGCATAA TCGCGACGCG CGCGACCGCC AGCGCGGCTT CACCGATGCG
ATGGCGAAAA TCGGCGGCGT CCGCACGCCG GCAATCCTGC CGGGCGACTT TTCCGAGGAA
AGCGGCGCGC AGGCGGCGCG GCTGCTGGTG CAGGGCCAAT TGCCCGCCGA CGCGGTGTTC
GCGGCCAACG ACCAGATGGC GGTCGGGCTG ATCGCGGGGC TGGCCGAAGC GGGCAAGTCG
GTGCCGAGCG ACATCATGGT CGCGGGCTTC GACGATATTC CGCTCGCGCG TCACCTCAAC
CCGTCGCTTT CGACGATGCA GGTGCATATC GACCGGTTGG GATCGACCGC GATGATGCTG
CTGCTGCGCA TGTTGCGCGG CGAAACGCTT GGCGCGGCGA ACGCGACGAT CCTGACCCCC
GAGTTGGTCA TCCGCGGGAC GACGGGCGGT GAAACCGCGC CCGTACCCAG CAGCGGGGCG
CGCCCGTGA
 
Protein sequence
MATLRDVARE AGVSVATASR AINGLGNVTA PTRAAVMAAV KKLNFVPHSG ARSLTRRRTD 
TVGVILPDLF GEFFSEIIRG IDLVAHESGM HLLLGNMHGS THETAAAIAA MRGRVDGLLV
MPPDLKPELL ADYLDPALPT VLLNYDAGPL DLPFVAVDNY QGAYRMTEAL LARGAKQVVH
IAGPKHNRDA RDRQRGFTDA MAKIGGVRTP AILPGDFSEE SGAQAARLLV QGQLPADAVF
AANDQMAVGL IAGLAEAGKS VPSDIMVAGF DDIPLARHLN PSLSTMQVHI DRLGSTAMML
LLRMLRGETL GAANATILTP ELVIRGTTGG ETAPVPSSGA RP