Gene Sala_1028 details

Gene Information

Locus tag: Sala_1028
Symbol:
ID: 4082311
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1062032
End bp: 1063222
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 71%
IMG OID: 638009388
Product: Fmu (Sun)
Protein accession: YP_616078
Protein GI: 103486517
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0144] tRNA and rRNA cytosine-C5-methylases
TIGRFAM ID:
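
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(1062032, 1063222)
print(length)                  # 1191
print(protein_length(length))  # 396
```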


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCCCG CTGCGCGCAT CCAGACCGCC ATCGAAATCC TCGACGCCAT CGCTGCCGCC 
GCGCGCGAGG GCGGGGCGCC TGCGGACGCG ATCCTTGCCG AAGCGATGCG CGCGCGCCGC
TATGCGGGGT CGAAGGACCG CCGTGCGATC CGCGCGCTCG TCTATGACGT GATCCGCGCC
GTGCGCTCGG CGCCCGAGTC GGGCCGCGCG GCGATGCTCG CGCTTGCCGA TGCGCAGCCC
GGCCTAGCGG CGCTGTTCGA CGGTTCGGCT TATGGCCCGG CGCCGATCGC CGCCGACGAG
CCGCGCGCGC AGACAGGCGT TGCCGCCGAG GCGCTGATCG ACCTCTTCGA CCCGCTCGTC
GGCGAGGAAG AGCATGACTC GCTGCTGGCA CGCGCGCCGC TCGACCTTCG CGTCAATCGG
ATCAAGGCGG GGCGCGCTGA CCTCGAGACA CTGTTTCCCG AAGGCGCGCC GATCCCCGGC
GCGCCCGACG GATGGCGCCT GCCGCCCGAA ACCGCCGCAG CCCAGCACCC CGCCTATGCT
GAAGGCGCGT TCGAGGTGCA GGATGCCGCA AGCCAGTACG CGTCGGCCGC GCTGGCCGCC
GCGCCGGGGC AGGCGATCGT CGATCTGTGC GCCGGTGGCG GGGGCAAGAC GCTCGCGATC
GCTTCGCTGA CCGGCAATGC GGCTGACATC CTTGCCTGCG ACACCAATCG TGCGCGCCTG
CAACAATTGC CGCGGCGTGC CGAACGGGCC GGCGCAACGC GGATCGCAAC CCGGCTGCTC
AATCCGGGGC AGGAAGTTGC GATGCTCGCC GACTGGCAGG GGAGGGCGGA TCGCGTTTTC
GTCGATGCGC CCTGTTCGGG CAGCGGCACC TGGCGGCGCA GCCCCGAACT GCGCTGGCGG
CTCACCCCTG CGCGGCTCGA CCGCCATCTC GGCGATCAGG CGAAGCTGAT CGACCTCGGC
GCCGATCTGG TGGCGCCGGG CGGCAAACTC CTTTATGCTG TCTGCTCGAT CATCGCGCGC
GAGGGGCGGG CACAGGTGGC TGATTTTCTG AACCGGCATC CCGGCTGGAC GGCCGACGCC
GACTATCTTC CCGGCGGTGT CGGGCGCGCC GCGGGCGCCG GTTTCCTGCT GACTCCGGCG
CACGACGGCT GCGACGGATT TTTTCTCGCA CGGCTGACAT CGCCATGTTA G
 
Protein sequence
MTPAARIQTA IEILDAIAAA AREGGAPADA ILAEAMRARR YAGSKDRRAI RALVYDVIRA 
VRSAPESGRA AMLALADAQP GLAALFDGSA YGPAPIAADE PRAQTGVAAE ALIDLFDPLV
GEEEHDSLLA RAPLDLRVNR IKAGRADLET LFPEGAPIPG APDGWRLPPE TAAAQHPAYA
EGAFEVQDAA SQYASAALAA APGQAIVDLC AGGGGKTLAI ASLTGNAADI LACDTNRARL
QQLPRRAERA GATRIATRLL NPGQEVAMLA DWQGRADRVF VDAPCSGSGT WRRSPELRWR
LTPARLDRHL GDQAKLIDLG ADLVAPGGKL LYAVCSIIAR EGRAQVADFL NRHPGWTADA
DYLPGGVGRA AGAGFLLTPA HDGCDGFFLA RLTSPC
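
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is a minimal, dependency-free version: translation table 11 assigns codons to amino acids the same way the standard genetic code does, so a standard codon table is used. The names `translate` and `gc_content` are hypothetical helpers, and alternative start codons are not handled (this gene starts with ATG, so that does not matter here).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above.
first_line = "ATGACCCCCG CTGCGCGCAT CCAGACCGCC ATCGAAATCC TCGACGCCAT CGCTGCCGCC"
print(translate(first_line))  # MTPAARIQTAIEILDAIAAA
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows comparison with the listed protein sequence and GC content.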