Gene Sala_1451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1451 
Symbol 
ID4081533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1500982 
End bp1502088 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content68% 
IMG OID638009816 
ProductDNA alkylation repair enzyme-like protein 
Protein accessionYP_616497 
Protein GI103486936 
COG category[L] Replication, recombination and repair 
COG ID[COG4335] DNA alkylation repair enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCG AAACGCTGCT GCTAAAAAAT CTGCTGGGGC CGCAGGCCGT TGCGACGATC 
GCCGACGCCG GGACTGCCGC GACGCCGCAT TTCGACCGCC CGACGTTCGT GCGCGCCGCG
TCGGAGGGCC TCGATGCGCT GTCGATCATG GAACGCGTGC GCCATATCGC CGATGCGCTG
CACGGCGCGC TGCCAGGCGA TTATGGCGCG ACGCTCGATG CGCTGCGCGC AATGGCACCG
CGACTGACGC ACGGCTTTCA GGCGATCGCG ATCACCGAAG TGGTGGCACG CCACGGCCTC
GACGATTTCG ATCGCTCAAT GGCTGCGCTT GCCGATCTGA CGCGCTTTGG TTCGGCCGAG
TTTGCGATCC GTCCGTTCCT GACCGCCGAT CCCGACCGCG CGCTGGCTAC GATGGGGCGC
TGGACGACGA GCGACGACGA GCATGTGCGC CGCCTTGCGA GCGAGGGCGC GCGGCCGCGG
CTGCCGTGGG CGGCGCGTGT CCCCGCGCTG AAGGTCGATC CGACGCGCGC CGCGCCGATC
CTCGAGGCGC TGAAGGCCGA CCCTGCCCCC TATGTCCGCA AATCGGTCGC GAACCATCTC
AACGATATTG CCAAGGACCG GCCGGGCTGG CTGGTCGAGC GCCTCGCGCA CTGGTCGCAG
GACGACGAAC GCACCGCATG GATCGTCCGC CACGCGCTGC GCACATTGAT CAAGAAGGGC
GACCCCGCCG CGCTCGCGCT GATCGGCGTC GGCCATGGCG CCGCAGTGAC ACTGCGCCGC
TTTGCTGTCG AACCGGCCAG CGTCCGCCTT GGCGACCGGA TCGCCATCAC CGTTGCGTTG
GCGTCGGAGT CACCCGACGA TCAGCCGTTG GTGGTCGACT ACCGCATCCA TTATGCCCGC
CCCGGCGGCA AGAGTGCGCC GAAGGTGTTC AAGCTCAAGA GCTTCACGCT CGCGGGGCAC
GATACCGCCG CGCTGTCGAT TTCACAGACG ATCCGCGATT TCACGACCCG CCGCCATCAT
CCAGGGCGGC ACCGGGTCGA ACTGATGGTC AATGGCCAGG CGATGGCGGA GGCCGCCTTC
GACATCGTTG CCGACGATGG CGCCTAG
 
Protein sequence
MSGETLLLKN LLGPQAVATI ADAGTAATPH FDRPTFVRAA SEGLDALSIM ERVRHIADAL 
HGALPGDYGA TLDALRAMAP RLTHGFQAIA ITEVVARHGL DDFDRSMAAL ADLTRFGSAE
FAIRPFLTAD PDRALATMGR WTTSDDEHVR RLASEGARPR LPWAARVPAL KVDPTRAAPI
LEALKADPAP YVRKSVANHL NDIAKDRPGW LVERLAHWSQ DDERTAWIVR HALRTLIKKG
DPAALALIGV GHGAAVTLRR FAVEPASVRL GDRIAITVAL ASESPDDQPL VVDYRIHYAR
PGGKSAPKVF KLKSFTLAGH DTAALSISQT IRDFTTRRHH PGRHRVELMV NGQAMAEAAF
DIVADDGA