Gene Sala_3011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_3011 
Symbol 
ID4082955 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3151728 
End bp3152789 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content68% 
IMG OID638011396 
Productaspartyl/asparaginyl beta-hydroxylase 
Protein accessionYP_618049 
Protein GI103488488 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3555] Aspartyl/asparaginyl beta-hydroxylase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.257858 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGGG CGATGACGAC ACCAACGACA TGGCTGGAAC AGGCGCGCGA CGCGCACCGC 
CGTGGCGACG TCGCGGCGGA GACGGCAGCG CTCGACGGCG CGATCCACGC CGACCGCGGC
AATGTCGCCG CGCTGCTCGC CATGGCCGAA CTCAAGCGGC GGCTCGCCGA CGACCGCGCA
GCGGGCAGCT ATTACCGCCT GGCGCTCACC ACCGCCGTGC AGGCGCGCAG CGTCCCGCCG
GCGCTTCATC CCGGCCTTCA GCGCGCCGAG CAATTCCTCG CCGACACCGA GCGCGCCTTT
GCCGACCATC TGCTCGGCCA GCTACGAAGT GCGGGCATCG ATACAAGGAA CGCCGCGCCG
CGTGTCGCCG AGGCGCTGCG AATGCTTGCG GGCGAACAGC CGCTCTATCT GCAACAGCCG
AGCATGTTCT ATTTTCCGGG TCTCGCCCAG CGCCCCTTTT TCGAGCGGTC CGGTTTCGAT
TGGGCGGAAG CGGTCGAGGC GGCGACCGAC GCGATCCGCG CCGAGTTGCT CGCGATGATC
GACGGCGCGA CCGATCCTTT CGGACCCTAT GTCACTACCG CCCCCGGTCG CCCGCCGCCG
AACAACCCGC TGCTCGACAA GCCCGACTGG GGCGCCGCCT GGCTCTGGAA GGATGGCGCG
ATCGCCGACG GCATGGCGGG TCTATGCCCC GCGACCCTCG CCGCGCTCGA ACTGGCGCCG
CAGCCCGTCA TCCCGAACCG CGCGCCGATC GCGCTCTTTT CGCGCCTCAT GCCCGGTACG
CATATCCAGT CGCACCACGG GTTGCTCAAT ACGCGCCTGA TCTGTCATCT GCCGCTGATC
GTTCCCGATG GCTGCGGCCT GCGCGTCGGC GCCGAAACGC GCGAATGGCG CGAAGGCGAG
CTGATGATCT TCGACGACAG TTTCGAGCAT GAGGCGTGGA ACCACGGGGC GAGCGACCGC
ACCGTCCTGC TGTTCGAAAT CTGGCGCCCC GACATCGATA TCGACGAGCG CGAACAATTG
ACGCGCATCT TCGCGGCGAT CGACACCTAT GGCGAACAAT AG
 
Protein sequence
MARAMTTPTT WLEQARDAHR RGDVAAETAA LDGAIHADRG NVAALLAMAE LKRRLADDRA 
AGSYYRLALT TAVQARSVPP ALHPGLQRAE QFLADTERAF ADHLLGQLRS AGIDTRNAAP
RVAEALRMLA GEQPLYLQQP SMFYFPGLAQ RPFFERSGFD WAEAVEAATD AIRAELLAMI
DGATDPFGPY VTTAPGRPPP NNPLLDKPDW GAAWLWKDGA IADGMAGLCP ATLAALELAP
QPVIPNRAPI ALFSRLMPGT HIQSHHGLLN TRLICHLPLI VPDGCGLRVG AETREWREGE
LMIFDDSFEH EAWNHGASDR TVLLFEIWRP DIDIDEREQL TRIFAAIDTY GEQ