Gene Sala_0074 details


Gene Information

Locus tag: Sala_0074
Symbol:
ID: 4082147
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 80291
End bp: 81436
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 64%
IMG OID: 638008435
Product: thioredoxin-like protein
Protein accession: YP_615133
Protein GI: 103485572
COG category:
COG ID:
TIGRFAM ID:
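
The start and end coordinates, gene length, and protein length listed above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 80291
END_BP = 81436


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumption)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions the coordinates give 1146 bp and 381 residues, which agrees with the Gene Length and Protein Length fields.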


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.246531
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAGGGCT GGCACGCATC AGAAGGAAAT AGGTCAATGC TACGCAAAAG ACGCGGTTTG 
GGATTTGCGG CGATCGCGGT TACCGCGGCG ATCGGCGCGG TCGTCTATTG CTTCGGCGAC
ATACCGGGCG GCCGCTTGAA CCCGACGCCC AGCCCGATCC CGAGCGCCGC CGCCGCCGAA
TATCCCGGCC AGCCGCTTGT GGCCCTTGCC GGCAACGGTC CCTGGCTCAA CACGACTGAG
CGCACGCCGC AGGCACTGCG CGGCAAAGTC GTTCTTGTCA ATTTCTGGAC CTATTCGTGC
ATAAACTCGC TGCGGCCCCT GCCCTATATT CGCGACTGGG CCGCGAAGTA TAAAGACGAC
GGTCTGATCG TCATCGGTGT CCACACACCC GAATTCGCTT TCGAAAAGGA TGGTGACAAG
GTCCGACGCG CCGTGGCGGA ACTCGGCGTC ACCTGGCCGG TCAAGCTCGA CAGCGACTAT
GCGACCTGGA GGCTGTTCGG CAACGACGGC TGGCCAGGAT TCTATTTCAT CGATGCCAAA
GGCCAGGTCC GCCACCATCG TCTCGGCGAG GGCGATTATG CGGCGTCCGA ACGGCTGCTC
CAGCAACTCC TCGCCGAAGC CAAGGCTGCG CCAATCAATG AAAAACTGAC TGGCGACATC
GGCAAGGGGA TAGAGGCGGC TCCCGATTGG GACCAACTGC GCTCGCCCGA GACCTATGTC
GGATACCGAC AGGCCGACCG CCTCGCCGCG CCGCAACGGT TGAAGCGGGA CGCTCCCCTC
ACCTATAGCC TTCCCTCTTC CGTCCCGGCC AACCAATGGG GCTTGGGCGG AGCATGGACC
GTCGGCGCCG AGTCCGCACG CGCCGACGCC GCGTCGGCGA AGATACGCTA TCGCTTCGAG
GCGCGTGACC TGCACATGGT CCTCGGCGCG CGCGGCGACG GAACACCCGC CCGCTTCCGC
GTGACGCTCG ATGGTTTGCC GCCCGGAACA GATCATGGTG TGGATACCGA TGCGAACGGC
ATGGGCACCG TTACCAAGGA CAGGCTCTAC CAGCTCGTTC GCCAGTCCGC GGCCGTCAGA
GCCCGGACAT TCGAGATCGA ATTCCTCGAC CCCGGTGCTC GCGCCTATGT CTTCACCTTT
GGATAG
 
Protein sequence
MKGWHASEGN RSMLRKRRGL GFAAIAVTAA IGAVVYCFGD IPGGRLNPTP SPIPSAAAAE 
YPGQPLVALA GNGPWLNTTE RTPQALRGKV VLVNFWTYSC INSLRPLPYI RDWAAKYKDD
GLIVIGVHTP EFAFEKDGDK VRRAVAELGV TWPVKLDSDY ATWRLFGNDG WPGFYFIDAK
GQVRHHRLGE GDYAASERLL QQLLAEAKAA PINEKLTGDI GKGIEAAPDW DQLRSPETYV
GYRQADRLAA PQRLKRDAPL TYSLPSSVPA NQWGLGGAWT VGAESARADA ASAKIRYRFE
ARDLHMVLGA RGDGTPARFR VTLDGLPPGT DHGVDTDANG MGTVTKDRLY QLVRQSAAVR
ARTFEIEFLD PGARAYVFTF G
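
Both sequences above are printed in blocks of 10 characters, 60 per line, so the spacing has to be removed before they can be used. The following is a minimal sketch of that cleanup plus a simple GC fraction calculation. The helper names are hypothetical, and only the first line of the gene sequence is included as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First line of the gene sequence, copied as printed in the record.
FIRST_LINE = "TTGAAGGGCT GGCACGCATC AGAAGGAAAT AGGTCAATGC TACGCAAAAG ACGCGGTTTG"

if __name__ == "__main__":
    dna = clean_sequence(FIRST_LINE)
    print(len(dna), round(gc_fraction(dna), 3))
```

Running the same two helpers over the full gene sequence would allow the reported length and GC content to be compared with the values in the Gene Information section.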