Gene Sala_0951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0951 
Symbol 
ID4082450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp975503 
End bp977344 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content74% 
IMG OID638009312 
Producttetratricopeptide TPR_2 
Protein accessionYP_616002 
Protein GI103486441 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.101575 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTTTC GCCGCGTCTG GACGGGCGCG CTGCTGATCG CGGCGTCGCT TTCGGCGTGC 
GGCGGCGCGT CCGACACGCC GCGCGGCGAG ATGGAGGCGC GCCGTGCCGC TTTGCAGCAG
GCCATAGCCG ACGATCCCGG GGCGATCGCC GAGCGCGTCG CTCTCGCGCG GGTGGCGATC
GCGCTCGGCG ATGGCGTCGG CGCCGAAGCG GCGGTGAAGG GGGCGATCGA GGCGGGGGCG
AATGATGCGG CGTTGCGCCC ACTGCTCGCG CGCGCCTTTG AATTGCAGGG CGACGGCGCG
CGGGCGCTGG CCGCACTCGA GGCCGGGCCG GTCATCCCCG AAATGCTGGG CGAGGCGGCG
TGGGTCGCGG GCGACGTTCA CCTTGCGAAC GGCGATCTGG CGGCGGCGCG CGAGGCCTAT
GACCGCGCGG TGCACGAGCT GCCGCGCAGC TCCGCGCTGT GGGTCGATGT CGCGCGCTTT
CGCGACGCCA GCGCCGATAT GCGCGGCGCG CGCGACGCGG TCGATTATGC GATTGAACTC
GACGCGGCGA ACAGCGCGGC GCTGGCGTAT AAGGCCAATC TGGTGCGGAG GGCCGAAGGG
CTAACCGCGG CGCTGGCGTG GTACGACCGG GCGCTTGCCG CCGATCCGGG CAATGCCGCG
GCGCTGATCG ATCAGGCGGC GACGCTCGGC GATCTTGGCC GCTACCGCGA CATGCTGACG
GCGCTGCGCC GTGCGGCGGT GCTCGTCCCG CGCGAGCCGC GGATCCATTA TCTGCAAGCC
GTGCTGGCGG CACGCGCGGC GAATTACCGG CTCGCGCGCA GTCTGCTCCA GCGCACGCGC
GGCGCGCTCG ATTCCGAGCC GGGGTTCATG CTGCTGAGCG CGGTGGTCGA GCTGGAGCTG
GGCGGCGAGG CGGTGGCGGC GAGCTGGGCC GAGCGACTGC TTGCCGAACA GCCGCATAAT
TTCGCCGCAC GGCGCCTGCT GGCGGCGGCC GAATGGGCGG GCGGCGATGC CGAGGCGGCG
CTCGCGGCGC TGCGTCCGCT CGTCGCGCGG CCCGACGCCG ACAGCTGGTC GCTGCTGCTC
GCCGCGCGCG CGGCGGCCGA ACAGGGACGC GATATCGAAT CGGCCGGCTA TCAGGCGCGC
GCCGCGACGC TGGACCGCGG CGAGGCGGTG CCCTTTGCCG TCGATGCCGA TTATGGCCTG
CTGACGATGG CCGCCGACGC CGCGCCGCTC GATCCGGCGA CCGTGATCCC GGCGATTTCG
GCGGATATGG CGCGCGGCAA CACGGCGCGC GCGATCGAGC GCGCGGTCCG GCTGCGCGAT
GCCAATCCGG GAGTCGCCGA CGCGCATATG CTGCTCGGCG ACGCGGCGCT CGCGGGCGGG
CGCTACGCGC TGGCGGTCGA GGCCTATCGC GCCGCCCGCA ACCTCGACGC GGGGGAGCGC
ACGACGTTGC GGCTCGCGAA TGCGCTCTAT CGCGCCGGCG ACGCGGCGGG GTCGGGCGCG
GCGATCATGG CGTTACGCGA CCGTCAGCCG TCGAGCGTCG CCGCCGATCG GATCGCGGGC
CATCTGGCGA TCGAGCTTGG GCACTGGGAC GCGGCGATCG CGCATTTCGA GCGCGTGGGC
AGCCGGATCG GCGATCGCGA CGCCGTGGTG CTGCGCGAAC TCGCACGCGC GTGGGCGGCA
AAGGGCGACG ATGCGCGCGC GCTGGTGCTG ATCGACCGCG CCTATCGGTT GCAGCCGCTG
AACGCGGGCA TCATGGAATT ATACGCGGCG CTGCTGGAGC GGCGCGGGAA GCGCCAGGCG
GCGGCGGATT TGCGCGACAA GGCGGCGCAG ATCGGGCGGT AG
 
Protein sequence
MGFRRVWTGA LLIAASLSAC GGASDTPRGE MEARRAALQQ AIADDPGAIA ERVALARVAI 
ALGDGVGAEA AVKGAIEAGA NDAALRPLLA RAFELQGDGA RALAALEAGP VIPEMLGEAA
WVAGDVHLAN GDLAAAREAY DRAVHELPRS SALWVDVARF RDASADMRGA RDAVDYAIEL
DAANSAALAY KANLVRRAEG LTAALAWYDR ALAADPGNAA ALIDQAATLG DLGRYRDMLT
ALRRAAVLVP REPRIHYLQA VLAARAANYR LARSLLQRTR GALDSEPGFM LLSAVVELEL
GGEAVAASWA ERLLAEQPHN FAARRLLAAA EWAGGDAEAA LAALRPLVAR PDADSWSLLL
AARAAAEQGR DIESAGYQAR AATLDRGEAV PFAVDADYGL LTMAADAAPL DPATVIPAIS
ADMARGNTAR AIERAVRLRD ANPGVADAHM LLGDAALAGG RYALAVEAYR AARNLDAGER
TTLRLANALY RAGDAAGSGA AIMALRDRQP SSVAADRIAG HLAIELGHWD AAIAHFERVG
SRIGDRDAVV LRELARAWAA KGDDARALVL IDRAYRLQPL NAGIMELYAA LLERRGKRQA
AADLRDKAAQ IGR