Gene Sala_1186 details

Gene Information

Locus tag: Sala_1186
Symbol:
ID: 4080834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1226160
End bp: 1227608
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 71%
IMG OID: 638009547
Product: tetratricopeptide TPR_2
Protein accession: YP_616235
Protein GI: 103486674
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
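
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Neither convention is stated on this page, and the variable names are made up for the example.

```python
# Values copied from the Gene Information fields above.
start_bp = 1226160
end_bp = 1227608
gene_length_bp = 1449
protein_length_aa = 482

# Assumption: start and end are 1-based and both inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under those assumptions all three checks agree with the listed values (1449 bp, 483 codons, 482 residues).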


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0407507
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGGAAC AGGCGATCGC CGCGGGCGAC CTGGATGCGG CGCGGCGCGC GGCGCAGCAG 
GTCTGGACGG GCGGTGACCA TCGCTTCGAC GCGCAGCTCG TCCTCCTGGT CGATGCGATG
CGGCGATCCG ACTGGAAGGC GGCGCGCGCC TATCTGGCGG CGCCGACCGA CAAGACCGGC
GCGAATACGG GCGCGCGGCT GATCGTGCCG ATCTTTCAGG CTTGGATCGA TGTCGGCGCG
CGCGCGCGGA GGCCCGAGCG CCACCTGATG GCGACCCCTG GCACAGGCGC AGAACCGGCG
TTGATGCTCC AGGTCGTGCA GGTGCAGGCG GCAACGCGCC GCGCGGGCGA GGCGGCGCGA
CTGGCGGACG AGATTGGTCT GAGCGACCGA CTCAGCCAGC TCGTCGCGTT GCGCGCCGCG
GCGACGCTCG ATCGCGCGGG CGAGGGCGCC GCCGCCGGCC GACTCCGTGC GCGGATCGCG
CTGGCGGCCG GTGAGCGCGA GGACCCGATG CTGCTGCTGC CCGATCAGCC GGTGACGACC
CCGCGCGCGG GAAGCGCGCA GTGGCTCGGC CTGCTCGCCG ACGGCCTGGC GCGCACGCCG
AACGCCAGCA CCAAATTGCC GTTGCTGTTC GCCCGCGCCG CGCATTGGCT GAACGACGAG
GATTGGGCGG TGCGCGCAAC GCTGGTCGAG GCACTGGCTC GCGACGGGCA GAATGGCGCG
GCCATGGCGC TGCTTGACGG CCTGCGAGGA AAGTTGCCCG CGGTGCTGGT CATGCGACAG
GCCGAACTGA TCGCAGACAG CGGCGATTTG GCGGCAGGCC TCGAACGCGC CGAGGCGGCC
GCGCGCAACG ATGCGCCGCG CATGTTGCTG GTGCGGCTTG CGGACCTTGC GCGGCGGTCG
GGCAGTGCGG CGGCCGCGGC GGCCGCTTAT GAGCGGCTGG AGGCCGCGCT GGGTGAGGAG
GACCGCGCGC TGCGCAGTTC GCTGTTGCTT GCTCGCGCCG AGTTGATGTT GCAGGCGGAC
CAGTGGGACG CAGCGGCGCC GCTGATCGAG CGCGCCGTGG CCTTGCAGCC CGACGATCCC
GCCGTGCTCA ATTTCGCGGG CTATTCGGCG CTCGAACGGC GCAAGGACAT GAAGCAGTCG
CTCGCGCGGA TCGAGGCGGC GTGGGCCAGG GCACCGCAGA ATGCGAGCAT CACCGACTCG
CTCGGATGGG CCTATTTCCT GATCGGGCGC ACCGACGAAG CGGTCGAATT GCTCGAACGA
GCACAGCGCG GCGAACCCGA CAATGCGGTG ATCGTCGAAC ATCTGGGCGA TGCTTATTGG
CAGGCGGGTC GCAAGTTCCA GGCGCGCTAT AACTGGCGCG CGGCAGCGCT GCTCGCCGAC
GCCGAGATGG CGACGCGGAT CGAGGCGAAG CTGCGCGACG GGCTGACCCC GGCAACGGTG
GCACCATGA
 
Protein sequence
MLEQAIAAGD LDAARRAAQQ VWTGGDHRFD AQLVLLVDAM RRSDWKAARA YLAAPTDKTG 
ANTGARLIVP IFQAWIDVGA RARRPERHLM ATPGTGAEPA LMLQVVQVQA ATRRAGEAAR
LADEIGLSDR LSQLVALRAA ATLDRAGEGA AAGRLRARIA LAAGEREDPM LLLPDQPVTT
PRAGSAQWLG LLADGLARTP NASTKLPLLF ARAAHWLNDE DWAVRATLVE ALARDGQNGA
AMALLDGLRG KLPAVLVMRQ AELIADSGDL AAGLERAEAA ARNDAPRMLL VRLADLARRS
GSAAAAAAAY ERLEAALGEE DRALRSSLLL ARAELMLQAD QWDAAAPLIE RAVALQPDDP
AVLNFAGYSA LERRKDMKQS LARIEAAWAR APQNASITDS LGWAYFLIGR TDEAVELLER
AQRGEPDNAV IVEHLGDAYW QAGRKFQARY NWRAAALLAD AEMATRIEAK LRDGLTPATV
AP
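
The sequence blocks above are printed in space-separated groups of ten, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method, of how the gene sequence could be joined, its GC fraction computed, and its translation compared with the listed protein. It assumes the amino-acid assignments of translation table 11 are the same as those of the standard genetic code (table 11 differs in its permitted start codons, which this sketch does not handle). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table built from the standard code in TCAG order.
# Assumption: table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a pasted sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Short excerpts from the gene sequence above: the first 30 bases and the last 9.
head = clean("ATGCTGGAAC AGGCGATCGC CGCGGGCGAC")
tail = clean("GCACCATGA")

print(translate(head))  # MLEQAIAAGD, the first ten residues of the protein
print(translate(tail))  # AP, the last two residues before the stop codon
```

Passing the full gene sequence block through `clean` and then `gc_fraction` or `translate` would allow the listed GC content and protein sequence to be checked in the same way.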