Gene Sala_2252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2252 
Symbol 
ID4080266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2364873 
End bp2367161 
Gene Length2289 bp 
Protein Length762 aa 
Translation table11 
GC content71% 
IMG OID638010630 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_617294 
Protein GI103487733 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGCA TTCGCGACAG GGTTGGAAAG CGCAAATCGC GGCTGCCGCA CGTCAGCCGC 
CGCCAGATGC TCGTCGGCGC GACCGCGGCG GGCGGCCTCG CGATCGCATG GGGCCTGTGG
CCGCGCGACT ACCAGCCGAA TCTCACCGCC GCACCGGGCG AGCATGTCTT CAACGCCTTT
CTGAAAATCG GCGACGACGG GCGGATCAGC GCGATCGTCC CGCAGTGCGA AATGGGTCAG
GGCGTCACCA CGCTGCTGCC ACAGATCATG GCCGACGAAC TGGGCGCCGA CTGGCGCACC
ATCGCGGTCG AAAGCGCGCC GATCAGTCCC CTTTATACCA ACACGCTGCT GGTCGACGAG
GACAGCGCGG TCTTTACGCC GCGCGCGCTC GTCCCCGATT TCGTGTCCGA CGTGCGCCGC
TGGGTGCGCC GCGAGTGGGC CGTGCGGAAC GCCGTGATGC TGACCGCCAA TTCCTCGTCG
GTGCGGATGT TCGAGGAACC GTGCCGCGCC GCCGCGGCGC AGGCGCGCGC ACTGCTGATG
ATGGCCGCCG CCGATCGCTG GGACGCCGAC TGGCAGGAGT GCGACACGCA GGACGGCTTT
GTCATCCACG GGCGCAAGCG TCTGCGCTTT GCCGAGGTCG CGGCTGCGGC GGCGACCTTC
GAACCGCCCG CCGAACCCAT GTACCGCGCG TCTGCGCCGG ACCCGCTCTA TGGCAAGGAG
CTGACGCGGC TCGACCTGCC CGCAAAGGTC GACGGGTCGG CCAATTATGC CGCCGACATC
CGCCTGCCCG ACATGGTGTT CGCCGCCATC CGCCAGGGAC CGCTCGGCGC GACGCGACTG
ACGGGCATCG ACCGCACGGC GGGACTCGCT TCGCCCGGCC TGCTCCATGT CGTGACGCAC
GAACGCTGGG TTGCCGCCGT TGCGCGCAAC TGGTGGGCGG CGAACCGCGC GCTCGACCGC
TTTGCGCCGG TGTTCGAAAC CCGGGGCACG CCGATCTCGA CCGATCGTAT CGACCGTGCC
CTCAAGGCCG CGCTGAAGGC CGACGGCTTT CGCATCGTCA GCGAAGGCGA CGTCGGCGAA
GCGATGGAGG GGCGTACGCG CATCGCCGCC GAATATGCCG TCGCGCCCGC ACTCCACGCC
CCCATCGAAA CGCGCAGCGC GACCGCCGCG CCCGATGGCC GCGGGCTGCG GCTCTGGGTC
GCGACGCAGG CGCCGACGCA GTGCCGCGAC GCGGTTGCGC GGGCCACGGG CTTGCCCGCC
GCCGACGTCA CGCTGTTCCC GATGATGGCG GGCGGCTCGT TCGACGCCTG CCTCGATCAC
AGCGTCGCGG TACAGGCGGC AATCATCGCG CTTGAAATCA AACGCCCGGT CCAGCTCGCC
TGGTCGCGGG CCGAGGAAAT CATGCGCCTG CCGCCGCGCG CGCCCGCGCG GGCGAAACTT
ACCGCAACGC TCAACGCCGC GGGCGGGATC GACGCGCTGG TGGCGCGGAT CGCCGTCCCC
TCGACCAACC ATGAGGTGCG CGCGCGACTG TTCGACAATA GCCCCGCCGA CGTCGCGCAG
CGCGCCGCCG CCGGCCGCGC CGATGCGGCG GCGGTCGAGG GCGCCGCGAA GCATTATGCC
ATCCCCAACC GGGCGGTCGA TCATTGCCCG GCGGACATCG GCCTGCCGAC GGGGCGCTGG
CGCGGCAATG CCGACAGCTA TACCGCTTTC TTTACCGAAT GTTTCGTCGA CGAAATGGCG
GTGCGCGCCG GGTCGGACCC GCTTTCCTAT CGCATGGCGA TGCTCGGCGA GGCGCCGCTG
CTCGCGCGCT GCCTGCTCAC CGCCACCAGC CTCGGTGGCT GGGAGGGCGG GCTGGCGGGC
AGCGCGCAGG GGCTGGCCTG CCACTCGATG CGCGGCAGCC ACATCGCGCT GATGGCGACC
GCGCGGCCAA GCGACCGGGG TTTGCAGGTC GAACAACTGG TCGCGGTCGT CGACGCGGGC
CGCCTCGTCA ATCCGGCGAT CGCGCGTCAA CAGGTCGAGG GGGGGCTGAT CTTTGGCCTC
GCCGCCGCGG TCGGCGCGAC GACCGATTAT CAGGGCGGGC TTGCCACCGC GCGCAAGCTC
GGCCAGCTCG GCCTGCCGCG CCTGTCTCAG ACGCCCGAAA TCCTCGTCGA GTTTGTCGAC
AGCGACCGCG AGCCCGGCGG GCTTGGCGAA ATCGGCGTGC CCGTCGTCGC CCCCGCGATC
GCCAACGCAA TGTTCGCCGC GACGGGCCGC CGCATTCGCC GCATCCCGCT GTCGGGGCAC
CCGCTATGA
 
Protein sequence
MAGIRDRVGK RKSRLPHVSR RQMLVGATAA GGLAIAWGLW PRDYQPNLTA APGEHVFNAF 
LKIGDDGRIS AIVPQCEMGQ GVTTLLPQIM ADELGADWRT IAVESAPISP LYTNTLLVDE
DSAVFTPRAL VPDFVSDVRR WVRREWAVRN AVMLTANSSS VRMFEEPCRA AAAQARALLM
MAAADRWDAD WQECDTQDGF VIHGRKRLRF AEVAAAAATF EPPAEPMYRA SAPDPLYGKE
LTRLDLPAKV DGSANYAADI RLPDMVFAAI RQGPLGATRL TGIDRTAGLA SPGLLHVVTH
ERWVAAVARN WWAANRALDR FAPVFETRGT PISTDRIDRA LKAALKADGF RIVSEGDVGE
AMEGRTRIAA EYAVAPALHA PIETRSATAA PDGRGLRLWV ATQAPTQCRD AVARATGLPA
ADVTLFPMMA GGSFDACLDH SVAVQAAIIA LEIKRPVQLA WSRAEEIMRL PPRAPARAKL
TATLNAAGGI DALVARIAVP STNHEVRARL FDNSPADVAQ RAAAGRADAA AVEGAAKHYA
IPNRAVDHCP ADIGLPTGRW RGNADSYTAF FTECFVDEMA VRAGSDPLSY RMAMLGEAPL
LARCLLTATS LGGWEGGLAG SAQGLACHSM RGSHIALMAT ARPSDRGLQV EQLVAVVDAG
RLVNPAIARQ QVEGGLIFGL AAAVGATTDY QGGLATARKL GQLGLPRLSQ TPEILVEFVD
SDREPGGLGE IGVPVVAPAI ANAMFAATGR RIRRIPLSGH PL