Gene Sala_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2044 
Symbol 
ID4079941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2153741 
End bp2155585 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content70% 
IMG OID638010418 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_617086 
Protein GI103487525 
COG category[S] Function unknown 
COG ID[COG4805] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.153512 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCATCATC CGCTTTCCGT CCCCGTCAGC CGCCGCCACA CGCTCGCCGC CCTTGGCGCG 
GGCACCGCGG GCATTGCCCT CGCCGGCCCC GCCGCGGCGT TGCAGGACGC GCCCGGCAGC
GGCGCGCAGC TCCTGCTCGA TTCGGTCGCC GACAATCTGC TCGCCCACTC GCCCGAAGGC
GCAACCTCGC TCGGCATCGA CACCGGCGCG CGCGCGGCGA TGCGCGGCAA GCTGGCGGAC
CGTTCGGCCG CGGGGGTGGC GGCGCTCGCC GACACGCTCA AAGCCGACGT CGCGCGCGTC
CGCGCCTTCG ACAAGGCGGG GCTCGACCAT CAGGCGCGCA CCAGCCTCGC CGTCGTCGAA
AGCGCCTATG ACGTCGCGCT CGCGGGCTTT GCGCTGCCCT ATGGCGACGT CGCGGTCGGC
GGCTGGCGCA ACACCCCCTA TGTGGTGATC CAGAATGTCG GCGCCTATCT CGACATTCCG
AAGTTCCTCG ACAGCGACCA TCCGGTGAAG ACGTCGGCCG ACGCCGAGGC CTATCTCGCG
CGCCTCGGCG CCTTTCCCGG CGTGCTCGAC GGCGAGACCG AACGGATGAA GGCCGCGGCC
GCGCAGGGGC TGATCGCCCC CGCCTTCCTG ATCGACAAGG CGGTGGGGCA GATGGAGGCG
AGCCTCGCAG ATGCGAAGGC GGGCGGGTCG ATGGTCGAAA GCCTGAACCG CCGCGCGGCC
GCCGCCGAGC TGAACGGCGA CTGGGGCGCG CGCGCGGCGA AGATCGTGCA GGGTCCGGTC
GCCGCCGCGC TCGAACGCCA GCTCGCCGAG ATGAAGGCGC AGCGGCCGAA GGCGACGATG
GACGCGGGCC TGTGGGCGCG CCCCGGCGGC GACGAATGGT ATGCGTGGGG GCTGCGCGCC
TCGACCACCA CGCGCATGAC CCCCGACGAG ATTCACGAAA TGGGGCGGCA GGAGCTGGCC
GAACTCCACG GCCGCATGGA CCCGATCCTG AAAAAGCTGG GCTATACGCA AGGCAGCGTC
GGCGACCGGA TGAACGCGCT CGCCAGGGAT CCGCGGTACA AATTTCCCGA CAATGATGCG
GGCCGCGCCG AAATCCTCGC CTATATCCAG ACCTGGCTCG GCAAGATCCG CGCCGAACTG
CCGCGCGCCT TCCGCACCCT GGTGAAGGGC AATGTCGAGG TGAAGCGGCT GCCGCTCGCC
GAGGAACCCG GCGCGCCCGC CGCCTATGGC GGCGCGGGGT CGATCGACGG CAGCATTCCG
GGACGTTTCT GGATCAACCT GCGCACCACC GAACTGCACA GCAAATACTC CCTCCCCGAC
CTCACCATGC ACGAGGCGAT CCCCGGCCAC GCCTGGCAGG GCGAATATGC GCATTCGATG
CCGCTGATCC GCACGATGCT GGCGTTCAAC GCCTATTCGG AAGGCTGGGC GCTTTACGCC
GAACAGCTCG CCGACGAGCT GGGCCTCTAT GACGATTTCG AGGTCGGGCG CCTCGGCTAT
CTCCAGTCGC TCGCCTTCCG CGCCTGCCGC CTCGTCGTCG ACACCGGGCT GCACGCGAAA
CGCTGGACGC GCGAGCAGGG CGTGCGCTTT TTCGTCGAGG AAAATGGCTC GAACCCGCTT
GAGGTCGCGA GCGAGGTCGA CCGCTATTGC AGCTGGGCGG GACAGGCGTG CGGATACAAG
GTCGGGCACA GCGAGATCGT GCGGCAACGC GGGCTGGCGC AGGCGGCGCT GGGTGCCCGC
TATGACCTGC GCGATTTCAA CGACGTGGTG CTGAAGGGCG GCAACGTCCC GCTCGACGTG
CTCGCGCTGA ACGTCGCGGA ATATGTAGCC GGGGCGAAAG GGTAG
 
Protein sequence
MHHPLSVPVS RRHTLAALGA GTAGIALAGP AAALQDAPGS GAQLLLDSVA DNLLAHSPEG 
ATSLGIDTGA RAAMRGKLAD RSAAGVAALA DTLKADVARV RAFDKAGLDH QARTSLAVVE
SAYDVALAGF ALPYGDVAVG GWRNTPYVVI QNVGAYLDIP KFLDSDHPVK TSADAEAYLA
RLGAFPGVLD GETERMKAAA AQGLIAPAFL IDKAVGQMEA SLADAKAGGS MVESLNRRAA
AAELNGDWGA RAAKIVQGPV AAALERQLAE MKAQRPKATM DAGLWARPGG DEWYAWGLRA
STTTRMTPDE IHEMGRQELA ELHGRMDPIL KKLGYTQGSV GDRMNALARD PRYKFPDNDA
GRAEILAYIQ TWLGKIRAEL PRAFRTLVKG NVEVKRLPLA EEPGAPAAYG GAGSIDGSIP
GRFWINLRTT ELHSKYSLPD LTMHEAIPGH AWQGEYAHSM PLIRTMLAFN AYSEGWALYA
EQLADELGLY DDFEVGRLGY LQSLAFRACR LVVDTGLHAK RWTREQGVRF FVEENGSNPL
EVASEVDRYC SWAGQACGYK VGHSEIVRQR GLAQAALGAR YDLRDFNDVV LKGGNVPLDV
LALNVAEYVA GAKG