Gene Sala_3102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_3102 
Symbol 
ID4082838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3249956 
End bp3251356 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content67% 
IMG OID638011488 
Producthypothetical protein 
Protein accessionYP_618139 
Protein GI103488578 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0806544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCGCC TTTCGCCTAT AGGGAGGCAT GACCGAAGCC GACAGGGGCC GGGCGCCGCC 
GCGCCGCATC ACCGCCCGAT GACCCCTTTT CCCTGGTCCG ACGTCGCGAT CATCGCGATT
CTCGTCCTTC TCAACGGCCT GTTCGCGATG TCCGAACTGG CGATCGTCTC GGCGCGGCAG
CCGCGGTTGC AGGCAGCCGA AAAGCGCGGC AGCCGCGGCG CGAAGATCGC GCGCCAGCTC
GCGTCCGACC CCGGCCGCTT CCTGTCGACG GTGCAGGTCG GCATCACGCT GATCGGGATT
CTCGCCGGCG CCTATTCGGG CGCCAGCCTG GGCGCGCCGG TCGCGGAGCG TTTGCAGGCC
TGGATAGGAC TCGACGACGA AACGGCGCTG ACCGCGGGCT TTGCCGTAGT CATCGCACTC
ACGACCTATG CCTCGCTGAT TGCTGGCGAG CTCGTGCCCA AGCAGTTTGC TTTGCGTGCG
CCCGAACCGA TCGCCATTTT CATCGCCTTG CCGATGCTGT GGCTGTCGAA AATCGGTGCG
CCGCTGGTGT GGCTGCTCGA CCGCAGCTCG GCGCTGGTAT TTCGCCTGCT CGGGCTGAGA
CGTGAATCGG AGGAGCGAGT GACCGCCGAG GAGCTGCACC TGATCGTCGC CGAAGCGTCG
AAATCGGGGG TGATCGAGGA AAGCGAGCGG GCGATCATTT CGGGCGTCGT GCGCCTCGCC
GACCGGCCGG TGCGCGAGGT GATGACGCCG CGCAAGGATG TCGACTGGAT CGACATTTCG
CTCGATGCGC GCGGCCTGCG CGACAGGCTG CTCGAAACGC CGCACAGCCG CCTGCCGGTC
GCGCGCGGGT CGGTCGACGA GATCGTCGGC GTGGTACAGG CGCGCGACAT CGCCGCGGCG
CTGTTCGCCG GGCAGACGCT GGACCTGGAA AAGCTGATGC GCCCCGCGAA GGTCATCCAC
GACCAGCTCG ACGCGATGGA CGCGCTCGAA GCCTTGCGCG CGGCCGAGGT GCCGATGCTG
CTGGTCCACG ACGAATATGG CCACTTCGAC GGGCTGGTGA CGCCCGCCGA TCTGCTTTCG
GCGATTGCGG GCGAATTTGC GTCGGACCAG GACATCGGCA GCGATCCCTA TGTGGTCGAG
CGCGACGACG GCAGCCTGCT GATCGCGGGA GCGATGCCCG CCGACCAGAT GGCCGAGCGG
CTGGGGATCG AATTGCCCGG TGACCGCGAC TATGCCACCG CCGCGGGCCA CGCGCTCGCG
GTGCTCAAGC ATTTGCCTGT GGAAGGCGAA AGCTTCACCG ACCGGGGCTG GAAGTTCGAG
ATCGTCGACA TGGACGGACG CAAGATCGAC AAGCTGCTCG TCAGCGACGT CCGCAAGCCG
AAGGGCGCCG AGGCCGAATA G
 
Protein sequence
MLRLSPIGRH DRSRQGPGAA APHHRPMTPF PWSDVAIIAI LVLLNGLFAM SELAIVSARQ 
PRLQAAEKRG SRGAKIARQL ASDPGRFLST VQVGITLIGI LAGAYSGASL GAPVAERLQA
WIGLDDETAL TAGFAVVIAL TTYASLIAGE LVPKQFALRA PEPIAIFIAL PMLWLSKIGA
PLVWLLDRSS ALVFRLLGLR RESEERVTAE ELHLIVAEAS KSGVIEESER AIISGVVRLA
DRPVREVMTP RKDVDWIDIS LDARGLRDRL LETPHSRLPV ARGSVDEIVG VVQARDIAAA
LFAGQTLDLE KLMRPAKVIH DQLDAMDALE ALRAAEVPML LVHDEYGHFD GLVTPADLLS
AIAGEFASDQ DIGSDPYVVE RDDGSLLIAG AMPADQMAER LGIELPGDRD YATAAGHALA
VLKHLPVEGE SFTDRGWKFE IVDMDGRKID KLLVSDVRKP KGAEAE