Gene Sala_2370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2370 
Symbol 
ID4080810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2492752 
End bp2494644 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content66% 
IMG OID638010750 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_617412 
Protein GI103487851 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.211689 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACA TCGACAGCCG CCTCGACACG ACGCAAGCCA CCCCCATCGG CGTCACCACC 
GGGCCGATCC GCGGCAGCCG CAAGATCCAC GTCGCCAGCC CCACCGGCAG CGGCATCCGC
GTCGCGATGC GCGAGATCCT GCTCGAACCC TCATCGGGCG AACCGCCGGT GCGCGTCTAC
GACACCAGCG GGCCTTATAC GGACCCCGAT GCGACGATCG ACATCGCCCA GGGCCTCCCC
GAACTGCGGG CAAGCTGGAT CCGCGCCCGC GGCGACGTCG AAGAGGTGGC CCAGCGCGAG
GTTCGGCCGG AGGACAACGG CCAGCTCGGC CCCGACCGTT CGGGCGGCGT CCCCGCCTTC
CCCAATGTCC GCAAGAAAGT GCTGCGCGCC AAGCCCGGCG CGAACGTCAG CCAGATGCAC
TATGCCCGCC GCGGCATCAT CACGCCCGAG ATGGAATATG TCGCGACGCG CGAGAATCTC
GGCCGCGAAC GGCTCGCCGA ATATGTCCGC GACGGGCAGG ACTGGGGCGC CAGCATCCCC
GACTATGTCA CCCCCGAATT CGTCCGCGAC GAGGTCGCGC GCGGCCGCGC GATCATCCCC
AGCAACATCA ACCACCCCGA AAGCGAGCCG ATGGCGATCG GCCGCAACTT CCTCGTCAAG
ATCAACGCCA ATATCGGCAA CAGCGCGGTC GCCAGCGACG TCGCGGCCGA GGTCGACAAG
ATGGTCTGGT CGATCCGCTG GGGCGCCGAC ACCGTCATGG ACCTGTCGAC CGGGCGCAAC
ATCCACGACA CGCGCGAATG GATCCTCCGC AACTCGCCCG TCCCGATCGG CACCGTCCCC
ATCTATCAGG CGCTCGAAAA GGTCGGCGGC ATCGCCGAGG AACTGACGTG GGAAATCTTC
CGCGACACGC TGATCGAACA GGCCGAACAG GGCGTCGACT ATTTCACCAT CCATGCGGGT
GTCCGTCTGC CTTACGTGCC CCTCGCCGCG AAGCGCGTCA CCGGCATCGT CAGCCGCGGC
GGCAGCATCA TGGCGAAATG GTGCCTCGCG CATCACAGGG AAAGCTTCCT CTACGACCAT
TTCGACGAGA TTACCGAGAT CATGAAGGCC TATGACATCG CCTACAGCCT CGGCGACGGC
CTGCGCCCCG GCAGCATCGC CGACGCGAAT GACGAGGCGC AGTTTGCCGA GCTCTACACG
CTGGGCGAGC TTACCAGGCG CGCGTGGGCA CAGGATGTGC AGGTGATGAT CGAGGGACCG
GGGCATGTTC CCATGCACAA GATCAAGGAG AATATGGACA AGCAGCTCGA GGCCTGCGGC
GAGGCGCCCT TCTACACCCT GGGGCCGCTC ACCACCGACA TCGCGCCCGG CTACGACCAT
ATCACCAGCG GCATCGGCGC CGCGATGATC GGCTGGTACG GCACCGCGAT GCTTTGCTAC
GTCACGCCCA AGGAGCATCT GGGCCTGCCC GACCGCGACG ATGTGAAGGT CGGCGTCGTC
ACCTACAAGC TCGCCGCCCA CGCCGCCGAC CTCGCCAAGG GCCACCCCGC CGCGCAGGTC
CGCGACGATG CGCTGTCAAA GGCGCGCTTC GAGTTCCGCT GGCGCGACCA GTTCAACCTG
TCGCTCGACC CCGACACCGC CGAGCAATAT CACGACCAGA CCCTCCCCGC CGAGGGCGCC
AAGACCGCGC ATTTCTGCAG CATGTGCGGG CCGAAGTTCT GCTCGATGAA GATCAGCCAG
GAGGTGCGCG AGTTCGCGAA GCTGCAAAAT CAGGACAGCG CCGGTTTCAT CGCGGCAGAA
GAGGCCGAAA AGGGCATGGC GGAAATGAGT CAGGTCTATG AGGACACGGG GCGCGAGCTG
TATATGGGCG CTGGTGGGCG CGAGCATGAT TGA
 
Protein sequence
MADIDSRLDT TQATPIGVTT GPIRGSRKIH VASPTGSGIR VAMREILLEP SSGEPPVRVY 
DTSGPYTDPD ATIDIAQGLP ELRASWIRAR GDVEEVAQRE VRPEDNGQLG PDRSGGVPAF
PNVRKKVLRA KPGANVSQMH YARRGIITPE MEYVATRENL GRERLAEYVR DGQDWGASIP
DYVTPEFVRD EVARGRAIIP SNINHPESEP MAIGRNFLVK INANIGNSAV ASDVAAEVDK
MVWSIRWGAD TVMDLSTGRN IHDTREWILR NSPVPIGTVP IYQALEKVGG IAEELTWEIF
RDTLIEQAEQ GVDYFTIHAG VRLPYVPLAA KRVTGIVSRG GSIMAKWCLA HHRESFLYDH
FDEITEIMKA YDIAYSLGDG LRPGSIADAN DEAQFAELYT LGELTRRAWA QDVQVMIEGP
GHVPMHKIKE NMDKQLEACG EAPFYTLGPL TTDIAPGYDH ITSGIGAAMI GWYGTAMLCY
VTPKEHLGLP DRDDVKVGVV TYKLAAHAAD LAKGHPAAQV RDDALSKARF EFRWRDQFNL
SLDPDTAEQY HDQTLPAEGA KTAHFCSMCG PKFCSMKISQ EVREFAKLQN QDSAGFIAAE
EAEKGMAEMS QVYEDTGREL YMGAGGREHD