Gene Snas_0916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_0916 
Symbol 
ID8882100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp970391 
End bp971941 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content67% 
IMG OID 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_003509720 
Protein GI291298442 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.584665 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.344583 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCCC GCAGCAAGGT CTACGTCACC GGTTCCCGAC CCGACATCCG CGTCCCCTTC 
ACCGAGGTCG CGTTGTCCGA CGGTGAGCCA CCCGTCCGCC TCTACGACAC CTCCGGCCCC
GGCAGCGAAC CGGCCGACGG TCTGCCGCGA CTGCGCGCCG ACTGGATCGC CGAACGCGGC
GGCGGCGACC GCGTCACGCA ACTCCACTAC GCCCGCAACG GCGTCATCAC CCCCGAGATG
GAGTTCGCGG CGATCCGCGA GGGCCTGGAA CCCGAGTTCG TGCGCGCCGA GATCGCCGCC
GGACGCGCCA TCCTGCCCGC CAACATCAAC CACCCCGAGA CCGAGCCGAT GCTCATCGGC
AAGGAGTTCC TCGTCAAGAT CAACGCCAAC ATCGGCACCA GCGCCGTTTC GGACTCGATA
GCCGACGAGG TCGAGAAACT CACCTGGGCC ACCCGCTGGG GCGCCGACAC GGTGATGGAC
CTGTCCACCG GCAAACACAT CCACGCCACC CGGGAGCACA TCATCCGCAA CGCCGCGGTC
CCGATCGGCA CCGTCCCGAT GTACCAGGCG CTGGAGAAGG TCAACGGCGA TCCCGTCAAG
CTCACCTGGG AGCTGTACCG CGACACCGTG ATCGAGCAGG CCGAACAGGG CGTCGACTAC
ATGACCGTCC ACGCCGGGGT GCTGCTGCGG CACGTCCCGC TGGCGGCCGA TCGGGTCACC
GGCATCGTCT CGCGGGGCGG GTCCATCATG GCCGCCTGGT GTCTGGCCCA CCACACCGAG
ACCTTCCTGT ACACCCACTT CCGGGAACTG TGCGAGATCT TCGCCCGCTA CGACGTCGCG
TTCTCACTGG GCGACGGCCT GCGGCCGGGC AGCATCGCCG ACGCCAACGA CGCCGCCCAG
CTGGCCGAAC TGCGCACCCT CGGCGAACTG ACCACGATCG CCTGGGAGTA CGACGTCCAG
GTCATGATCG AGGGACCGGG CCACGTCCCG ATCCACAAGA TCAAGGAGAA CGTCGACCTG
CAACAGGAAT GGTGCCACGA GGCCCCGTTC TACACCCTCG GTCCACTCAC GACCGACGTC
GCGCCCGCCT ACGACCACAT CACCTCGGCG ATCGGCGCGG CCATGATCGG CACCTTCGGC
ACCGCGATGC TGTGTTACGT CACCCCGAAG GAACACCTCG GCCTGCCCAA CCGCGACGAC
GTGAAGGAGG GCGTCATCGC CTACAAGATC GCCGCCCACG CCGCCGATCT GGCCAAGGGA
CACCCCAGCG CCCAGGTCTG GGACGACGCG CTGTCCAAGG CCCGGTTCGA GTTCCGGTGG
GAGGACCAGT TCAACCTGTC GCTGGATCCG CAGCGGGCCC GCGAATACCA CGACGAGACC
CTTCCGGCCG AACCGGCGAA GACCGCGCAC TTCTGTTCCA TGTGCGGCCC GAAGTTCTGT
TCCATGCGGA TCTCGCACGA CCTGAAAGCT TACGCGGACA AGGGAATGAG CGAAAAATCC
CGCGAGTTCG TGGAAGCCGG CGGCAAGGTG TACCTGCCGG TCGTCGACTG A
 
Protein sequence
MSARSKVYVT GSRPDIRVPF TEVALSDGEP PVRLYDTSGP GSEPADGLPR LRADWIAERG 
GGDRVTQLHY ARNGVITPEM EFAAIREGLE PEFVRAEIAA GRAILPANIN HPETEPMLIG
KEFLVKINAN IGTSAVSDSI ADEVEKLTWA TRWGADTVMD LSTGKHIHAT REHIIRNAAV
PIGTVPMYQA LEKVNGDPVK LTWELYRDTV IEQAEQGVDY MTVHAGVLLR HVPLAADRVT
GIVSRGGSIM AAWCLAHHTE TFLYTHFREL CEIFARYDVA FSLGDGLRPG SIADANDAAQ
LAELRTLGEL TTIAWEYDVQ VMIEGPGHVP IHKIKENVDL QQEWCHEAPF YTLGPLTTDV
APAYDHITSA IGAAMIGTFG TAMLCYVTPK EHLGLPNRDD VKEGVIAYKI AAHAADLAKG
HPSAQVWDDA LSKARFEFRW EDQFNLSLDP QRAREYHDET LPAEPAKTAH FCSMCGPKFC
SMRISHDLKA YADKGMSEKS REFVEAGGKV YLPVVD