Gene Snas_5121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5121 
Symbol 
ID8886329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5438685 
End bp5439902 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content66% 
IMG OID 
Product1-deoxy-D-xylulose 5-phosphate reductoisomerase 
Protein accessionYP_003513849 
Protein GI291302571 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.818961 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.304528 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCGACA ATGGCCGCGT GACCACTCGT GACGTCGTAC TGCTCGGTTC CACCGGTTCC 
ATCGGGACGC AGGCCATCGA GGTGGCCCAG GCGAACCCCG ACAAGTTGCG CATCGTGGGG
ATTGGGGCGC AGGGGTCCAA CCCCGGGTTG TTGGCCGAGC AGGCGCTCGG GCTCGGGGTG
GATGTGGTGG CCATTCATCG GTCTTGTGCC GCCCAGGAGT TGCAGCTGGC GTTTTACGCT
GTGGCGGAGA AGAACGGGTA TTCCAAGGGC GAGTATCGGT TGCCGAAGAT CATTGCCGGG
CCTACCGCTA TGGAGGAGCT GGCGGCTTGG CCTTGTGACA CCGTGTTGAA CGGGATTACG
GGGTCGATCG GGTTGGCGCC CACGTTGGCC GCGTTGAAGG CGGGGCGGAC GTTGGCGTTG
GCCAACAAGG AGTCGCTGGT CGCCGGTGGG GCGTTGGTGC GGGGGTTGGC CGAGCCGGGG
CAGATCGTGC CCGTGGACAG TGAGCATTCG GCGTTGGCCC AGTGTCTGTG GAGTGGGAAG
GCCGAGGAGG TGCGGCGGTT GGTCGTGACC GCCAGCGGGG GGCCGTTTCG GGGGAGGACT
CGGGACGAGT TGGCCGATGT GACGGTGGAG CAGGCGCTGG CGCACCCGAC GTGGGCCATG
GGGCCGGTGG TGACGATCAA CTCGGCCACG ATGGTGAACA AGGCGCTTGA GGTCATTGAG
GCCCATGAGT TGTACGGCAT CGGGTATGAC GACATCGCTG TGATGGTGCA TCCGACGTCG
GTGATTCATT CGATGGTGGA GTTCGTGGAC GGGTCGACGA TCGCGCAGGC GTCGCCGCCC
GACATGAAGT TGCCGATCGC GTTGGCGCTG GCGTGGCCGA GAAGGTTGGC GGGGGTGGCG
AAGGCCGTGG ACTGGACGCG GTCGCACACC TGGGAGTTCT TCCCGTTGGA TGACGAGGCG
TTTCCGGCGG TGAACCTGGC GCGGGAGGCG GGTAGGACGG GGCGGTGCCT GCCCGCGATC
TACAACGCGG CGAACGAGGA GTGCGTGGAC GCGTTCACTA AGGGTGAGCT GCCGTTCTTG
GGGATCGTCG ACACTGTGGC GGAGGTCCTA GCGGCCACGC CCGGATTTGA CGAACCAGGT
ACCGTCGATG ACGTGCTGGC GGCCGAGAAG TGGGCGCGAG ACACGGCGCG CGAACGGATC
GCGCGGGTGG CGAAGTGA
 
Protein sequence
MRDNGRVTTR DVVLLGSTGS IGTQAIEVAQ ANPDKLRIVG IGAQGSNPGL LAEQALGLGV 
DVVAIHRSCA AQELQLAFYA VAEKNGYSKG EYRLPKIIAG PTAMEELAAW PCDTVLNGIT
GSIGLAPTLA ALKAGRTLAL ANKESLVAGG ALVRGLAEPG QIVPVDSEHS ALAQCLWSGK
AEEVRRLVVT ASGGPFRGRT RDELADVTVE QALAHPTWAM GPVVTINSAT MVNKALEVIE
AHELYGIGYD DIAVMVHPTS VIHSMVEFVD GSTIAQASPP DMKLPIALAL AWPRRLAGVA
KAVDWTRSHT WEFFPLDDEA FPAVNLAREA GRTGRCLPAI YNAANEECVD AFTKGELPFL
GIVDTVAEVL AATPGFDEPG TVDDVLAAEK WARDTARERI ARVAK