Gene Snas_5038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5038 
Symbol 
ID8886245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5347450 
End bp5348844 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content69% 
IMG OID 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_003513768 
Protein GI291302490 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.332625 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.389719 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCATCG ATTGGCATCA ACTGCGCAAT CCCGGACTCG TCCCCGGGAG TCTCGACCCC 
GCCGCGGCCG TTGAGAGGGC CGCCCAACTC GGCCTCGACC GCTGGCGCGA TCTGCCCCGC
GACCAGATGC CCCCGTGGGA GGACTACTCC GAGGTCGAGG ACGTCTACGG CGTCCTGAAA
TCGGTTCCCC CGATCGTGGC GCCCTACGAG GTGGACGCGC TGCGCGACCA GCTCGCGCAG
GTGTGCGCCG GGAAGGCGTT CCTGTTGCAG GGCGGCGACT GCGCCGAGAC CTTCATCGAC
AACACCGAGG CCCACCTGCT GGGCCTGGCC CGCACCATCC TGCAGATGGC GGTCGTGCTG
ACGTACGGCG CCAGCATGCC GGTGGTCAAG GTGGGCCGGG TCGCCGGTCA GTACACCAAA
CCCCGCTCCA GCGCGACCGA CTCGCTGGGG CTGCCCAGCT ACCGCGGCGA CATGATCAAC
TCGCTGGAGA AGACCCCCGA GGCCCGCCGG GCCGACCCGC AGCGCATGAT CCGCGCCTAC
GCCAACGCGG CGGCGGCCAT GAACATGCTG CGCGCCTACC TGTCGGGCGG CATCGCCGAC
CTGCGGGCGG TGCACCACTG GAACAAGGAC TTCGTCCGGC AGTCCCCGGC CGGGGAACGC
TACGAGGCCA TCGGCCGCGA GATCGACCGG GCGCTGGCGT TCATGGACGC CTGCGGCGTC
GACGACGACG CCCTGCACAC CGTCACGATG TACGCCAGCC ACGAGGCACT GGCGCTGGAG
TACGACCGGG CGCTGACCCG CGTCAACAAC GACCGGGCCT TCGGGCTGTC GGGCCACTTC
CTGTGGGTCG GCGAACGGAC CCGCCGCCTC GACGGGGCCC ACATCGACTT CATCTCCCGG
CTGGCCAACC CGATCGGCGT CAAGATCGGC CCGTCCACGT CGCCGGACTG GGCGCTGGAA
GCCTGCGAGA AGCTCAACCC GGACAACATC CCCGGCAAGC TGACCCTGAT CTCCCGCATG
GGAAACCAGA AGATCCGCGA CGTGCTGCCC ACGATCGTGG CGAAGGTGCA CGCCGCCGGA
CGGCAGGTCA TCTGGCAGTG CGACCCGATG CACGGCAACA CCCACGAATC GTCCAACGGC
TACAAGACCC GCGACTTCGA CCGGGTCGTG GACGAGGTGC TGGGCTTCTT CGAGGTGCAC
CGCTCCACCG GCACCCACCC CGGCGGTATC CACATCGAAC TGACCGGTGA GGACGTGACC
GAGTGCGTCG GCGGCGCCCA GGCCCTGGAC GACAAGGACC TGGAGCAGCG CTACGAGACC
GCCTGTGACC CCAGGCTCAA CACCCAGCAG TCGCTGGAGC TGGCGTTCCT GGTCGCGGAG
ATGCTGCGGC ACTGA
 
Protein sequence
MTIDWHQLRN PGLVPGSLDP AAAVERAAQL GLDRWRDLPR DQMPPWEDYS EVEDVYGVLK 
SVPPIVAPYE VDALRDQLAQ VCAGKAFLLQ GGDCAETFID NTEAHLLGLA RTILQMAVVL
TYGASMPVVK VGRVAGQYTK PRSSATDSLG LPSYRGDMIN SLEKTPEARR ADPQRMIRAY
ANAAAAMNML RAYLSGGIAD LRAVHHWNKD FVRQSPAGER YEAIGREIDR ALAFMDACGV
DDDALHTVTM YASHEALALE YDRALTRVNN DRAFGLSGHF LWVGERTRRL DGAHIDFISR
LANPIGVKIG PSTSPDWALE ACEKLNPDNI PGKLTLISRM GNQKIRDVLP TIVAKVHAAG
RQVIWQCDPM HGNTHESSNG YKTRDFDRVV DEVLGFFEVH RSTGTHPGGI HIELTGEDVT
ECVGGAQALD DKDLEQRYET ACDPRLNTQQ SLELAFLVAE MLRH