Gene Snas_3347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_3347 
Symbol 
ID8884546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp3546466 
End bp3547536 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content69% 
IMG OID 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_003512106 
Protein GI291300828 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.86564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCA CTGAAAACTC GCCGCAGACC AACGACCTGC GGGTCCTCGG GTTCCAGCCG 
CTCGTGCCGC CTTCGGTGCT GCGCGACGAA CTGCCGCTGG GATCGGACCG CGCGGCCTTG
GTGCAACGCG GCCGCGAAGC GGTCAAACAG GTACTGGACG GTGACGACGA CCGGCTGTTG
CTGGTCATCG GGCCGTGCTC GGTCCACGAT CCGGTCGCGG CGGTGGAGTA CGCGAACCGG
CTGGCCGAGG CGGCCGCTCC GCTCAGCGAC GTCCTTTGTG TTGTCATGCG GGTGTACTTC
GAGAAACCCC GCACCACGTT GGGGTGGAAG GGCCTGATCA ACGATCCGAA CCTGAACGGC
ACCTACGACG TGGAGCACGG GCTGCGGCTG GCGCGGCAGG TGCTTCTCGA CGTCCTGGAC
GTGGGACTGC CGGTGGGCTG CGAGTTCCTG GAGCCGACCA GCCCGCAGTA CATCGCCGAC
GCGGTCTCGT GGGGCGCGAT CGGCGCCCGC ACCCCGGAGA GTCAGGTGCA CCGGCAACTG
GCCTCGGGGC TCTCGATGCC GATCGGGTTC AAGAACGCCA CGGACGGCAA CGTGACCGCG
GCCATCGACG GTTGCCGGGC CGCCGCGGGC AGCCACGTCT TCTTCGGCAT CGACGAGCAG
GGGCGGGGCG CCATCGTGTC GACGAGCGGC AACCCGGACT GCCACGTCAT CCTGCGAGGC
GGACGCGGCG GACCCAACTA CCAGCCCGAG CCAGTGGGAG AGGCACTCGC GTTGCTCGGT
AAGGCGGGCA TGCCGGAGCG CCTGGTCATC GACGCCAGCC ACGCCAACAG CGGAAAGGAC
CACGTCCGGC AGGCCCAGGT GGTGCGGGAG GTCGCCGAAC GGATCGCGGC GGGAGAGAAC
GGCATCGCGG GCTTGATGGT GGAGAGCTTC CTCGTCGAGG GGCGCCAGGA ACCCGGTGCA
CTGGAGACAC TCAACTATGG GCAGAGCGTG ACCGACGCCT GCATCGGCTG GCAGGAGACG
GAAGCCCTAC TGACGGAACT GGCCACAGCG GTCCGTAAGC GCCACGGCTG A
 
Protein sequence
MTTTENSPQT NDLRVLGFQP LVPPSVLRDE LPLGSDRAAL VQRGREAVKQ VLDGDDDRLL 
LVIGPCSVHD PVAAVEYANR LAEAAAPLSD VLCVVMRVYF EKPRTTLGWK GLINDPNLNG
TYDVEHGLRL ARQVLLDVLD VGLPVGCEFL EPTSPQYIAD AVSWGAIGAR TPESQVHRQL
ASGLSMPIGF KNATDGNVTA AIDGCRAAAG SHVFFGIDEQ GRGAIVSTSG NPDCHVILRG
GRGGPNYQPE PVGEALALLG KAGMPERLVI DASHANSGKD HVRQAQVVRE VAERIAAGEN
GIAGLMVESF LVEGRQEPGA LETLNYGQSV TDACIGWQET EALLTELATA VRKRHG