Gene Snas_5065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5065 
Symbol 
ID8886272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5380125 
End bp5381213 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content70% 
IMG OID 
ProductEpoxide hydrolase domain-containing protein 
Protein accessionYP_003513794 
Protein GI291302516 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCG CCATCACCCC GTTCCGCGTC GACATCCCGC AAGCCGACCT CGACGACCTC 
GCCGACCGCC TCGACCGCGC CCGCTGGCCC CAGGAGCCCG CCGAATCCGG CTCCCACTAC
GGCATCCCAC GCTCCCGGGT CACAGCGCTG GCCGAGTACT GGCGCACCGA CTTCGACTGG
CGGGCCGTCG AAACCGCCCT CAACGCCCAC CCCCAGTTCA CCACCGAGAT CGACGGCCAG
AACATCCACT TCATCCACGT CCGCGCCGCC GACCCTGACG CGCTACCGCT CATCCTCACC
CACGGCTGGC CCGGCTCCGT CATCGAATTC CTCGACGTCA TCGAGCCGCT GTCCCGCGAC
TTCCAACTCG TCATCCCGTC CATCCCCGGC TTCGGCTTCT CCGGCCCGAC GAGCCAGCCC
GGCTGGGACG TCGCCCGCGT CGCCCGCGCC TGGGCCGAAC TGATGCGCCG CCTGGGCTAC
GAACGCTACG GAGCCCAAGG GGGAGACTGG GGTTCGGCGA TCTCCCGCGT CCTGGGCGAC
CTGGAACCCG AGCGGGTCGT CGGCGTTCAC CTCAACTACC TGACGATGCC GCCCCCACCC
GGCGGCCCCG GCGAACTGTC CGGTGAGGAC ACCCGTCGGC TCGACGGCGT CCGCGAATAC
CTCGCCAACC AGCCCGTACA GCGGACGGTC CACTCCATCG CCCCGCAACT GGTCGGCTAC
GGCCTCAACG ATTCCCCGAT CGGCCTGCTC GCCTGGCTGC TGGACCGCTT CGACGCCTGG
GCCGATCCGG CCTCGAAGCT GTCGCCCGAC AAGATCCTCG CTGACGTGAG TCTGTACTGG
CTCACCGGCA CGGCCGCCTC GGCACCCCGC ATCCACCGCG ACTCCCCGCC CGGCCCGCTG
CCGTGCCCGG TTCCGTTGGG GGTGGCCGTG TTCGCGCGCG AGATCACGCT GCCGATCCGC
TCCTTCGCCG AAGCCGTCTA CGACATCAGG CACTGGCGCG AGTACGACCA CGGCGGCCAC
TTCGCCGCGA TGGAGGTGCC GGAACTGTTC ACCGCGGACG TGCGGGAGTT CTTCGGTTCG
ATCCGCTAG
 
Protein sequence
MTTAITPFRV DIPQADLDDL ADRLDRARWP QEPAESGSHY GIPRSRVTAL AEYWRTDFDW 
RAVETALNAH PQFTTEIDGQ NIHFIHVRAA DPDALPLILT HGWPGSVIEF LDVIEPLSRD
FQLVIPSIPG FGFSGPTSQP GWDVARVARA WAELMRRLGY ERYGAQGGDW GSAISRVLGD
LEPERVVGVH LNYLTMPPPP GGPGELSGED TRRLDGVREY LANQPVQRTV HSIAPQLVGY
GLNDSPIGLL AWLLDRFDAW ADPASKLSPD KILADVSLYW LTGTAASAPR IHRDSPPGPL
PCPVPLGVAV FAREITLPIR SFAEAVYDIR HWREYDHGGH FAAMEVPELF TADVREFFGS
IR