Gene Snas_1120 details


Gene Information

Locus tag: Snas_1120
Symbol:
ID: 8882305
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 1191725
End bp: 1192849
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: Epoxide hydrolase domain-containing protein
Protein accession: YP_003509923
Protein GI: 291298645
COG category:
COG ID:
TIGRFAM ID:
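
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for records like this one, but not stated here) and that the final codon is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1191725
end_bp = 1192849

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1125 374"
```

Both results match the Gene Length and Protein Length fields.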


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.636504
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.342191
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACAGA TCGAACCGTT CACCATCGAC ATCGCCCAGT CCGAACTCGA CGAACTGACC 
GCTCGGCTCG AGCACACCCG TTGGCCTGAC GAGCTTCCCG GCGTCGGCTG GTCCTACGGC
ACCGCGCTGG GCTACGTCCG CGACCTGGCC GGCCATTGGC GCGACGGTTT CGACTGGCGT
GCTCAGGAGG CTCGTCTCAA CGAGCTGCCC CAGTTCACCA CGAGGATCGA CGGGCAGACG
ATCCACTTCG TACACGTAAG GTCGCCGGAG CCGGACGCGT TGCCGCTGAT CCTCACCCAC
GGCTGGCCCA GCACCTTCGC GGACTTCGCC GCGATGGTGG GACCGTTGAC GAATCCCCGG
GCTCACGGGG GTGACCCCGC CGACGCCTTC GACGTGGTGA TCCCGTCGGT GCCGGGGTTC
GCGTTCTCCG GGCCGACCAC CGAGACCGGC TGGGACTGCC AGCGGGTGGC GGCGGCCTGG
GCGGAGTTGA TGCGCAGGTT GGGGTATGAC CGCTATGGCG TGCAGGGCAG CGATTTCGGG
GCCCTGGTGA CGCCGAGGCT GGCCCGGTCG CAGCCGGATC GGGTGGTGGG GATGCACCTC
AACGCGGTGC CCACCATGCC GCAGGTGGAT CCGTCCGAAA TGGATGACCT GAGTGCCGAG
GAGCGGGAGT ACTTCGCCGG GATGGATCAG TGGGAGGAGG TGTCGGGATA CGCGGTCGTG
CAGAGCACCC GTCCGCAGAC GCTGGCCTAC GCGTTGAGCG ATTCGCCGGT GGGGCAGTTG
GCCTGGTACG GCGACTGGTA CGCCGCGCAC GGCACCAAGG TCGGCGACCT GTCGCCGGAC
CGGATCCTCA CCAACGTCTC GCTGTTCTGG TTCACCCGCA CCGGAGGTTC GGCGATCCGG
TTGTACAAGG AGAGCGCGGC GGCCTGGGCC GAGCAGCCCG AACGGTCGGA GGTGCCGACC
GGTCTGACGT TCTTCAAGGG CGAGAACGGG GTCCGCCGTT TCGCGGAGCG GGAGTACCGC
GTCACGCACT GGACCCACCA CGACGCGGGC GGGCACTTCG CCGCCCTCGA AGTGCCCGAA
CTGCTGGCGG GCGACATCCG GACCTTCTTC CGAGAAGTTC GATGA
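
The GC content reported in the gene information can be recomputed from the gene sequence. The sketch below is one simple way to do it, assuming the sequence is pasted in as plain text in the space-separated blocks of ten shown above. The function name `gc_content` is hypothetical and not part of any database tooling.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to lay out the sequence.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First block of ten bases from the gene sequence above.
print(gc_content("ATGACACAGA"))  # prints 40.0
```

Passing the full 1125 bp sequence instead of the first block gives the value to compare against the listed GC content.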
 
Protein sequence
MTQIEPFTID IAQSELDELT ARLEHTRWPD ELPGVGWSYG TALGYVRDLA GHWRDGFDWR 
AQEARLNELP QFTTRIDGQT IHFVHVRSPE PDALPLILTH GWPSTFADFA AMVGPLTNPR
AHGGDPADAF DVVIPSVPGF AFSGPTTETG WDCQRVAAAW AELMRRLGYD RYGVQGSDFG
ALVTPRLARS QPDRVVGMHL NAVPTMPQVD PSEMDDLSAE EREYFAGMDQ WEEVSGYAVV
QSTRPQTLAY ALSDSPVGQL AWYGDWYAAH GTKVGDLSPD RILTNVSLFW FTRTGGSAIR
LYKESAAAWA EQPERSEVPT GLTFFKGENG VRRFAEREYR VTHWTHHDAG GHFAALEVPE
LLAGDIRTFF REVR
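
The protein sequence can be derived from the gene sequence by codon-by-codon translation. The sketch below uses only the Python standard library. It assumes the gene is read in frame from its first base and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code, differing only in which codons may act as starts. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated with each position in TCAG order.
# Table 11 shares these assignments with the standard code; "*" is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        # Note: raises KeyError on ambiguous bases such as N.
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGACACAGA TCGAACCGTT CACCATCGAC"))  # prints MTQIEPFTID
```

This gene begins with ATG, so the sketch does not need the table 11 rule that alternative start codons are read as methionine. A sequence starting with such a codon would need that case handled separately.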