Gene Snas_5684 details


Gene Information

Locus tag: Snas_5684
Symbol:
ID: 8886899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 6046982
End bp: 6048232
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: Dyp-type peroxidase family
Protein accession: YP_003514407
Protein GI: 291303129
COG category:
COG ID:
TIGRFAM ID:
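
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the gene length counts the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 6046982
end_bp = 6048232
gene_length_bp = 1251
protein_length_aa = 416

# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which does not
# contribute an amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(span_bp, codons, remainder, coding_codons)  # 1251 417 0 416
```

Under these assumptions the span equals the stated gene length, and 417 codons minus one stop codon gives the stated 416 aa.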


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.242991
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.629954
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACGGC GCAAACTCTT CTCGCTGGGG GCGGCCGGGG CGGCCGCCGT CGGCACCGGC 
TTCGCGGCCG GGCAGCTGGT ACAGGTCAGC CCGGCCGTGG CCGACGGCAA CGACGCCGCA
AACCCGGTGA GGTTCTACGG GAAACACCAG TCGGGCATCG CCACGCCCGC ACAGGATCGG
CTGCACTTCA CGGCCTTCGA CGTCATCACC TCCGATCGCG ACGAACTGAT CGGGCTGCTG
AAGGACTGGA CCGAGGCGGC GGCGCGCATG TGCGACGGCG ACTTCGCCGG GCCGGGCGGC
GCGGTGGGCG GCAACTACGA CTCGCCGCCG CAGGACACCG GTGAGGCGCA GGGACTGCCC
GCGTCGGGGC TGACCGTGAC CGTCGGGTTC GGACCGTCGC TGTTCGACAA GGACGGCGAG
GACCGGTTCG GCATCGGCGA CCGGCGTCCG GAACTGCTGG AGGAACTGCC GCTGTTCCGG
CGCGACGACA TCGACCCGAA GATCTCCGGC GGCGACATCT GCGTGCAGGC GTGCGCCAAC
GACCCGCAGG TCGCGGTGCA CGCGGTGCGC AACCTGGCCC GGATCGGCTT CGGCAAGGTC
AGCATCCGCT GGTCGCAGCT GGGTTTCGGG CGGACCGCGT CCACCTCGTC GACGCAGTCG
ACGCCGCGCA ACCTGATGGG CTTCAAGGAC GGCACCGCCA ACCTGAAGGC CGAGGAGACC
AAGGAGCTGA AGAAGCACCT GTGGGTCGGC GCCAAGGACG GTCCTGAGTG GATGGCGGGT
GGCTCGTACC TGGTGACCCG CAAGATCCGG ATGCAGGTCG AGACCTGGGA CCGCACCTCG
CTGTCGGAGC AGGAGACGAT CTTCGGCCGC GACAAGAGCG AAGGCGCGCC ACTGACGGGC
TCGAAGGAGT TCGATGAGCC CGACTTCAAG GCCAAGGCCG ACGACGAACT GGTCATCGGC
ACCGTCGCCC ACGTGCGGCT GGCGCACCCG TCGCAGCACG GCGGGGCGCG GATGCTGCGG
CGCGGCTACT CGTTTGTGGA CGGCTCGGAC GAGCTGGGGC ATCTGGACGC GGGACTGTTC
TTCATCGCGT ACCAGCGCGA TCCGTCCAAA GCGTTCATTC CGGTACAGAA GGCGCTGGCG
GACAACGACG TGCTGAACGA GTACATCAAG CACGTTTCCA GCGGAATCTT CGCGTGCCCG
GCCGGGGTCG AAGGCCCCGA GGACTACTGG GGACGGGCGC TGTTCGAGTA G
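
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation. As a rough sketch, the GC fraction could be computed as below; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the database rounds its GC content figure is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage on the first line of the gene sequence; paste the full
# sequence to compare against the GC content field.
first_line = "ATGGAACGGC GCAAACTCTT CTCGCTGGGG GCGGCCGGGG CGGCCGCCGT CGGCACCGGC"
print(f"{gc_fraction(first_line):.1%}")
```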
 
Protein sequence
MERRKLFSLG AAGAAAVGTG FAAGQLVQVS PAVADGNDAA NPVRFYGKHQ SGIATPAQDR 
LHFTAFDVIT SDRDELIGLL KDWTEAAARM CDGDFAGPGG AVGGNYDSPP QDTGEAQGLP
ASGLTVTVGF GPSLFDKDGE DRFGIGDRRP ELLEELPLFR RDDIDPKISG GDICVQACAN
DPQVAVHAVR NLARIGFGKV SIRWSQLGFG RTASTSSTQS TPRNLMGFKD GTANLKAEET
KELKKHLWVG AKDGPEWMAG GSYLVTRKIR MQVETWDRTS LSEQETIFGR DKSEGAPLTG
SKEFDEPDFK AKADDELVIG TVAHVRLAHP SQHGGARMLR RGYSFVDGSD ELGHLDAGLF
FIAYQRDPSK AFIPVQKALA DNDVLNEYIK HVSSGIFACP AGVEGPEDYW GRALFE
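
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact amino-acid string used for the standard genetic code, whose codon assignments translation table 11 shares (the two tables differ in permitted start codons). The function name `translate` is hypothetical, and the code assumes the sequence is given in reading frame from the start codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    dna = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGAACGGC GCAAACTCTT CTCGCTGGGG"))  # MERRKLFSLG
```

The output matches the first block of the protein sequence; passing the full gene sequence should, under the same assumptions, yield a protein that can be compared with the listed one after removing its spaces.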