Gene Snas_5903 details


Gene Information

Locus tag: Snas_5903
Symbol:
ID: 8887119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 6262328
End bp: 6263638
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Deoxyribodipyrimidine photo-lyase
Protein accession: YP_003514624
Protein GI: 291303346
COG category:
COG ID:
TIGRFAM ID:
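
The coordinate and length fields above are consistent with one another, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive and that the CDS includes a single terminal stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


length = gene_length_bp(6262328, 6263638)
print(length)                     # 1311
print(protein_length_aa(length))  # 436
```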


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.696635
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACCTCA CCGGACCCCA CAGCTTCGGG CCCGGTGAGG TCCACAAACC CGGGACCGCG 
TCGATCGGCC CGGCCGCGGT CGTGTTCCAT CATGGCGTGG GGGAGACGAT GCGAACGATC
GTGGTGCTGT TCACCCGGGA CCTGCGGATA CGGGACAACC CGGCGTTGGC GCTGGCCTCC
CGCCACGCCG ACACGGTGGT GCCGTTGTAC GTCGCGGACG ATTCACCGCC GCTGCCACCG
AACCAGCGGC GGTTCCTGGT CGAGTCCCTC ACCGATCTGC GGGAGTCGTT GCGGCGGTTG
GGCGGTGACC TGCTGGTCCG GCGCGGCGAC CCGGTGGAGC AGACGCTGAA GCTGTGCCGA
CTCCTCGCGA CGGACGGCAT CGGCATGGCC GAGGACTACG GCCCCGCCGC ACGGCGGCTG
CGGCAGCGGC TGGCCGAGGC CGCTGAGGCC GAACGCGTCG GACTGCGGCT CTTCCCCGGT
GTCACCATTG TGGAACCGGG TGCGGTACGG CCGACCACCG GCGCCGACCA CTACAAGGTG
TTCACCCCGT ACCTGCGCTC CTGGAGCGCG ACACCCTGGC GACCCGAACA CGAAGCGCCC
CAGGCGATCC GACTACCTGC CGACGTCACC GGCGACGACC CCGCCTCGGT GATCGGCCCG
GTCGAGGGCG GCTCGTCCGA CGTCGTCGAC GGCGGCGAGA CGGCGGGGCT GCGCCGCTGG
GATTCCTGGA TCGACCGCGA GCCGGACTAC CCGGCGATCC ACGACGACCT GGCCGCCGAC
GACACCACCC GGCTGAGCGG GTACCTGCGG TTCGGTTGCG TGTCACCACT TGTCGTCGCC
GCCGACCCCC GCACCCCCGA GGCACTGGTC CGGCAGCTGT GCTGGCGGGA CTTCTACCAC
CAGGTGCTGC ACGGTTTCCC GCGACTGGCC ACCGACAACT ACCGGCCCGG CGCCCGCGAC
GCCTGGGTGG ACGACGAACC GGCGCTGCGG GCCTGGCAGG ACGGCGAGAC CGGCGTCCCG
CTCGTCGACG CCGGAATGCG GCAGCTGCGG ACCCAGGGCT GGATGCACAA CCGGGCCCGG
ATGGTCGCCG CGTCCTATCT GACCAAGGAT CTCGGCATCG ACTGGCGGCA CGGCGCGGCC
TGGTTCGACC GCTGGCTCGT CGACGCCGAC GTGGCCAACA ACTACGGCAA CTGGCAGTGG
ACGGCGGGCA CCGGCAACGA CTCCCGGCCG TACCGCAGGT TCAACCCCGC ACGGCAGGCG
CAGCGCTACG ACCCGCGGCA CGAATACCGG GATCGCTGGC TGCGCAACTG A
 
Protein sequence
MDLTGPHSFG PGEVHKPGTA SIGPAAVVFH HGVGETMRTI VVLFTRDLRI RDNPALALAS 
RHADTVVPLY VADDSPPLPP NQRRFLVESL TDLRESLRRL GGDLLVRRGD PVEQTLKLCR
LLATDGIGMA EDYGPAARRL RQRLAEAAEA ERVGLRLFPG VTIVEPGAVR PTTGADHYKV
FTPYLRSWSA TPWRPEHEAP QAIRLPADVT GDDPASVIGP VEGGSSDVVD GGETAGLRRW
DSWIDREPDY PAIHDDLAAD DTTRLSGYLR FGCVSPLVVA ADPRTPEALV RQLCWRDFYH
QVLHGFPRLA TDNYRPGARD AWVDDEPALR AWQDGETGVP LVDAGMRQLR TQGWMHNRAR
MVAASYLTKD LGIDWRHGAA WFDRWLVDAD VANNYGNWQW TAGTGNDSRP YRRFNPARQA
QRYDPRHEYR DRWLRN
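
The sequences above are printed in blocks of ten characters. A minimal sketch for joining such blocks and computing the GC percentage of a nucleotide string follows. The helper names are hypothetical, and the sample input is only the first three blocks of the gene sequence, not the full record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, used only as sample input.
sample = join_blocks("GTGGACCTCA CCGGACCCCA CAGCTTCGGG")
print(len(sample), round(gc_percent(sample), 1))
```

Applying `gc_percent` to the full joined gene sequence should give a value comparable to the GC content field, although the database may round differently.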