Gene Snas_5539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5539 
Symbol 
ID8886753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5886539 
End bp5887783 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content73% 
IMG OID 
Productexodeoxyribonuclease VII, large subunit 
Protein accessionYP_003514263 
Protein GI291302985 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.130406 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTCGC ACGTGACTCA GCCCGAACTC GCCGCGGCCA CCGGCCCCTC CACCCCCGAG 
TCACCCTGGC CGGTGCGGGT GGTCAGCCAC AAGATCGGCG AGTGGATCTC CCGGTTGGGC
GCGGTGTGGG CCGAGGGCCA GATCACCCAG ATCAGCCGCC GCCCCGGCGC CGGTTTCGTG
TTCCTGACGC TGCGCGACCC GGCCGCCGAG GTGAGCCTGA CGGTCGTGAC CACCCGAAAC
GTCGTGGACG CGTGTGATCC GCCACTGCGC GACGGCGCCC GGGTCATCGT GCACGGCAAA
CCCGACTGGT ACCCCGGCCG CGGCACCCTG TCGCTGCGGG CCACCGAGAT CCGCCAGGTC
GGCCTGGGCG AACTGCTGGC CCGGCTGGAG AAACTCAAGA AGCTACTGGC CGCCGAGGGC
CTGTTCGCCC CGGAACGCAA GCGCCCACTG CCGTTCCTGC CCCGCCGCAT CGGCCTCATC
ACCGGCCGCG CCTCAGCGGC TGAACGCGAC GTCCTGGAGA ACGCCAAGAC CCGGCTGCCC
TCGGCCGACT TCGAGGTCCG CGAAGTCCCG GTACAAGGCC CGCAGGCGGT GCCCAAGGTC
CTGGAGGCCC TGGCCGAACT GGACGCCGAC CCCGCCGTCG AGGTGATCAT CCTGGCGCGC
GGCGGCGGCA GCGTCGAGGA CCTGCTCCCG TTCTCCGACG AGACCCTGTG CCGCGCCGTC
TTCGCGGCCA AGACCCCGAT CGTGTCGGCG ATCGGCCACG AACCCGACAA CCCGCTCGTC
GACTTCGTCG CCGACGTCCG CTGCTCCACC CCCACCGACG CGGGCAAACG CGTCGTCCCG
GACTTCGCCG AGGAACGACG CGGCATCGAC CAGGCCCGCC ACCGGCTCCG CCAGGCCCTG
GGCGGCAAGA TCGACCGCGA ATCCCAGGCG CTGGCCGCCA TGCGTTCCCG CCCCTGCCTG
GCCCAACCGT CCCGCATCAT CGACGACCGC CAGGACGAGA TCACCCACGC CCGCGACCGG
ATCCGCCGCG CCTTCACGGC CCGGCTCGAC AAGGCCGCTC ACGACGTCAC CAGCCTGCGG
GGCCACCTGC GCGCCCTGTC CCCGCAGGGC ACCCTCGACC GCGGCTACGC CATCGTCCGC
CGCGCCGACG GCGCGGTCGT GCGCGCCGAC ACCGAAGTCA CCGCGTCGGA GGACCTGCGC
GTCCGCCTGG CCCGCGGCGA ACTGACCGTC ATCGTGAAGG AGTAG
 
Protein sequence
MLSHVTQPEL AAATGPSTPE SPWPVRVVSH KIGEWISRLG AVWAEGQITQ ISRRPGAGFV 
FLTLRDPAAE VSLTVVTTRN VVDACDPPLR DGARVIVHGK PDWYPGRGTL SLRATEIRQV
GLGELLARLE KLKKLLAAEG LFAPERKRPL PFLPRRIGLI TGRASAAERD VLENAKTRLP
SADFEVREVP VQGPQAVPKV LEALAELDAD PAVEVIILAR GGGSVEDLLP FSDETLCRAV
FAAKTPIVSA IGHEPDNPLV DFVADVRCST PTDAGKRVVP DFAEERRGID QARHRLRQAL
GGKIDRESQA LAAMRSRPCL AQPSRIIDDR QDEITHARDR IRRAFTARLD KAAHDVTSLR
GHLRALSPQG TLDRGYAIVR RADGAVVRAD TEVTASEDLR VRLARGELTV IVKE