Gene Snas_4620 details


Gene Information

Locus tag: Snas_4620
Symbol:
ID: 8885825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 4925253
End bp: 4926518
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: peptidase S8 and S53 subtilisin kexin sedolisin
Protein accession: YP_003513356
Protein GI: 291302078
COG category:
COG ID:
TIGRFAM ID:
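
The start, end, gene length, and protein length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4925253
end_bp = 4926518
gene_length_bp = 1266
protein_length_aa = 421

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)
```

Under these assumptions the computed span equals the listed gene length, the span is a whole number of codons, and the codon count minus the stop codon equals the listed protein length.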


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0849966
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0030964
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGCCATC GCTTCATCGC TGCAATAGGA TCACTGGCTG TATTGACCGC GATGATAGTT 
GTCGCACCTA TGTCAGCCTT CGCCGACGAA CCTAGAAGTC AAGAGTGGAT TGTCGACGCA
CTACGTCTGA AGGATGTTCA CAAACTCTCC CAGGGTGAGG GTGTGACTGT GGGTGTCATT
GATTCTGGTG TCGATGCTGA CCATCCAGAC TTGGATGGGA ATGTCAAGGC GGGGAAGGAT
TTCGGCACTT CGAAGGATGA CGGCCTTAAT GACCATGATG GCCATGGAAC GAGCGTTTCA
AGTCTGATAG TCGGGCATGG GCATGGCGAT AGCAACACGG ATGGAATAAT TGGGATTGCG
CCCAAAGCGA AAGTGGTGTC CGTGGGCCTC CGTTGGGGAA AAGGCGAAGG TGAGAAGGTG
GGAGGCTACA TTGCCGGAGC CATCAAATGG CTGGTAGATC AGAAGGTCGA TATCATCTCA
ATCAGCATGT CGGGATACCC GGAAATATCG GACGCCGTAA AGTACGCCAT GGATAATGGC
GTGCCGCTAG TGGGTTCCGC TGGGAATACC GACAAGTATG CTGACGATCC TATTCTCGGG
CAAGTGAAGA ACAATACTAC GGGGTGGCCG GCAATGGACG CCGGCGCAAT TCCGGTTTCT
GGTACCACTC AGGACAATGA ATTCTGGGAG GGAAGCGTTC AGTTGTCAGA AGCCGCAGTG
CAACCGCAGT TTGGACTTTC GGCCCCAGCG ACGGAATTGG TAGCTGCAAC AAAAGGCGGT
GGCTATGGTA CTTTCTCCGG TACGTCGGGT TCGGCTCCTA TTGTGGCTGG CACGCTTGCG
ATCATCAAAT CGGCTTATCC CAACTTGGAT TATTCTTCCC TAGTTGACCG TTTGCTAGAT
ACAGTGGATG ATAAAGGCCC CAAGGGCTTT GACAACAAGT ATGGCTGGGG AATCGTTAAC
CCCTACAAGG CATTGACCGA AGAAACTGTC TATAAAGGTC CAACTGGCAA GGAGACGGTT
TCAGACCCTG CTGATCGGTT GCCGCTTGAC GAACAGGGCA AGGGACAAGG CCAGCAAGAC
AATGGCAACG GCTCCGGCAA CAGTGATGGC GCTTTGACCC CATCCGGGGC AAGTTCGCTA
CTCCTCCCGC TCTGCATCAC CGCAGCCGTC CTCGTGCTTG CCGCTGCGGT CGTCATCGCC
GTTGTTGTTG CCAAGCGTCG CGCCAAGACG AGGGGTGGCT CTCCGCAACC GCCCGGCGCC
GTGTGA
 
Protein sequence
MRHRFIAAIG SLAVLTAMIV VAPMSAFADE PRSQEWIVDA LRLKDVHKLS QGEGVTVGVI 
DSGVDADHPD LDGNVKAGKD FGTSKDDGLN DHDGHGTSVS SLIVGHGHGD SNTDGIIGIA
PKAKVVSVGL RWGKGEGEKV GGYIAGAIKW LVDQKVDIIS ISMSGYPEIS DAVKYAMDNG
VPLVGSAGNT DKYADDPILG QVKNNTTGWP AMDAGAIPVS GTTQDNEFWE GSVQLSEAAV
QPQFGLSAPA TELVAATKGG GYGTFSGTSG SAPIVAGTLA IIKSAYPNLD YSSLVDRLLD
TVDDKGPKGF DNKYGWGIVN PYKALTEETV YKGPTGKETV SDPADRLPLD EQGKGQGQQD
NGNGSGNSDG ALTPSGASSL LLPLCITAAV LVLAAAVVIA VVVAKRRAKT RGGSPQPPGA
V
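
The gene and protein sequences above are printed in blocks of ten characters, so they need whitespace stripped before use. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC fraction. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first line of the gene sequence is included, for brevity.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional TCAG ordering.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Trailing bases that do not fill a codon are ignored. Bases other
    than A, C, G, T raise KeyError in this sketch.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGCGCCATC GCTTCATCGC TGCAATAGGA TCACTGGCTG TATTGACCGC GATGATAGTT"
print(translate(first_line))  # MRHRFIAAIGSLAVLTAMIV
```

The printed peptide matches the first twenty residues of the protein sequence in this record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and GC content to be compared in the same way.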