Gene Snas_3353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_3353 
Symbol 
ID8884552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp3553446 
End bp3554927 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content69% 
IMG OID 
ProductPropeptide PepSY amd peptidase M4 
Protein accessionYP_003512112 
Protein GI291300834 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.765099 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATCC CTGCCAGCCC GTCGCCGGCC GCGCCCACGG CGCCCGCTGA CGCCCCGGTC 
TCACCCCCGC CGTCACGCCG CGCCGCCCGC CGATCGATCG TGCCGTTGCT GACGCGGCTG
CACTTCTACG CCGGAGTACT GGTCGCCCCG TTCCTGGCGA TCGCGGCGCT GAGCGGCCTG
GCCTTCGTCT TCTCGCCGCA ACTCGACGAC GTCGTCTACG CCGACGAGCT GTACGTCGAC
GACATCGGCG AGACCACCCA ACCGCTCGCC GACCAGGTCG CGGCGGCCCG CGAAGCGCAC
CCGGACGGCG ACCTGGCCAC GGTCATCCCA CCCGTCGAAC CCGACGAGAC CACGAAGGTC
GTGTTCTCAC TGCCGAAACT GGGCGAGAAG CAGCACACCG TGTACGTCGA CCCCTACGAC
AACAAGGTGA AAGGCACCTT GACAACCTGG TTCGGAGAAA CCCCACTCAT GACCTGGCTC
GACGACCTGC ACCGCAACCT GCACCTGGGC GCGCTGGGCC GCCACTACTC GGAACTGGCC
GCGTCCTGGC TGTGGGTGAT CGTGCTGGCA GGCGTATTCC TGTGGATCCG TCGGCAGTGG
ACCGGCCGCC GCAAACTGCG CCGCACCGTC CTGCCGGACA CCAACGCTGG CAAGGGAGTC
CGCCGCACCC GCAGCTGGCA CGCCGCCACC GGAATCTGGC TCGCCGTCGG CCTGCTGGCC
CTGTCCGCGA CCGGCCTGAC CTGGTCGCGA TACGCGGGCG GCAACTTCGA CATAGTCCAA
GAGCAGCTCA GCGCCCAACG CCCCGTCCTG GACACCACAC TCCCCGGGAC CGACACCGGG
GGAGAGGAAT CCGGCGGCGG CCACCACGGC TCCCACACCG GCAACAGCGG CGACGCGGCC
TACGACCCGT CCAACGTCGA CAACGTGGTG GAAGTCGCCC GAAAAGCCGG ACTGACCGGC
AAGATCGAAG TGACCCCACC CACCGAAGCG GGCACCGCCT GGACAGTCGC GCAGGACGAC
GCCACCTGGC CGGTCGGCTA CGACCAGATC GCCGTCGACG CCGACACCGC CACCGTGGTT
TCCCGCAACG ACTTCGCCGA CTGGCCCCTC CTGGCCCAGC TGTCCAAACT CGGCGTCGCA
TTCCACATGG GATTCCTCTT CGGACTCATC AACCAGATCC TACTCGCGGC CCTGGCGATC
GGCCTGTTGT GTGTGACCGT GTGGGGATAC CGGATGTGGT GGCAACGCCG CCCCACCCGC
ACTGACCGCA CCGCCCCCGT GGGCGCACCC CCGACCCGAG GCACCTGGCG CCAAGTCCAC
CCTGGAGCCT TCGCTGTCGG CATCGGCGTG GTCGTCTTCA CCTGCTGGGC CATGCCCGTC
CTGGGCGTCT CCCTGATCGC GTTCCTGCTC TTCGACGCGA TAGCCGGACT CGTCCGACGC
TCAACCGTCG ACGCACCGCG CCATACCGGC GACATCGTTT GA
 
Protein sequence
MSIPASPSPA APTAPADAPV SPPPSRRAAR RSIVPLLTRL HFYAGVLVAP FLAIAALSGL 
AFVFSPQLDD VVYADELYVD DIGETTQPLA DQVAAAREAH PDGDLATVIP PVEPDETTKV
VFSLPKLGEK QHTVYVDPYD NKVKGTLTTW FGETPLMTWL DDLHRNLHLG ALGRHYSELA
ASWLWVIVLA GVFLWIRRQW TGRRKLRRTV LPDTNAGKGV RRTRSWHAAT GIWLAVGLLA
LSATGLTWSR YAGGNFDIVQ EQLSAQRPVL DTTLPGTDTG GEESGGGHHG SHTGNSGDAA
YDPSNVDNVV EVARKAGLTG KIEVTPPTEA GTAWTVAQDD ATWPVGYDQI AVDADTATVV
SRNDFADWPL LAQLSKLGVA FHMGFLFGLI NQILLAALAI GLLCVTVWGY RMWWQRRPTR
TDRTAPVGAP PTRGTWRQVH PGAFAVGIGV VVFTCWAMPV LGVSLIAFLL FDAIAGLVRR
STVDAPRHTG DIV