Gene Snas_5120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5120 
Symbol 
ID8886328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5437362 
End bp5438648 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content63% 
IMG OID 
Productpeptidase M50 
Protein accessionYP_003513848 
Protein GI291302570 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.949013 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.451136 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTACG TCGTTGGGTT GGTGCTGTTC GCCCTGGGCA TCTTGATCTC GGTCAGCCTC 
CATGAGGCCG GCCATATGGG CACCGCGAAG ATGTTCGGGA TGCGGGTGAC GCGGTTCTTC
GTCGGGTTCG GTCCGACGAT GTTCTCGTTC CGTAAGGGGG AGACCGAATA CGGCGTGAAG
TGGATCCCGT TGGGCGGTTT CGTCAAGATC GCCGGGATGA CGCCGCAGGA GGAGGAAGAG
GACCAGACCC CTCCCGAGGA GCAGCACCGG GTTTTCTGGC GCAAACCGGT GTGGCAGCGC
ACGATCGTGC TCGCCGCCGG GTCGACGGTG CACTTCATCC TGGGGTTCCT GATCCTGTGG
ATCATGGTGT CGTTCGTCGC GGCCCCGAAC CCGGCGTTCG CCAACGAGAT CAACACTTCC
ACGAAGATCA CCGTCTCGGA CTGTCTCATC ACCGACGCCA GCCGGGCCGA GTGCTCGGAC
GAGGACCCCG AAGCGCCCGC CAAGACGGGC GGACTGAAAT CCGGGGACAC GCTCATCAAG
GTCGCGGGCA AGCAGGTCGC CGGTGAGGAG TGCCGGGTTC CCGGCACCAG CGAGCAGCTC
GACCCGACGT CGTGGTCGTG CGCCATCAAC GCGATCCGGG CGCTGCCCCC CGGCAAGGAA
GCCACCTTCA CGATCGAGCG CGACGGTAAG ACGCTGACCA AGAAGGTCGC GCCGAAGACG
GTGGAGATCA AGGGAACCGA CGGCAAGACC CAGGAGGTCA CCCAGGTCGG CATCTCGCAG
CAGAACCCCA CCGTCCCGGG CACCGTCACC TACGGACCTG TCGACGGCGT CGGTGCCGCG
GTCACCATGA CCGGTGACAT GGCGGTGAAG ATGGGCGAGG CGATGACTCG CATCCCCGAG
AAGATCCCGG CGCTGTGGAA CTCGATCTTC GGTGAGGAAC GCGACAAGGA CACTCCGGTG
AGTGTCGTTG GCGCCAGCCG ACTCGGTGGC GAGATGGTGG AGAACGACCT GTGGGAGATG
TTCTTCTACC TGCTCATCAC CCTGAACTTC TTCATCGGCG TTTTCAACAT GCTTCCGTTG
CTGCCCATGG ATGGCGGCCA TATCGCGATC GCGTGGTTCG AGAAGGTCCG ATCCTGGATC
GCCAAGAAGC GCAACAAACC CGATCCAGGG CGCGTCGATT ACATGAAACT GATGCCGCTG
ACGTATACCG TGTTGGCGAT CATGATCGGG TTCACCGTCC TGACGGTCAC CGCTGACATC
GTCAACCCGA TCACACTGTT CAATTAG
 
Protein sequence
MAYVVGLVLF ALGILISVSL HEAGHMGTAK MFGMRVTRFF VGFGPTMFSF RKGETEYGVK 
WIPLGGFVKI AGMTPQEEEE DQTPPEEQHR VFWRKPVWQR TIVLAAGSTV HFILGFLILW
IMVSFVAAPN PAFANEINTS TKITVSDCLI TDASRAECSD EDPEAPAKTG GLKSGDTLIK
VAGKQVAGEE CRVPGTSEQL DPTSWSCAIN AIRALPPGKE ATFTIERDGK TLTKKVAPKT
VEIKGTDGKT QEVTQVGISQ QNPTVPGTVT YGPVDGVGAA VTMTGDMAVK MGEAMTRIPE
KIPALWNSIF GEERDKDTPV SVVGASRLGG EMVENDLWEM FFYLLITLNF FIGVFNMLPL
LPMDGGHIAI AWFEKVRSWI AKKRNKPDPG RVDYMKLMPL TYTVLAIMIG FTVLTVTADI
VNPITLFN