Gene Snas_6121 details


Gene Information

Locus tag: Snas_6121
Symbol:
ID: 8887342
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 6482355
End bp: 6483563
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: Membrane dipeptidase
Protein accession: YP_003514838
Protein GI: 291303560
COG category:
COG ID:
TIGRFAM ID:
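
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon while the protein length does not. The function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information record.
START_BP = 6482355
END_BP = 6483563


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the results agree with the Gene Length and Protein Length fields of the record.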


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.99517
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCACGA CCGCATCGGC GCGCGCGGGC GCGCTCCTCG CCACCGCCCC CGTCATCGAC 
GGGCACAACG ACCTGCTGAT TCGCATGCGC GGCAAGGTCC GTTACGACTT CGACGCGATC
GACATCGCCG TCGACCAGAC CGAGCACGGC CTGCACACCG ACATCCCCCG GATGCGCGCC
GGCGGCATGG GCGGCCAGTT CTGGTCGGTG TTCGTCCCGG TCAGCCTCCA GGGCGAGGCG
GCGGTCACGG CCACATTGGA ACAGATCGAC GGCGCCCACG AGATGATCGG CCGCTACGAC
GATCTCGCCC TGGCCACCAC CGCCGACGAG ATCGACAAGG CCTTCTCCGA CGGCAGGATC
GCCTCGCTGC TGGGAGCCGA GGGCGGCCAC TCCATCGCCG ACTCGCTGGG CACGCTGCGG
ATGATGTACC GGCTCGGCGT CCGCTACATG ACCCTCACCC ACACCTCCAA CACCGCGTGG
GCCGACAGCG CCACCGACGC GCCCGTCGTC GGCGGCCTGA GCGAGTTCGG CCGCGAGGTG
GTGCGCGAGA TGAACCGCCT GGGCATGCTC GTCGACATCT CCCACGTCGC TCCCTCCACA
ATGCACGCCG CGCTCGACGT CAGCGAGGCG CCCGCGTTCT TCTCCCACTC CAACGCACTT
GCCTTGTGTT CCCACCCCCG CAACGTCCCC GACGACGTGC TGCGGCGCGT GAAGGACACC
CAGGGCATCG TCATGGCCAC CTTCGTGCCC GGCTTCCTCA ACGAGGCGTG CAAGGAGTGG
ATGGACGCGC TGGAGGCCTA CGACGACAAG GCCCAGCTCG CGGTCGCCGA GGACGCCAAC
GAGGCGGGCT ACGAGGAGCG CAAGGCCCGC CGGGAGGCCT GGTTCGCGGC GAACCCCTGC
CCCGGAGCGT CAGTGTCTGA TGTGGCCGAT CACATCGACC ACATCCGCGA GATCGCCGGG
GTCGACTGCG TCGGTATCGG CGGCGACATG GACGGCATCG GCGCCACCCC CGAACAGCTC
AACGACGTCA CCGGCTACCC CAACCTCATC GGCGAACTCG CCTCCCGGAG CTGGAGCGAC
GACGACCTGG CCAAACTGAC CCGCCGCAAC GTGATCCGGG TGCTGCGCGA GACCGAGCGG
GTCGCCCAGG TCGCCCGGCA GCAGCGCGGC CCGTCCAACA AGACCATCGA GCAGCTGGAC
GGGGCCTAG
 
Protein sequence
MSTTASARAG ALLATAPVID GHNDLLIRMR GKVRYDFDAI DIAVDQTEHG LHTDIPRMRA 
GGMGGQFWSV FVPVSLQGEA AVTATLEQID GAHEMIGRYD DLALATTADE IDKAFSDGRI
ASLLGAEGGH SIADSLGTLR MMYRLGVRYM TLTHTSNTAW ADSATDAPVV GGLSEFGREV
VREMNRLGML VDISHVAPST MHAALDVSEA PAFFSHSNAL ALCSHPRNVP DDVLRRVKDT
QGIVMATFVP GFLNEACKEW MDALEAYDDK AQLAVAEDAN EAGYEERKAR REAWFAANPC
PGASVSDVAD HIDHIREIAG VDCVGIGGDM DGIGATPEQL NDVTGYPNLI GELASRSWSD
DDLAKLTRRN VIRVLRETER VAQVARQQRG PSNKTIEQLD GA