Gene Snas_4666 details

Gene Information

Locus tag: Snas_4666
Symbol:
ID: 8885871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 4976197
End bp: 4977564
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Beta-Ala-His dipeptidase
Protein accession: YP_003513402
Protein GI: 291302124
COG category:
COG ID:
TIGRFAM ID:
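
The start, end, and length values in this record agree with one another if the coordinates are read as 1-based and inclusive, and if the final codon of the CDS is a stop codon. Both are assumptions, since the record does not state its conventions. A minimal Python sketch of that arithmetic, with illustrative function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4976197, 4977564)
print(length)                  # 1368
print(protein_length(length))  # 455
```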


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.976885
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00791318
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCTCTAT CTCGCTCTGA TCTTCGCTCG GCCGTCGACG CGGGTATGCC CACCGTGATC 
GAGGACCTGA AACAGCTGGC CCGCATACCT TCGGTGGCTT TCGAGGGCTT CGACCACTCG
CACGTGACGC GCAGCGCCGA GGCGGTGGCC GAGCTGCTGC GAGGCGCCGG CATGGACGAT
GTGCGCATTG TCACGGCCAA GGGCGCCTCC GGGCAGACCG GCCAGCCCGC CGTCATCGGA
CGCAAGGCCG CCCCGGCGGG CGCGCCGCAC GTGCTGCTGT ACGCGCACCA CGACGTGCAG
CCCGCCGGTG ACTACGACGA CTGGGAGCAG GACGACCCGT TCGAGCCGCA GCTGCGCGGC
GAGCGGTTGT TCGGGCGCGG CTGCGCCGAC GACAAGGCCG GGGTGATGGC GCACGTCGCC
GCGCTGCGGG CCTTCGGCGA CGACCTGCCG GTGGGTGTGA CGGTGTTCGT CGAGGGCGAG
GAGGAGTTCG GCTCCGACTC GCTGGAGAAC CTGATCACCG AGAACCGGGA CCTGCTGGCC
GCCGACGTGA TCGTCATCGC CGACTCGGCC AACTGGGACG TCGGGCACCC GGCGCTGACC
ACGTCGCTGC GCGGCGTCAT CAACGCCTAC GTGGAGGTGC GCACCCTCAA CCAGGCGGTG
CACTCGGGCA TGTTCGGCGG CGCGGTTCCC GACGCGCTGA CCGCCCTGTG CCGTCTGCTG
GGCACGCTGC ACGACGAGAC CGGCGACGTC GCGGTCGAGG GCCTGAAGAC GGGCACGGCG
GCGCCGCTGG ACCTGCCCGA GGAGCGGCTG CGCGCCGAGT CGGGGATGGT CGACGGGGTC
GAGTTCATCG GCTCGGGTCG GCTGGTCGAG CGACTGTGGA CGAAGCCGAC GGCGACGGTG
CTGGGGATCG ACGCGCCCGG CGCCCGCGAG TCGGCCAACG CGCTGCAGCC GTCGGCGCGC
GCCAAGATCA GCCTGCGGCT GGCTCCCGGC GACGAGTCGG TCTCCGCGTT CGAGGCCGTG
AAGCGGCACC TGGAGGCGCG GGTACCGTGG GGTGCGCAGC TGACCGTCAC CCTCGACCAC
GGCGGCAACC CGTGCCAGAT CGACGCGCGT GGCGAACGCT ACGACGCCGC CCGGGCCGCG
TTCGCCGAGG CGTGGGACGG GGTGGAGCCG GTCGACATGG GTGTCGGCGG CGCGATCCCG
TTCATCGCGA CGTTCCAGGA GCTGTTCCCG GACGCGGCGA TCCTGGTGAC CGGCGTCGAG
GACCCGGACT CGCGGGCCCA CGGCCCCAAC GAGAGCCTGC ACCTGGCGGA GTTCACGCGC
GCCTGCCGCG CCGAGGCGCT GCTGCTGCAC AACCTCAGTG AGCTTTAA
 
Protein sequence
MSLSRSDLRS AVDAGMPTVI EDLKQLARIP SVAFEGFDHS HVTRSAEAVA ELLRGAGMDD 
VRIVTAKGAS GQTGQPAVIG RKAAPAGAPH VLLYAHHDVQ PAGDYDDWEQ DDPFEPQLRG
ERLFGRGCAD DKAGVMAHVA ALRAFGDDLP VGVTVFVEGE EEFGSDSLEN LITENRDLLA
ADVIVIADSA NWDVGHPALT TSLRGVINAY VEVRTLNQAV HSGMFGGAVP DALTALCRLL
GTLHDETGDV AVEGLKTGTA APLDLPEERL RAESGMVDGV EFIGSGRLVE RLWTKPTATV
LGIDAPGARE SANALQPSAR AKISLRLAPG DESVSAFEAV KRHLEARVPW GAQLTVTLDH
GGNPCQIDAR GERYDAARAA FAEAWDGVEP VDMGVGGAIP FIATFQELFP DAAILVTGVE
DPDSRAHGPN ESLHLAEFTR ACRAEALLLH NLSEL
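
Both sequences above are printed in blocks of ten characters separated by spaces, so they need cleaning before any analysis. The following is a minimal Python sketch, not the database's own method, of how such a listing might be joined into one string and checked. The helper names are hypothetical, and only the first line of the gene sequence is pasted in for brevity. The stop codons tested (TAA, TAG, TGA) are those of translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First line of the gene sequence only; paste the full listing to
# compare the result against the GC content reported in the record.
first_line = clean_sequence(
    "ATGTCTCTAT CTCGCTCTGA TCTTCGCTCG GCCGTCGACG CGGGTATGCC CACCGTGATC"
)
print(len(first_line))          # 60
print(gc_fraction(first_line))
```

Run on the complete gene sequence, `ends_with_stop` gives a quick check that the listing was copied whole, since the sequence ends in TAA.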